[go: up one dir, main page]

0% found this document useful (0 votes)
436 views7 pages

Kaggle: Your Machine Learning and Data Science Community

Kaggle is an online platform for data science and machine learning competitions founded in 2010. It has over 500,000 competitors worldwide and combines crowd-sourcing and data transformation. Companies provide datasets and Kaggle handles competitions, with participants developing the best models to win prizes. This benefits companies by reducing data science costs and allows data scientists to build their skills and portfolio by competing on real-world problems. Kaggle generates revenue through licensing fees and consulting services.

Uploaded by

Ronak Joshi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
436 views7 pages

Kaggle: Your Machine Learning and Data Science Community

Kaggle is an online platform for data science and machine learning competitions founded in 2010. It has over 500,000 competitors worldwide and combines crowd-sourcing and data transformation. Companies provide datasets and Kaggle handles competitions, with participants developing the best models to win prizes. This benefits companies by reducing data science costs and allows data scientists to build their skills and portfolio by competing on real-world problems. Kaggle generates revenue through licensing fees and consulting services.

Uploaded by

Ronak Joshi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 7

Kaggle

Your Machine Learning and Data Science Community


About Kaggle
• Founded by Anthony Goldbloom and Ben Hammer in 2010
• Online platform for data-mining and predictive-modelling competition
• Biggest platform for competitive data science in the world
• Currently 500k + competitors
• Great platform to share and meet up with other data freaks
• Combines the two concept
• Crowd-sourcing
• Data-transformation
• CrowdAnalytix, InnoCentive, TunedIT, Codalab, DrivenData, CrowdAI are some of the
alternatives of Kaggle
• Click here to know what is Kaggle? - https://youtu.be/TNzDMOg_zsw
How It Works
• Kaggle prepares the data and a description of the problem for
companies that wish to create an algorithm or analytics model.
• Kaggle then offers
• Consulting service
• Frame the competition
• Submissions are scored immediately and summarized on a live leaderboard
• Anonymize data, & integrate the winning model into operations
• Participants experiment with different techniques and compete
against each other to produce the best analytic models
Customer Segments & Their Benefit
• Two interdependent customer segments
• Data Providers – Organizations and individuals that provide datasets for
competitions. They include everything from corporations to governments to
academics to journalists to hobbyists
• Data Solvers – Statisticians and data miners who develop models from the
datasets.
• Reduces the expensive cost of hiring a new data mining team
• Allows the participant to build on top of the incumbent best
performing model & give chance of winning the prize if he/she
submits a model that performs better than the existing model.
Value Proposition
• 3 primary value propositions
• Accessibility
• Convenience
• Brand / status
• Largest community of data scientists in the world, with over 600,000
“Kagglers” from over 194 countries
• 4,000 forum posts a month and over 3,500 competition submissions a day
• Held over 200 competitions since being founded, and the contests typically
draw more than a thousand teams
• Numerous academic papers have been written based on findings from the
competitions
Cost Drivers & Revenue Stream
• Value Driven Structure - provide a premium proposition through
significant personal service and frequent service enhancements
• Cost Drivers
• Research / development cost
• Customer support / operations and administration
• Revenue Stream
• Fee it charges customers to license its platform
• Fees it charges for problem setup consulting
Where it can be used?
• Machine Learning & Data Science models for various Hypothesis like
• Customer segmentation
• Marketing response
• Market Analysis
• Prediction of Employee Turnover
• Stock market prediction
• HR Analysis
• Sales Forecast
• Analysis of industrial labour accidents
• Production Line Performance
• New product forecasting

You might also like