Kaggle

Kaggle, a subsidiary of Google LLC, is an online community of data scientists and machine learning practitioners. Kaggle allows users to find and publish data sets, explore and build models in a web-based data-science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges.

Kaggle
TypeSubsidiary
IndustryData science
FoundedApril 2010
FounderAnthony Goldbloom, Ben
HeadquartersSan Francisco, United States
Key people
Anthony Goldbloom (CEO)
Ben Hamner (CTO)
Jeff Moser (Chief Architect)
ProductsCompetitions, Kaggle Kernels, Kaggle Datasets, Kaggle Learn
ParentGoogle
(2017–present)
Websitekaggle.com

Kaggle was first launched in 2010 by offering machine learning competitions and now also offers a public data platform,[1] a cloud-based workbench for data science, and Artificial Intelligence education. Its key personnel were Anthony Goldbloom and Jeremy Howard. Nicholas Gruen was the founding chair succeeded by Max Levchin. Equity was raised in 2011 valuing the company at $25.2 million. On 8 March 2017, Google announced that they were acquiring Kaggle.[2][3]

Kaggle community

In June 2017, Kaggle claimed it surpassed 1 million registered users[4] and, as of 2021, over 8 million.[5] The users come from 194 countries.[6]

By March 2017, the Two Sigma Investments fund was running a competition on Kaggle to code a trading algorithm.[7]

Competitions

  1. The competition host prepares the data and a description of the problem; the host may choose whether it's going to be rewarded with money or be unpaid.
  2. Participants experiment with different techniques and compete against each other to produce the best models. Work is shared publicly through Kaggle Kernels to achieve a better benchmark and to inspire new ideas. Submissions can be made through Kaggle Kernels, through manual upload or using the Kaggle API. For most competitions, submissions are scored immediately (based on their predictive accuracy relative to a hidden solution file) and summarized on a live leaderboard.
  3. After the deadline passes, the competition host pays the prize money in exchange for "a worldwide, perpetual, irrevocable and royalty-free license [...] to use the winning Entry", i.e. the algorithm, software and related intellectual property developed, which is "non-exclusive unless otherwise specified".[8]

Alongside its public competitions, Kaggle also offers private competitions limited to Kaggle's top participants. Kaggle offers a free tool for data science teachers to run academic machine-learning competitions.[9] Kaggle also hosts recruiting competitions in which data scientists compete for a chance to interview leading data science companies like Facebook, Winton Capital, and Walmart.

Competitions

Many machine-learning competitions have been run on Kaggle since the company was founded. Notable competitions include one improving gesture recognition for Microsoft Kinect,[10] making a football AI for Manchester City, and improving the search for the Higgs boson at CERN.[11]

Competitions have resulted in successful projects such as furthering HIV research,[12] chess ratings[13] and traffic forecasting.[14] Geoffrey Hinton and George Dahl used deep neural networks to win a competition hosted by Merck. Vlad Mnih (one of Hinton's students) used deep neural networks to win a competition hosted by Adzuna. This resulted in the technique being taken up by others in the Kaggle community. Tianqi Chen from the University of Washington also used Kaggle to show the power of XGBoost, which has since replaced Random Forest as one of the main methods used to win Kaggle competitions.

Several academic papers have been published on the basis of findings made in Kaggle competitions.[15] A contributor to this is the live leaderboard, which encourages participants to continue innovating beyond existing best practices.[16] The winning methods are frequently written on the Kaggle blog, Kaggle Winner's Blog.


Kaggle Progression System

Kaggle has implemented a progression system to recognize and reward users based on their contributions and achievements within the platform. This system consists of five tiers: Novice, Contributor, Expert, Master, and Grandmaster. Each tier is achieved by meeting specific criteria in competitions, kernels (code-sharing), and discussions.[17]

The highest and most prestigious tier, Kaggle Grandmaster, is awarded to users who demonstrate exceptional skills in data science and machine learning. Achieving this status is extremely challenging. As of April 4, 2023, out of 12 million Kaggle users, only 2,331 (about 1 out of every 5500 users) have reached the Master level.

Among these Masters, only 422 (approximately 1 out of every 5 Masters) have achieved the coveted Kaggle Grandmaster status.[18]

The other tiers in the progression system include:

  • 13 thousand Experts
  • 200 thousand Contributors
  • 12 million Novices.

The progression system serves to motivate users to continuously improve their skills and contribute to the Kaggle community.

Models

In February 2023, Kaggle introduced Models which allows users to discover and use pre-trained models through deep integrations with the rest of Kaggle’s platform.[19]

Financials

In March 2017, Fei-Fei Li, Chief Scientist at Google, announced that Google was acquiring Kaggle during her keynote at Google Next.[20]

See also

References

  1. "Google Previews Log Analytics Feature in Its Cloud Logging Service". InfoQ. Retrieved 2022-10-13.
  2. Lardinois, Frederic; Mannes, John; Lynley, Matthew (March 8, 2017). "Google is acquiring data science community Kaggle". Techcrunch. Archived from the original on March 9, 2017. Retrieved March 9, 2017. Sources tell us that Google is acquiring Kaggle [...] the official announcement could come as early as tomorrow.
  3. "Google buys Kaggle and its gaggle of AI geeks". CNET. 2017-03-08. Retrieved 2018-06-01.
  4. "We've passed 1 million members". Kaggle Winner's Blog. 2017-06-06. Retrieved 2018-08-19.
  5. "Unique Kaggle Users".
  6. Markoff, John (24 November 2012). "Scientists See Advances in Deep Learning, a Part of Artificial Intelligence". The New York Times. Retrieved 2018-08-19.
  7. Wigglesworth, Robin (March 8, 2017). "Hedge funds adopt novel methods to hunt down new tech talent". The Financial Times. United Kingdom. Retrieved October 29, 2017.
  8. Kaggle. "Terms and Conditions - Kaggle".
  9. Kaggle. "Kaggle in Class". Archived from the original on 2011-06-16. Retrieved 2011-08-12.
  10. Byrne, Ciara (December 12, 2011). "Kaggle launches competition to help Microsoft Kinect learn new gestures". VentureBeat. Retrieved 13 December 2011.
  11. "The machine learning community takes on the Higgs". Symmetry Magazine. July 15, 2014. Retrieved 14 January 2015.
  12. Carpenter, Jennifer (February 2011). "May the Best Analyst Win". Science Magazine. Vol. 331, no. 6018. pp. 698–699. doi:10.1126/science.331.6018.698. Retrieved 1 April 2011.
  13. Sonas, Jeff (20 February 2011). "The Deloitte/FIDE Chess Rating Challenge". Chessbase. Retrieved 3 May 2011.
  14. Foo, Fran (April 6, 2011). "Smartphones to predict NSW travel times?". The Australian. Retrieved 3 May 2011.
  15. "NIPS 2014 Workshop on High-energy Physics and Machine Learning". JMLR W&CP. Vol. 42.
  16. Athanasopoulos, George; Hyndman, Rob (2011). "The Value of Feedback in Forecasting Competitions" (PDF). International Journal of Forecasting. Vol. 27. pp. 845–849. Archived from the original (PDF) on 2019-02-16. Retrieved 2022-03-04.
  17. "Kaggle Progression System". Kaggle. Retrieved 2023-04-03.
  18. Carl McBride Ellis (2022-02-10). "Kaggle in Numbers". Kaggle. Retrieved 2023-04-03.
  19. "[Product Launch] Introducing Kaggle Models | Data Science and Machine Learning".
  20. "Welcome Kaggle to Google Cloud". Google Cloud Platform Blog. Retrieved 2018-08-19.

Further reading

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.