Projects
Build the confidence to code on your own. Projects let you apply your skills using tools like Jupyter Notebook and complete a data analysis from start to finish—all in a risk-free environment.
Exploring the History of Lego
Use a variety of data manipulation techniques to explore different aspects of Lego's history!
Analyzing TV Data
Use data manipulation and visualization to explore one of two television datasets: the Super Bowl or the hit sitcom The Office!
Exploring the NYC Airbnb Market
Apply your data importing and cleaning skills from the Importing and Cleaning Data with R skill track to uncover insights about the Airbnb market in New York City.
Rise and Fall of Programming Languages
Analyze the relative popularity of programming languages over time based on Stack Overflow data.
Real-time Insights from Social Media Data
Learn to analyze Twitter data and do a deep dive into a hot trend.
Disney Movies and Box Office Success
Explore Disney movie data, then build a linear regression model to predict box office success.
Analyze International Debt Statistics
Write SQL queries to answer interesting questions about international debt using data from The World Bank.
Analyze Your Runkeeper Fitness Data
Import, clean, and analyze seven years' worth of training data tracked on the Runkeeper app.
Health Survey Data Analysis of BMI
Analyze health survey data to determine how BMI is associated with physical activity and smoking.
Gender Bias in Graduate Admissions
Analyze admissions data from UC Berkeley and find out if the university was biased against women.
A Text Analysis of Trump's Tweets
Apply text mining to Donald Trump's tweets to confirm if he writes the (angrier) Android half.
Classify Song Genres from Audio Data
Rock or rap? Apply machine learning methods in Python to classify songs into genres.
What Your Heart Rate Is Telling You
Examine the relationship between heart rate and heart disease using multiple logistic regression.
A New Era of Data Analysis in Baseball
Use MLB's Statcast data to compare New York Yankees sluggers Aaron Judge and Giancarlo Stanton.
A Network Analysis of Game of Thrones
Analyze the network of characters in Game of Thrones and how it changes over the course of the books.
Wrangling and Visualizing Musical Data
Wrangle and visualize musical data to find common chords and compare the styles of different artists.
Exploring the Kaggle Data Science Survey
Discover the top tools Kaggle participants use for data science and machine learning.
Dr. Semmelweis and the Discovery of Handwashing
Reanalyse the data behind one of the most important discoveries of modern medicine: handwashing.
Rasmus Bååth
Data Science Lead at castle.io
Introduction to DataCamp Projects
If you've never done a DataCamp project, this is the place to start!
Rasmus Bååth
Data Science Lead at castle.io
Phyllotaxis: Draw Flowers Using Mathematics
Use R to make art and create imaginary flowers inspired by nature.
Antonio Sánchez Chinchón
Data Scientist at Telefónica
Visualizing COVID-19
Visualize the rise of COVID-19 cases globally with ggplot2.
Richie Cotton
Curriculum Architect at DataCamp
Risk and Returns: The Sharpe Ratio
Use pandas to calculate and compare profitability and risk of different investments using the Sharpe Ratio.
Stefan Jansen
Founder & Lead Data Scientist at Applied Artificial Intelligence
Exploring the Bitcoin Cryptocurrency Market
Explore the market capitalization of Bitcoin and other cryptocurrencies.
Juan González-Vallinas
Director Data Science at multilayer.io
Name Game: Gender Prediction using Sound
Analyze the gender distribution of children's book writers and use sound to match names to gender.
Tufool Alnuaimi
Academic entrepreneur with a focus on data science
Exploring the Evolution of Linux
Find out about the evolution of the Linux operating system by exploring its version control system.
Markus Harrer
Software Development Analyst
Recreating John Snow's Ghost Map
Recreate John Snow's famous map of the 1854 cholera outbreak in London.
Radovan Kavicky
President and Principal Data Scientist at GapData Institute
Level Difficulty in Candy Crush Saga
Analyze data from the hit mobile game, Candy Crush Saga.
Rasmus Bååth
Data Science Lead at castle.io
The Hottest Topics in Machine Learning
Use Natural Language Processing on NIPS papers to uncover the trendiest topics in machine learning research.
Lars Hulstaert
Data Scientist at Microsoft
The GitHub History of the Scala Language
Find the true Scala experts by exploring its development history in Git and GitHub.
Anita Sarma
Associate Professor at Oregon State University
Visualizing Inequalities in Life Expectancy
Compare life expectancy across countries and genders with ggplot2.
Antonio Sánchez Chinchón
Data Scientist at Telefónica
Scout your Athletics Fantasy Team
Analyze athletics data to find new ways to scout and assess jumpers and throwers.
George Perry
Sports Scientist and Entrepreneur
Mobile Games A/B Testing with Cookie Cats
Analyze an A/B test from the popular mobile puzzle game, Cookie Cats.
Rasmus Bååth
Data Science Lead at castle.io
Naïve Bees: Image Loading and Processing
Load, transform, and understand images of honey bees and bumble bees in Python.
Peter Bull
Co-founder of DrivenData
Generating Keywords for Google Ads
Automatically generate keywords for a search engine marketing campaign using Python.
Elias Dabbas
Owner at The Media Supermarket
Naïve Bees: Predict Species from Images
Build a model that can automatically detect honey bees and bumble bees in images.
Peter Bull
Co-founder of DrivenData
Partnering to Protect You from Peril
Examine the network of connections among local health departments in the United States.
Jenine Harris
Associate Professor at Washington University in St. Louis
Explore 538's Halloween Candy Rankings
Get ready for Halloween by digging into a FiveThirtyEight dataset with all your favorite candy!
Nick Solomon
Data Scientist
Who's Tweeting? Trump or Trudeau?
Build a machine learning classifier that knows whether President Trump or Prime Minister Trudeau is tweeting!
Katharine Jarmul
Founder, kjamistan
Where Would You Open a Chipotle?
Create and explore interactive maps using Leaflet to determine where to open the next Chipotle.
Rich Majerus
Assistant Vice President at Colby College
Do Left-handed People Really Die Young?
Use pandas and Bayesian statistics to see if left-handed people actually die earlier than righties.
Madeleine Bonsma-Fisher
PhD Candidate at University of Toronto
Drunken Datetimes in Ames, Iowa
Apply your skills from "Working with Dates and Times in R" to breathalyzer data from Ames, Iowa.
Samantha Tyner
Postdoctoral Research Associate at Iowa State University
Predict Taxi Fares with Random Forests
Use regression trees and random forests to find places where New York taxi drivers earn the most.
Robert Grant
Founder & Data Sherpa at bayescamp.com
Which Debts Are Worth the Bank's Effort?
Play bank data scientist and use regression discontinuity to see which debts are worth collecting.
Howard Friedman
Adjunct Professor at Columbia University
ASL Recognition with Deep Learning
Build a convolutional neural network to classify images of letters from American Sign Language.
Alexis Cook
Machine Learning Educator at Kaggle
Where Are the Fishes?
Explore acoustic backscatter data to find fish in the U.S. Atlantic Ocean.
Erin LaBrecque
Instructor at DataCamp
Clustering Heart Disease Patient Data
Experiment with clustering algorithms to help doctors inform treatment for heart disease patients.
Megan Robertson
Data Scientist
Naïve Bees: Deep Learning with Images
Build a deep learning model that can automatically detect honey bees and bumble bees in images.
Emily Miller
Data Scientist at DrivenData
Predicting Credit Card Approvals
Build a machine learning model to predict if a credit card application will get approved.
Sayak Paul
Deep Learning Associate at PyImageSearch
Going Down to South Park: A Text Analysis
Analyze the dialog and IMDb ratings of 287 South Park episodes. Warning: contains explicit language.
Patrik Drhlík
Freelance Data Scientist
Degrees That Pay You Back
Explore the salary potential of college majors with a k-means cluster analysis.
Jaclyn Burge
Senior Data Consultant at The Walt Disney Company
Book Recommendations from Charles Darwin
Build a book recommendation system using NLP and the text of books like "On the Origin of Species."
Philippe Julien
Senior Data Scientist at King
Extract Stock Sentiment from News Headlines
Scrape news headlines for FB and TSLA, then apply sentiment analysis to generate investment insight.
Juan González-Vallinas
Director Data Science at multilayer.io
Data Science for Social Good: Crime Study
Use data science to catch criminals, plus find new ways to volunteer personal time for social good.
William Connell
PhD Student at University of California, San Francisco
Planning Public Policy in Argentina
Apply unsupervised learning techniques to help plan an education program in Argentina.
Rafael La Buonora
Data Scientist at Transforma Uruguay
Clustering Bustabit Gambling Behavior
Use cluster analysis to glean insights into cryptocurrency gambling behavior.
Eric Hare
Chief Data Scientist at Omni Analytics Group
Give Life: Predict Blood Donations
Build a binary classifier to predict if a blood donor is likely to donate again.
Dimitri Denisjonok
Python Backend Developer at Futrli
Find Movie Similarity from Plot Summaries
Use NLP and clustering on movie plot summaries from IMDb and Wikipedia to quantify movie similarity.
Anubhav Singh
Founder at The Code Foundation
The Impact of Climate Change on Birds
Predict the impact of climate change on bird distributions using spatial data and machine learning.
Laurens Geffert
Data Science Manager at Nielsen
Are You Ready for the Zombie Apocalypse?
Use your logistic regression skills to protect people from becoming zombies!
Jenine Harris
Associate Professor at Washington University in St. Louis
Trends in Maryland Crime Rates
Apply hierarchical and mixed-effect models to analyze Maryland crime rates.
Richard Erickson
Data Scientist
Comparing Cosmetics by Ingredients
Process ingredient lists for cosmetics on Sephora, then visualize similarity using t-SNE and Bokeh.
Jiwon Jeong
Graduate Research Assistant at Yonsei University
Kidney Stones and Simpson's Paradox
Use logistic regression to determine which treatment procedure is more effective for kidney stone removal.
Amy Yang
Senior Data Scientist at Uptake
What Makes a Pokémon Legendary?
Use tree-based machine learning methods to identify the characteristics of legendary Pokémon.
Joshua Feldman
Decision Scientist at Facebook
Modeling the Volatility of US Bond Yields
Discover how US bond yields behave using descriptive statistics and advanced modeling.
József Soltész
Manager at KPMG
Importing and Cleaning Data
Apply your importing and data cleaning skills to real-world soccer data.
Erin LaBrecque
Instructor at DataCamp
Text Mining America's Toughest Game Show
Use text mining to analyze Jeopardy! data.
Alexis Lee
Intern at DataCamp
What and Where Are the World's Oldest Businesses?
Use SQL data manipulation and joins to discover the oldest businesses around the world.
Richie Cotton
Curriculum Architect at DataCamp
Streamlining Employee Data
Use DataFrames to read and merge employee data from different sources.
Hadrien Lacroix
Curriculum Manager at DataCamp
Writing Functions for Product Analysis
Use coding best practices and functions to improve a script!
Lis Sulmont
Head of Curriculum Expansion at DataCamp
Word Frequency in Classic Novels
Use web scraping and NLP to find the most frequent words in one of two pieces of classic literature: Herman Melville's novel, Moby Dick, or Peter Pan by J. M. Barrie.
Hugo Bowne-Anderson
Data Scientist at DataCamp
Bad Passwords and the NIST Guidelines
Check which passwords fail to conform to the National Institute of Standards and Technology password guidelines.
Rasmus Bååth
Data Science Lead at castle.io
Who Is Drunk and When in Ames, Iowa?
Flex your data manipulation muscles on breath alcohol test data from Ames, Iowa, USA.
Samantha Tyner
Postdoctoral Research Associate at Iowa State University
A Visual History of Nobel Prize Winners
Explore a dataset from Kaggle containing a century's worth of Nobel Laureates. Who won? Who got snubbed?
Rasmus Bååth
Data Science Lead at castle.io
Reducing Traffic Mortality in the USA
How can we find a good strategy for reducing traffic-related deaths?
Joel Östblom
PhD Candidate at University of Toronto
Functions for Food Price Forecasts
Write functions to forecast time series of food prices in Rwanda.
Richie Cotton
Curriculum Architect at DataCamp
Comparing Search Interest with Google Trends
Manipulate and plot time series data from Google Trends to analyze changes in search interest over time.
David Venturi
Data Science Educator
The Android App Market on Google Play
Load, clean, and visualize scraped Google Play Store data to gain insights into the Android app market.
Lavanya Gupta
Machine Learning Engineer at PropTiger.com
TV, Halftime Shows, and the Big Game
Load, clean, and explore Super Bowl data in the age of soaring ad costs and flashy halftime shows.
Erin LaBrecque
Instructor at DataCamp
Investigating Netflix Movies and Guest Stars in The Office
Apply the foundational Python skills you learned in Introduction to Python and Intermediate Python by manipulating and visualizing movie and TV data.
Justin Saddlemyer
Instructor