National Student Data Corps Video Library
Welcome to the NSDC Video Library where you can watch videos on data science topics based on IBM’s OpenDS4All curriculum, as well as student-created SQL and R educational materials, data science use cases, and more, presented by data enthusiasts from around the world. IBM transferred the management of the OpenDS4All GitHub repository to the NEBDHub in June 2023.
Introduction to Data Science
Data Fundamentals Flashcards
Data Science Ethics
Data Acquisition & Wrangling
Data Integration
Data Visualization & Modeling
Computer Programming
Machine Learning
Artificial Intelligence
Deep Learning
Data Science Use Case Scenarios
MasterClass Video Series
Introduction to Data Science
With Yucen Wang and Varalika Mahajan
Watch these videos to learn more about the difference between data science, data analytics, and data engineering. Become familiar with topics including data science models, knowledge graphs, and additional data science applications.
Data Fundamentals Flashcards
Learn about key STEM topics, tools, and techniques with short-form video playlists featuring data science students from the National Student Data Corps (NSDC).
Data Science Ethics
With Abhishek Sinha, Varalika Mahajan, and Rahulraj Singh
These videos provide a framework for the important topic of ethics in the collection and usage of data. Watch these videos to learn more about privacy, transparency, consent, explainability and fairness in data science, and walk through a use case using the Breast Cancer Wisconsin (Diagnostic) Data Set in Part 1 of the AI Explainability series.
Data Acquisition & Wrangling
With Varalika Mahajan, Renyin Zhang and Sanket Bhandari
Learn more about structured and unstructured data, and practice acquiring, extracting, cleaning, plotting and grouping data from a dataset with real-world examples along the way.
Data Integration
With Stephanie Guo, Lylybell Teran, and Varalika Mahajan
Familiarize yourself with the process of data integration, including breakdowns of the most common data quality issues, feature selection approaches, and partitional clustering and hierarchical clustering methods. Learn how to detect inconsistencies, find duplicates, and handle outliers within your dataset.
Data Visualization & Modeling
With Rahulraj Singh and Varalika Mahajan
Review how visual interfaces, knowledge graphs, and entity-relationship modeling can help analyze datasets and illustrate algorithmic performances, and practice your skills with a COVID Case Study.
Computer Programming
With Dashansh Prajapati, Hoang Luong and Gabriella Qi
Familiarize yourself with R, a programming language commonly used for statistical analysis. Throughout this video you will learn about RStudio’s Source Pane, Console Pane, Environment/History Pane, and more. Then, discover relational databases and relational database management systems (RDBMS) including MySQL. Learn more about common operators and practice your skills with examples along the way.
Machine Learning
with Lylybell Teran and Tomislav Galjanic
Master the basics of Supervised Machine Learning and Linear Regression with these video tutorials.
Artificial Intelligence
with Lylybell Teran and Sneha Dahiya
Watch these videos to learn how to solve machine learning and artificial intelligence problems with the use of artificial and convolutional neural networks.
Deep Learning
With Sanket Bhandari
Learn more about artificial intelligence through deep learning methods and models.
Data Science Use Cases – from the AI for Social Good Fall 2020 Symposium
In 2020, the Association for the Advancement of Artificial Intelligence’s AI for Social Good Fall Symposium featured student and researcher presentations on the role of AI can play in data science for social good initiatives.
Recent developments in the availability of big data and computational power are continuing to revolutionize several domains opening up new opportunities and challenges. In this symposium, we highlight two specific themes of humanitarian relief and healthcare where AI could be used for social good to achieve the United Nations (UN) sustainable development goals in those areas, which touch every aspect of human, social, and economic development. We expect the symposium to identify the critical needs and pathways for responsible AI solutions for achieving the sustainable goals, which demand holistic thinking on optimizing the trade-off between automation benefits and their potential side-effects.
Health Care Misinformation: An AI Challenge for Low-Resource Languages
Robust Lock-Down Optimization for COVID-19 Policy Guidance
Socioeconomic and Geographic Variations that Impact the Spread of Malaria
Asymptotic Cross-Entropy Weighting and Guided-Loss in Supervised Hierarchical Setting using Deep Attention Network
Clean Water: How the AI community can contribute to accessing water sources in developing countries
Measuring and Visualizing Social Distancing Using Deep Learning and 3D Computer Vision
Artificial Intelligence and Resource Allocation in Health Care: The Process-Outcome Divide in Perspectives on Moral Decision-Making
Two-Step Framework for Parkinson’s Disease Classification: Multiple One-Way ANOVA on Speech Features and Decision Trees
NSDC MasterClass Video Series
The NSDC MasterClass Video Series showcases experts who share how data science tools and techniques are being leveraged in various domains. Future episodes may highlight the intersection of data science and healthcare, finance, athletics, entertainment, technology, education, public policy, and more.
Check out the NSDC Educator Central and NSDC Learner Central for more data science resources.
Stay Connected with Us
Email us at nsdc@nebigdatahub.org with any inquiries or questions.
Some ways to stay connected with the NSDC community:
-
- Join our Slack channel
-
- Subscribe to the Northeast Hub YouTube channel
-
- Sign up for our NSDC mailing list
-
- Check out the REAL Program for more collaboration opportunities