News & Updates


NSDC Data Science Flashcards – Data Visualizations #7 – What is a Heat Map?

This NSDC Data Science Flashcards series will teach you about data visualizations, including scatterplots, histograms, and heat maps. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan and Sneha Dahiya. Recordings were done by Aditya Raj, Sneha Dahiya, Lauren Close, and Emily Rothenberg. You can […]


NSDC Data Science Flashcards – Data Visualizations #6 – What is a Scatterplot?

This NSDC Data Science Flashcards series will teach you about data visualizations, including scatterplots, histograms, and heat maps. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan and Sneha Dahiya. Recordings were done by Aditya Raj, Sneha Dahiya, Lauren Close, and Emily Rothenberg. You can […]


NSDC Data Science Flashcards – Data Visualizations #5 – What is a Line Graph?

This NSDC Data Science Flashcards series will teach you about data visualizations, including scatterplots, histograms, and heat maps. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan and Sneha Dahiya. Recordings were done by Aditya Raj, Sneha Dahiya, Lauren Close, and Emily Rothenberg. You can […]


NSDC Data Science Flashcards – Data Visualizations #4 – What is a Pie Chart?

This NSDC Data Science Flashcards series will teach you about data visualizations, including scatterplots, histograms, and heat maps. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan and Sneha Dahiya. Recordings were done by Aditya Raj, Sneha Dahiya, Lauren Close, and Emily Rothenberg. You can […]


NSDC Data Science Flashcards – Data Visualizations #3 – What is a Histogram?

This NSDC Data Science Flashcards series will teach you about data visualizations, including scatterplots, histograms, and heat maps. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan and Sneha Dahiya. Recordings were done by Aditya Raj, Sneha Dahiya, Lauren Close, and Emily Rothenberg. You can […]


NSDC Data Science Flashcards – Data Visualizations #2 – What is a Bar Chart?

This NSDC Data Science Flashcards series will teach you about data visualizations, including scatterplots, histograms, and heat maps. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan and Sneha Dahiya. Recordings were done by Aditya Raj, Sneha Dahiya, Lauren Close, and Emily Rothenberg. You can […]


NSDC Data Science Flashcards – Data Visualizations #1 – What are the Types of Data Visualization?

This NSDC Data Science Flashcards series will teach you about data visualizations, including scatterplots, histograms, and heat maps. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan and Sneha Dahiya. Recordings were done by Aditya Raj, Sneha Dahiya, Lauren Close, and Emily Rothenberg. You can […]


Teaching Responsible Data Science through Cybersecurity Analytics

Guest post by Dr. S. Jay Yang, Rochester Institute of Technology (RIT). This Success Story is a report on the results of one of the awards in the Northeast Big Data Innovation Hub’s 2021 Seed Fund program. There were three primary objectives for this project: 1) To compile cybersecurity datasets […]

NEBD Hub Logo

NSDC Data Science Flashcards – Time Series #6 – How Do You Evaluate Time Series Models?

This NSDC Data Science Flashcards series will teach you about time series analysis, including data preprocessing, decomposition, plots, and forecasting. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan. Recordings were done by Aditya Raj. You can find these videos on the NEBDHub YouTube channel. […]


NSDC Data Science Flashcards – Time Series #5 – What are Time Series Forecasting Methods?

This NSDC Data Science Flashcards series will teach you about time series analysis, including data preprocessing, decomposition, plots, and forecasting. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan. Recordings were done by Aditya Raj. You can find these videos on the NEBDHub YouTube channel. […]


NSDC Data Science Flashcards – Time Series #4 – What are Time Series Plots?

This NSDC Data Science Flashcards series will teach you about time series analysis, including data preprocessing, decomposition, plots, and forecasting. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan. Recordings were done by Aditya Raj. You can find these videos on the NEBDHub YouTube channel. […]


NSDC Data Science Flashcards – Time Series #3 – What is Time Series Decomposition?

This NSDC Data Science Flashcards series will teach you about time series analysis, including data preprocessing, decomposition, plots, and forecasting. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan. Recordings were done by Aditya Raj. You can find these videos on the NEBDHub YouTube channel. […]


NSDC Data Science Flashcards – Time Series #2 – What is Time Series Data Preprocessing?

This NSDC Data Science Flashcards series will teach you about time series analysis, including data preprocessing, decomposition, plots, and forecasting. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan. Recordings were done by Aditya Raj. You can find these videos on the NEBDHub YouTube channel. […]


NSDC Data Science Flashcards – Time Series #1 – What is Time Series Analysis?

This NSDC Data Science Flashcards series will teach you about time series analysis, including data preprocessing, decomposition, plots, and forecasting. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan. Recordings were done by Aditya Raj. You can find these videos on the NEBDHub YouTube channel. […]


Fall 2023 CIC Webinar Recap

The twenty-seventh session of the COVID Information Commons (CIC) webinar series took place on October 10, 2023. In this forum, leading COVID-19 scientists presented their current research on the global pandemic. Event moderators included Florence Hudson, Executive Director of the Northeast Big Data Innovation Hub at Columbia University and COVID Information Commons Principal Investigator (PI), Lauren […]


NSDC Data Science Flashcards – Types of Data Card #4 – What Is Interval and Ratio Data?

This NSDC Data Science Flashcards series will teach you about the different kinds of data, including how you can use them to strengthen your research. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan. Recordings were done by Sneha Dahiya. You can find these videos […]


NSDC Data Science Flashcards – Types of Data Card #3 – What is Nominal and Ordinal Data?

This NSDC Data Science Flashcards series will teach you about the different kinds of data, including how you can use them to strengthen your research. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan. Recordings were done by Sneha Dahiya. You can find these videos […]


NSDC Data Science Flashcards – Types of Data Card #2 – What is Qualitative and Quantitative Data?

This NSDC Data Science Flashcards series will teach you about the different kinds of data, including how you can use them to strengthen your research. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan. Recordings were done by Sneha Dahiya. You can find these videos […]


NSDC Data Science Flashcards – Types of Data Card #1 – What Are the Types of Data?

This NSDC Data Science Flashcards series will teach you about the different kinds of data, including how you can use them to strengthen your research. This installment of the NSDC Data Science Flashcards series was created by Varalika Mahajan. Recordings were done by Sneha Dahiya. You can find these videos […]


NSDC Data Science Flashcards – Data Science Ethics Card #6 – 5 V’s of Big Data

This NSDC Data Science Flashcards series will teach you about the importance of data ethics. This installment of the NSDC Data Science Flashcards series was created by Florence Hudson and Varalika Mahajan. Recordings were done by Lauren Close, Florence Hudson, and Emily Rothenberg. You can find these videos on the […]


NSDC Data Science Flashcards – Data Science Ethics Card #5 – Algorithmic Fairness

This NSDC Data Science Flashcards series will teach you about the importance of data ethics. This installment of the NSDC Data Science Flashcards series was created by Florence Hudson and Varalika Mahajan. Recordings were done by Lauren Close, Florence Hudson, and Emily Rothenberg. You can find these videos on the […]


NSDC Data Science Flashcards – Data Science Ethics Card #4 – FAIR Principles Part 2

This NSDC Data Science Flashcards series will teach you about the importance of data ethics. This installment of the NSDC Data Science Flashcards series was created by Florence Hudson and Varalika Mahajan. Recordings were done by Lauren Close, Florence Hudson, and Emily Rothenberg. You can find these videos on the […]


NSDC Data Science Flashcards – Data Science Ethics Card #3 – FAIR Principles Part 1

This NSDC Data Science Flashcards series will teach you about the importance of data ethics. This installment of the NSDC Data Science Flashcards series was created by Florence Hudson and Varalika Mahajan. Recordings were done by Lauren Close, Florence Hudson, and Emily Rothenberg. You can find these videos on the […]


NSDC Data Science Flashcards – Data Science Ethics Card #2 – Intellectual Property

This NSDC Data Science Flashcards series will teach you about the importance of data ethics. This installment of the NSDC Data Science Flashcards series was created by Florence Hudson and Varalika Mahajan. Recordings were done by Lauren Close, Florence Hudson, and Emily Rothenberg. You can find these videos on the […]


NSDC Data Science Flashcards – Data Science Ethics Card #1 – Privacy, Transparency, and Consent

This NSDC Data Science Flashcards series will teach you about the importance of data ethics. This installment of the NSDC Data Science Flashcards series was created by Florence Hudson and Varalika Mahajan. Recordings were done by Lauren Close, Florence Hudson, and Emily Rothenberg. You can find these videos on the […]


Improving Data Integrity Awareness in HPC Datasets using Sparsity Profiles

Guest post by Dr. Seung Woo Son, Associate Professor, University of Massachusetts, Lowell. This Success Story is a report on the results of one of the awards in the Northeast Big Data Innovation Hub’s 2021 Seed Fund program. As scientists conduct analyses that rely on large-scale simulations to achieve breakthroughs […]


NSDC Data Science Flashcards – Data Science Pipeline Card #5 – What is Visualization?

The NSDC Data Science Flashcards series will teach you how the data pipeline is developed for data science projects. This installment of the NSDC Data Science Flashcards series was created by Sneha Dahiya, a graduate student majoring in Business Analytics. You can find these videos on the NEBDHub YouTube channel. […]



NSDC Data Science Flashcards – Data Pipeline Card #4 – What is Data Mining?

The NSDC Data Science Flashcards series will teach you how the data pipeline is developed for data science projects. This installment of the NSDC Data Science Flashcards series was created by Emily Rothenberg, National Student Data Corps (NSDC) Program Manager. You can find these videos on the NEBDHub YouTube channel. […]


NSDC Data Science Flashcards – Data Pipeline Card #3 – What is Data Cleaning?

The NSDC Data Science Flashcards series will teach you how the data pipeline is developed for data science projects. This installment of the NSDC Data Science Flashcards series was created by Sneha Dahiya, a graduate student majoring in Business Analytics. You can find these videos on the NEBDHub YouTube channel. […]



NSDC Data Science Flashcards – Data Pipeline Card #2 – What is Data Acquisition?

The NSDC Data Science Flashcards series will teach you how the data pipeline is developed for data science projects. This flashcard was created by Emily Rothenberg, National Student Data Corps (NSDC) Program Manager. You can find the full NSDC Data Science Flashcards collection of videos on the NEBDHub YouTube channel. […]


NSDC Data Science Flashcards – Data Pipeline Card #1 – What is Bias and Ethics?

The NSDC Data Science Flashcards series will teach you how the data pipeline is developed for data science projects. This flashcard was created by Emily Rothenberg, National Student Data Corps (NSDC) Program Manager. You can find the full NSDC Data Science Flashcards collection of videos on the NEBDHub YouTube channel. […]



Water Data Forum Sessions

The Water Data Forum is a virtual webinar series presented by the Cleveland Water Alliance, Water Environment Federation, and Midwest Big Data Innovation Hub. These interactive web sessions engage a cross-sector panel of experts in an exploration of utility, private sector, and research approaches to collecting, managing, and measuring water […]



Summer 2023 CIC Webinar Recap

The fourteenth session of the COVID Information Commons (CIC) webinar series took place on July 26th, 2023. In this forum, leading COVID-19 scientists presented their current research on the global pandemic.  Event moderators included Florence Hudson, Executive Director of the Northeast Big Data Innovation Hub at Columbia University and COVID Information Commons Principal Investigator […]


Spring 2023 CIC Webinar Recap

The thirteenth in the series of COVID Information Commons (CIC) webinars, which began in 2020, took place on April 24th, 2023. In this forum, leading COVID-19 scientists presented their current research on the global pandemic. Event moderators included Florence Hudson, Executive Director of the Northeast Big Data Innovation Hub at Columbia University and COVID […]



January 2023 NEBDHub Inaugural Student Research Symposium

Written by: Femi Johnson. A recording of this event is available on the Northeast Big Data Hub’s YouTube channel. On January 27, 2023, the Northeast Big Data Innovation Hub (NEBDHub) held its 2023 Inaugural Student Research Symposium. This event highlighted the student-led research that was completed by undergraduate- and graduate-level […]


The Covid Information Commons & Columbia University Libraries – using translation & transcription to increase accessibility to NSF-funded research

By Lauren Close, Lylybell Teran, and Esther Jackson, with editorial support from Florence Hudson, Macy Moujabber, Isabella Graham-Martinez, and Jeremiah Mercurio. With thanks to Lara Azar, Elia Bregman, Brian Buckley, Cora Lee Cole, Victoria Horrocks, Saanya Subasinghe, Rhyley Vaughan, and Kathryn Pope. As our scholarly communication ecosystem becomes increasingly reliant on digital […]


January 2023 CIC Webinar Recap

The twelfth in the series of COVID Information Commons (CIC) webinars, which began in 2020, took place on January 31st, 2023. In this forum, leading COVID-19 scientists presented their current research on the global pandemic. Event moderators included Florence Hudson, Executive Director of the Northeast Big Data Innovation Hub at Columbia University and COVID […]


Poster promoting panelists for Data Science Career Panel, including Emily Javan, Gianmarco Gabrieli, and Dr. Kobi Abayomi

January 2023 National Student Data Corps Data Science Career Panel

Written by: Sumedh Datar. A recording of this event is available on the Northeast Big Data Hub’s YouTube channel. On January 13th, 2023, the National Student Data Corps (NSDC) hosted its 11th Data Science Panel, designed to showcase the various educational and professional opportunities in the field of data science. […]



Leveraging Data Science and Advanced Technologies. Conversation with Ms. Florence D. Hudson

WIT Virtual Voices is a series of webinars hosted by World Information Transfer that focuses on the SDGs to promote science-based news on health and the environment. This webinar, featuring NEBDHub Executive Director Florence Hudson, touches on the role of data science in developing advanced technologies.


October 2022 National Student Data Corps Data Science Career Panel

On October 19th, 2022, the National Student Data Corps (NSDC) hosted the first National Hispanic Heritage Month Data Science Panel. Florence Hudson, the Executive Director of the Northeast Big Data Innovation Hub (NEBDHub), began the event by welcoming the co-moderators, the panelists, and the attendees. The event was co-moderated by Raul Cosio and Isabella Graham Martinez. Raul is a retired IBM Vice President and has received awards from the Hispanic Engineer National Achievement Awards Corporation (HENAAC) and the Society of Hispanic Professional Engineers (SHPE). Isabella is a full-time student at Columbia University, works part-time at the NEBDHub as a Project Coordinator, and is a National Hispanic Scholar Awardee.


Forecasting Salinity in Rivers during Storm Events

Guest post by Dr. Laura Dietz, University of New Hampshire. This Success Story is a report on the results of the Northeast Big Data Innovation Hub’s 2020 Seed Fund program. Every winter, vast amounts of road salts are scattered onto streets across the northeastern U.S. While important for our safety, […]


Pala Students Win ‘Best New Team’ Award at 2022 DataJam

Supercomputers can complete tasks so impressive that they have outpaced science fiction movies. However, they do more than that: they can also inspire the next generation of scientists. One supercomputing center inspiring youth is the San Diego Supercomputer Center (SDSC) at the University of California San Diego, which provides […]



Curricular Structures to Blend Data Science & the Digital Humanities

Guest post by Dr. Amanda Greene, Lehigh University. This Success Story is a report on the results of the Northeast Big Data Innovation Hub’s 2020 Seed Fund program. The goal of this project was to develop pedagogical resources that integrate humanist perspectives, ethics, and data science by supporting a collaborative […]