Success Stories

The Northeast Hub is a community convener, collaboration hub, and catalyst for data science innovation in the Northeast Region. The Hub amplifies successes of the community, and shares credit across the community to encourage collaboration and mutual success in data science endeavors.

Success stories highlight community activities which have accomplished significant project goals. These include outcomes that highlight the value and insights delivered from project activities, resources that can be leveraged by the community for broader impact, and requests for collaborations that could perhaps lead to new collaboration, insights and publications.


Using a data-driven approach to study health disparities and secular trends in the chemical and individual exposome in the NHANES (National Health and Nutrition Examination Surveys)

Guest post by Chirag Patel, Harvard Medical School This Success Story is a report on the results of the Northeast Big Data Innovation Hub’s 2021 Seed Fund program. This research project considered the health challenges posed by environmental hazards across the U.S., with a particular focus on the health disparities […]

NEBD Hub Logo

Gaugler

DEFLAB: Data Education and Feminism at Lafayette and Beyond

Guest post by Dr. Trent Gaugler, Lafayette College This Success Story is a report on the results of the Northeast Big Data Innovation Hub’s 2020 Seed Fund program. The goals of this Seed Fund project were to introduce students to the fundamentals of data science through socially relevant projects, to […]


Image of computer screen with code as seen through glasses lying on a table

Data Science in General Education

Guest post by Dr. Cathie LeBlanc, Plymouth State University This Success Story is a report on the results of the Northeast Big Data Innovation Hub’s 2020 Seed Fund program. Plymouth State University has a history of active faculty learning communities focused on various aspects of teaching. Our latest learning community, funded […]


3D printed visualization of a line graph

Data Visualization Beyond the Screen

Guest post by Dr. Sara Stoudt, Lecturer in Statistical and Data Sciences Program, Smith College We’ve all been acutely aware of screens lately; we cannot seem to escape them. As statisticians and data scientists we’re stereotypically buried in our spreadsheets and charts anyway, but who do we exclude when communicating […]


Researchers from NYU Tandon release 3-D data tracking human interactions outside of coronavirus hotspots

Study to set groundwork to build machine learning models that rapidly analyze how a virus spreads In April when New York City was under a strict lockdown, a team of 16 student researchers from New York University’s Tandon School of Engineering commenced a National Science Foundation Rapid Response Research (RAPID) […]

Scatter plot

CUAHSI has been selected to be the Coordinating Hub for the Critical Zone (CZ) Collaborative Network

Guest post by Jerad Bales, Executive Director, CUAHSI The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) located in Cambridge, MA, has been selected to be the Coordinating Hub for the National Science Foundation’s Critical Zone (CZ) Collaborative Network. The 5-year cooperative agreement became effective September 1, 2020. The […]

Tree Rings

Splash drop

Water Data and Software Services to Support Discovery, Reproducibility, and Collaboration in the Water-Resources Domain and Beyond

Guest post by Emily Clark, Project Manager, CUAHSI The mission of the Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is to enable interdisciplinary collaboration in the water sciences, provide critical cyberinfrastructure, and promote water science education at all levels. CUAHSI’s services can be especially useful in […]


ASSISTments Longitudinal Data Competition challenges participants to determine correlation between early mathematics education and STEM careers

Guest post by Ryan Baker, Associate Professor in the Graduate School of Education at the University of Pennsylvania. The ASSISTments Longitudinal Data Competition (external link no longer available) invited data scientists around the world to participate in a competition around the analysis of student data. Data from middle school student […]

Industry 4.0 , Machine learning and artificial intelligence concept. Ai chipsets for robot arm , driveless cars , sports game chips in smart factory background

Largest-ever cohort of U.S. twins fuels new BD Spoke study

Studying the causes of disease is essential to medical research. However, the discussion is sometimes framed, misleadingly, as ‘nature vs. nurture’—is your condition the result of your genetics or your environment? Generally, the answer is both. But to what degree? A new study in Nature Genetics explores this question for […]

NEBD Hub Logo

NEBD Hub Logo

UMass Amherst, WPI, Penn announce winners of Northeast big data competition

Winning Schemes for Predicting Student Interest in Science  UMass Amherst, WPI, Penn announce winners of Northeast big data competition AMHERST, Mass. – After a year-long, global data-mining competition, organizers today awarded the top three winning teams from Hong Kong, Japan and Michigan at the National Science Foundation’s (NSF) Northeast Big […]


NEBD Hub Logo

Big Data in Education: News and Competition

How can big data help predict student outcomes? Ryan Baker (U Penn) and Neil Heffernan (Worcester Polytechnic Institute) of our Big Data for Education Spoke hope to do just that via the Longitudinal Educational Big Data Competition (external link no longer available). Using carefully de-identified, real-world educational data, participants will predict whether […]


“Enabling Seamless Data Sharing in Industry and Academia” Workshop Report Released

Click here to access the report Data sharing challenges are extensive in cases involving industry and academia, and highlight the need for sharable, adaptable solutions. To that end, a report summarizing the proceedings and outputs of 2016’s Northeast Hub workshop on Data Sharing has recently been published. The workshop convened data science practitioners to […]

Drexel MRC logo

Data Sharing event

NEBDIH Data Sharing Workshop a Success: “I wish I could have gone to this workshop two years ago!”

On September 29th and 30th, stakeholders from across the Northeast and beyond joined the Hub at Drexel University in Philadelphia for “Enabling Seamless Data Sharing in Industry and Academia,” a cross-sector workshop put on by our community to tackle the challenges of sharing data head-on. In short “TED talk”-style presentations and […]