Success Stories

The Northeast Hub is a community convener, collaboration hub, and catalyst for data science innovation in the Northeast Region. The Hub amplifies successes of the community, and shares credit across the community to encourage collaboration and mutual success in data science endeavors.

Success stories highlight community activities which have accomplished significant project goals. These include outcomes that highlight the value and insights delivered from project activities, resources that can be leveraged by the community for broader impact, and requests for collaborations that could perhaps lead to new collaboration, insights and publications.


COVID, Chocolate, and Collaboration: Researchers Discuss Their Work in February Webinar

During February’s COVID Information Commons (CIC) webinar, researchers, students, and non-profit leaders gathered to discuss and collaborate on research about COVID. The webinar featured talks from seven scientists from across the nation who shared insights on their NSF-funded research. Speakers included Naomi Senehi from Rice University, Nicola Sochacka from the […]


Data Visualization Beyond the Screen

Guest post by Dr. Sara Stoudt, Lecturer in Statistical and Data Sciences Program, Smith College We’ve all been acutely aware of screens lately; we cannot seem to escape them. As statisticians and data scientists we’re stereotypically buried in our spreadsheets and charts anyway, but who do we exclude when communicating […]


2021 Kicks off with New Insights by COVID-19 Researchers and the Undergraduate Student Paper Challenge

On Wednesday, January 13, 2021, seven scientists from across the nation joined us as part of the ongoing COVID-19 Research Webinar series organized through the COVID Information Commons. During the event, each researcher shared their work, providing new insights regarding the global pandemic and revealed some of the unexpected implications […]


COVID-19 researchers share insights at the November COVID Information Commons webinar.

This month’s Covid Information Commons (CIC) webinar brought together researchers, students and leaders in nonprofits. The CIC serves as a platform for researchers and scientists to share knowledge and collaborate on research about COVID. This month, the CIC project team announced the launch of a new feature at covidinfocommons.net: the […]


Researchers share new insights and opportunities for collaboration at October’s COVID Information Commons Webinar

Eight scientists spoke with an enthusiastic community at the second COVID-19 Research Webinar organized by the COVID Information Commons on October 16, 2020. Presenters who shared their latest research findings regarding the current pandemic included Rachel Wu (University of California-Riverside), Sara Rampazzi (University of Michigan), Sarah Bowman (Hauptman-Woodward Medical Research […]


Researchers from NYU Tandon release 3-D data tracking human interactions outside of coronavirus hotspots

Study to set groundwork to build machine learning models that rapidly analyze how a virus spreads In April when New York City was under a strict lockdown, a team of 16 student researchers from New York University’s Tandon School of Engineering commenced a National Science Foundation Rapid Response Research (RAPID) […]


DS4All Workshop: Computer Science for All

Guest post by Catherine Cramer, Founder and Principal, Woods Hole Institute. The Northeast Big Data Innovation Hub supports the work of DS4All and has done so since 2015. While there are several different strands to this work, the focus on data science in K-12 education continues to be prominent.  The […]


CUAHSI has been selected to be the Coordinating Hub for the Critical Zone (CZ) Collaborative Network

Guest post by Jerad Bales, Executive Director, CUAHSI The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) located in Cambridge, MA, has been selected to be the Coordinating Hub for the National Science Foundation’s Critical Zone (CZ) Collaborative Network. The 5-year cooperative agreement became effective September 1, 2020. The […]


Water Data and Software Services to Support Discovery, Reproducibility, and Collaboration in the Water-Resources Domain and Beyond

CUAHSI’s Compute Services enable education and research in the cloud. In addition to tools that enable data discovery and publication, CUAHSI develops and maintains cloud software to interoperate with CUAHSI’s data repositories to provide analysis, publication, and replication of scientific models and workflows. CUAHSI JupyterHub CUAHSI JupyterHub is a cloud service […]


Data-driven workshops help teachers understand and engage with students

Guest post by Ivon Arroyo, Associate Professor in the College of Education and the College of Computer Science at the University of Massachusetts Amherst. The Big Data for Education Spoke’s professional development workshops have empowered teachers to leverage data to identify and answer a variety of pedagogical questions about their […]


Using machine learning to optimize big data workflows for collaborative computational steering

Guest post by Chase Wu, Associate Chair of and Professor in the Department of Computer Science at NJIT. Research Background Model-based simulations have become an essential component in next-generation scientific applications and are generating big data on the order of terabyte at present and petabyte or exabyte in the predictable […]


Massive online open course teaches machine learning and data mining for education research

Guest post by Ryan Baker, Associate Professor in the Graduate School of Education at the University of Pennsylvania. The Northeast Big Data for Education Spoke has conducted considerable outreach on methods for data science for educational data sets. Workshops have been conducted in New York City, Buffalo, Philadelphia, Pittsburgh, and […]


ASSISTments Longitudinal Data Competition challenges participants to determine correlation between early mathematics education and STEM careers

Guest post by Ryan Baker, Associate Professor in the Graduate School of Education at the University of Pennsylvania. The ASSISTments Longitudinal Data Competition invited data scientists around the world to participate in a competition around the analysis of student data. Data from middle school student use of a popular online […]


Data Science for All: NE Hub workshop explores teaching data science to high schoolers

Data science is expanding rapidly in undergraduate education, but at the K-12 level, few schools have integrated this critical subject into their curricula. Many questions must be answered first: how should data science be taught to high schoolers? As a standalone course, or integrated throughout other courses? What level of […]


Largest-ever cohort of U.S. twins fuels new BD Spoke study

Studying the causes of disease is essential to medical research. However, the discussion is sometimes framed, misleadingly, as ‘nature vs. nurture’—is your condition the result of your genetics or your environment? Generally, the answer is both. But to what degree? A new study in Nature Genetics explores this question for […]


Funding Awarded for First Round of NEBDIH-Sponsored Big Data Workshops

As part of our mission to address high-priority challenges with data-driven solutions, the Northeast Big Data Innovation Hub put out a call for workshop proposals this spring. We sought to support community-driven workshops that are designed to plan and develop Big Data projects, and are delighted to announce our first […]


Big Data Hubs Partnering on Nation-Wide Data Storage Network

We live in the age of the data deluge, with research group IDC forecasting that humankind will create and replicate 163 zettabytes of data by the year 2025. For perspective, only about 2.7 zettabytes were created and replicated globally in 2012. How might we most effectively manage our abundant data […]


UMass Amherst, WPI, Penn announce winners of Northeast big data competition

Winning Schemes for Predicting Student Interest in Science  UMass Amherst, WPI, Penn announce winners of Northeast big data competition   AMHERST, Mass. – After a year-long, global data-mining competition, organizers today awarded the top three winning teams from Hong Kong, Japan and Michigan at the National Science Foundation’s (NSF) Northeast […]


New blog: $6M from Microsoft to NSF-funded Big Data Hubs and BIGDATA

Over on the Microsoft Azure Blog, Vani Mandava (Director, Data Science Research, Microsoft Research) highlights Microsoft’s partnership with NSF-funded big data initiatives including the Big Data Hubs. Microsoft has committed $6 million in cloud credits to support data science innovation across these programs, including the Northeast Big Data Hub’s Health Spoke […]


Talking Big Data Literacy in “Data Divide” series

With data touching all aspects of our lives today, how can we look beyond traditional approaches to data science education to build capacity as broadly and inclusively as possible? Leaders in data science education discussed “Alternative Avenues for Development of Data Science Education Capacity” on Friday, September 22nd, in a […]


First Steps toward Big Data Literacy Framework at Spring Workshop

Guest post by Catherine Cramer, NYSCI Increasingly, the prosperity, innovation and security of individuals
 and communities depend on a big data-literate society, which calls for a concerted effort 
to define what it means to be a big data literate citizen, information worker, researcher, or policymaker. As a step toward that […]


Big Data in Education: News and Competition

How can big data help predict student outcomes? Ryan Baker (U Penn) and Neil Heffernan (Worcester Polytechnic Institute) of our Big Data for Education Spoke hope to do just that via the Longitudinal Educational Big Data Competition. Using carefully de-identified, real-world educational data, participants will predict whether 172 students in validation and […]


“Enabling Seamless Data Sharing in Industry and Academia” Workshop Report Released

Click here to access the report Data sharing challenges are extensive in cases involving industry and academia, and highlight the need for sharable, adaptable solutions. To that end, a report summarizing the proceedings and outputs of 2016’s Northeast Hub workshop on Data Sharing has recently been published. The workshop convened data science practitioners to […]


Telling a Story with Data: Young Innovator Kenneth Graves

This post highlights one of the up-and-coming data science graduate students who participated in the Northeast Big Data Innovation Hub’s “Young Innovators” program this year. This program and others like it contribute to the Northeast Hub’s mission to build public-private partnerships to address high-priority societal challenges with data-driven solutions. The […]


NEBDIH Data Sharing Workshop a Success: “I wish I could have gone to this workshop two years ago!”

On September 29th and 30th, stakeholders from across the Northeast and beyond joined the Hub at Drexel University in Philadelphia for “Enabling Seamless Data Sharing in Industry and Academia,” a cross-sector workshop put on by our community to tackle the challenges of sharing data head-on. In short “TED talk”-style presentations and […]


Northeast Big Data Innovation Hub Awarded $3.3 million to Create Solutions to Pressing Challenges in Health, Education and Data Sharing

Click here for a PDF version of this release. Click here for the press release from the National Science Foundation (NSF). The National Science Foundation (NSF) has announced $3.3 million in grants to researchers affiliated with the Northeast Big Data Innovation Hub. A common theme underlying the projects below is […]