Success Stories

The Northeast Hub is a community convener, collaboration hub, and catalyst for data science innovation in the Northeast Region. The Hub amplifies the community's successes and shares credit across it to encourage collaboration and mutual success in data science endeavors.

Success stories highlight community activities that have accomplished significant project goals. These include outcomes that show the value and insights delivered by project activities, resources that the community can leverage for broader impact, and requests for collaboration that could lead to new partnerships, insights, and publications.


Guest post by Chase Wu, Associate Chair of and Professor in the Department of Computer Science at NJIT. Research Background: Model-based simulations have become an essential component in next-generation scientific applications and are generating big data on the order of terabytes at present and petabytes or exabytes in the predictable […]

Using machine learning to optimize big data workflows for collaborative ...


Guest post by Ryan Baker, Associate Professor in the Graduate School of Education at the University of Pennsylvania. The Northeast Big Data for Education Spoke has conducted considerable outreach on data science methods for educational data sets. Workshops have been conducted in New York City, Buffalo, Philadelphia, Pittsburgh, and […]

Massive online open course teaches machine learning and data mining ...


Guest post by Ryan Baker, Associate Professor in the Graduate School of Education at the University of Pennsylvania. The ASSISTments Longitudinal Data Competition invited data scientists around the world to participate in a competition around the analysis of student data. Data from middle school student use of a popular online […]

ASSISTments Longitudinal Data Competition challenges participants to determine correlation between ...


Data science is expanding rapidly in undergraduate education, but at the K-12 level, few schools have integrated this critical subject into their curricula. Many questions must be answered first: how should data science be taught to high schoolers? As a standalone course, or integrated throughout other courses? What level of […]

Data Science for All: NE Hub workshop explores teaching data ...





As part of our mission to address high-priority challenges with data-driven solutions, the Northeast Big Data Innovation Hub put out a call for workshop proposals this spring. We sought to support community-driven workshops that are designed to plan and develop Big Data projects, and are delighted to announce our first […]

Funding Awarded for First Round of NEBDIH-Sponsored Big Data Workshops





Winning Schemes for Predicting Student Interest in Science: UMass Amherst, WPI, Penn announce winners of Northeast big data competition. AMHERST, Mass. – After a year-long, global data-mining competition, organizers today awarded the top three winning teams from Hong Kong, Japan and Michigan at the National Science Foundation’s (NSF) Northeast […]

UMass Amherst, WPI, Penn announce winners of Northeast big data ...


Over on the Microsoft Azure Blog, Vani Mandava (Director, Data Science Research, Microsoft Research) highlights Microsoft’s partnership with NSF-funded big data initiatives including the Big Data Hubs. Microsoft has committed $6 million in cloud credits to support data science innovation across these programs, including the Northeast Big Data Hub’s Health Spoke […]

New blog: $6M from Microsoft to NSF-funded Big Data Hubs ...




With data touching all aspects of our lives today, how can we look beyond traditional approaches to data science education to build capacity as broadly and inclusively as possible? Leaders in data science education discussed “Alternative Avenues for Development of Data Science Education Capacity” on Friday, September 22nd, in a […]

Talking Big Data Literacy in “Data Divide” series


Guest post by Catherine Cramer, NYSCI. Increasingly, the prosperity, innovation and security of individuals and communities depend on a big data-literate society, which calls for a concerted effort to define what it means to be a big data-literate citizen, information worker, researcher, or policymaker. As a step toward that […]

First Steps toward Big Data Literacy Framework at Spring Workshop


How can big data help predict student outcomes? Ryan Baker (U Penn) and Neil Heffernan (Worcester Polytechnic Institute) of our Big Data for Education Spoke hope to do just that via the Longitudinal Educational Big Data Competition. Using carefully de-identified, real-world educational data, participants will predict whether 172 students in validation and […]

Big Data in Education: News and Competition


Data sharing challenges are extensive in cases involving industry and academia, and highlight the need for sharable, adaptable solutions. To that end, a report summarizing the proceedings and outputs of 2016’s Northeast Hub workshop on Data Sharing has recently been published. The workshop convened data science practitioners to […]

“Enabling Seamless Data Sharing in Industry and Academia” Workshop Report ...



This post highlights one of the up-and-coming data science graduate students who participated in the Northeast Big Data Innovation Hub’s “Young Innovators” program this year. This program and others like it contribute to the Northeast Hub’s mission to build public-private partnerships to address high-priority societal challenges with data-driven solutions. The […]

Telling a Story with Data: Young Innovator Kenneth Graves


On September 29th and 30th, stakeholders from across the Northeast and beyond joined the Hub at Drexel University in Philadelphia for “Enabling Seamless Data Sharing in Industry and Academia,” a cross-sector workshop put on by our community to tackle the challenges of sharing data head-on. In short “TED talk”-style presentations and […]

NEBDIH Data Sharing Workshop a Success: “I wish I could ...


The National Science Foundation (NSF) has announced $3.3 million in grants to researchers affiliated with the Northeast Big Data Innovation Hub. A common theme underlying the projects below is […]

Northeast Big Data Innovation Hub Awarded $3.3 million to Create ...