News & Updates




We tried to understand to what extent people coming from a COVID-19-infected environment interact with the built and natural environment. Specifically, we wanted to understand in three dimensions what people touched, how much they touched, and where they went coming out of hospitals and urgent care facilities at the peak of the COVID-19 outbreak.

5 Questions with COVID Researchers: Debra Laefer, New York University





The role of contaminated surfaces (fomites) in community transmission of SARS-CoV-2 has been debated. Our team is sampling high-touch public surfaces during the ongoing pandemic to improve understanding of the risk of SARS-CoV-2 infection when the public accesses essential businesses such as grocery stores, pharmacies, and gas stations.

5 Questions with COVID Researchers: Amy Pickering, Tufts University








The Northeast Big Data Hub is delighted to announce our Seed Fund program this month. Designed to promote collaboration in data science, the Seed Fund will encourage the cross-pollination of ideas, data and tools across disciplines and sectors including academia, industry, government, and communities. Funding provided through this program is intended […]

Announcing the Northeast Big Data Hub Seed Fund Program



Guest post by Ivon Arroyo, Associate Professor in the College of Education and the College of Computer Science at the University of Massachusetts Amherst. The Big Data for Education Spoke’s professional development workshops have empowered teachers to leverage data to identify and answer a variety of pedagogical questions about their […]

Data-driven workshops help teachers understand and engage with students


Guest post by Chase Wu, Associate Chair of and Professor in the Department of Computer Science at NJIT. Research Background Model-based simulations have become an essential component in next-generation scientific applications and are generating big data on the order of terabyte at present and petabyte or exabyte in the predictable […]

Using machine learning to optimize big data workflows for collaborative ...


Guest post by Ryan Baker, Associate Professor in the Graduate School of Education at the University of Pennsylvania. The Northeast Big Data for Education Spoke has conducted considerable outreach on methods for data science for educational data sets. Workshops have been conducted in New York City, Buffalo, Philadelphia, Pittsburgh, and […]

Massive online open course teaches machine learning and data mining ...


Guest post by Ryan Baker, Associate Professor in the Graduate School of Education at the University of Pennsylvania. The ASSISTments Longitudinal Data Competition invited data scientists around the world to participate in a competition around the analysis of student data. Data from middle school student use of a popular online […]

ASSISTments Longitudinal Data Competition challenges participants to determine correlation between ...






Data science is expanding rapidly in undergraduate education, but at the K-12 level, few schools have integrated this critical subject into their curricula. Many questions must be answered first: how should data science be taught to high schoolers? As a standalone course, or integrated throughout other courses? What level of […]

Data Science for All: NE Hub workshop explores teaching data ...








As part of our mission to address high-priority challenges with data-driven solutions, the Northeast Big Data Innovation Hub put out a call for workshop proposals this spring. We sought to support community-driven workshops that are designed to plan and develop Big Data projects, and are delighted to announce our first […]

Funding Awarded for First Round of NEBDIH-Sponsored Big Data Workshops




Winning Schemes for Predicting Student Interest in Science  UMass Amherst, WPI, Penn announce winners of Northeast big data competition   AMHERST, Mass. – After a year-long, global data-mining competition, organizers today awarded the top three winning teams from Hong Kong, Japan and Michigan at the National Science Foundation’s (NSF) Northeast […]

UMass Amherst, WPI, Penn announce winners of Northeast big data ...



Over on the Microsoft Azure Blog, Vani Mandava (Director, Data Science Research, Microsoft Research) highlights Microsoft’s partnership with NSF-funded big data initiatives including the Big Data Hubs. Microsoft has committed $6 million in cloud credits to support data science innovation across these programs, including the Northeast Big Data Hub’s Health Spoke […]

New blog: $6M from Microsoft to NSF-funded Big Data Hubs ...




With data touching all aspects of our lives today, how can we look beyond traditional approaches to data science education to build capacity as broadly and inclusively as possible? Leaders in data science education discussed “Alternative Avenues for Development of Data Science Education Capacity” on Friday, September 22nd, in a […]

Talking Big Data Literacy in “Data Divide” series



Guest post by Catherine Cramer, NYSCI Increasingly, the prosperity, innovation and security of individuals
 and communities depend on a big data-literate society, which calls for a concerted effort 
to define what it means to be a big data literate citizen, information worker, researcher, or policymaker. As a step toward that […]

First Steps toward Big Data Literacy Framework at Spring Workshop



How can big data help predict student outcomes? Ryan Baker (U Penn) and Neil Heffernan (Worcester Polytechnic Institute) of our Big Data for Education Spoke hope to do just that via the Longitudinal Educational Big Data Competition. Using carefully de-identified, real-world educational data, participants will predict whether 172 students in validation and […]

Big Data in Education: News and Competition


Click here to access the report Data sharing challenges are extensive in cases involving industry and academia, and highlight the need for sharable, adaptable solutions. To that end, a report summarizing the proceedings and outputs of 2016’s Northeast Hub workshop on Data Sharing has recently been published. The workshop convened data science practitioners to […]

“Enabling Seamless Data Sharing in Industry and Academia” Workshop Report ...



This post highlights one of the up-and-coming data science graduate students who participated in the Northeast Big Data Innovation Hub’s “Young Innovators” program this year. This program and others like it contribute to the Northeast Hub’s mission to build public-private partnerships to address high-priority societal challenges with data-driven solutions. The […]

Telling a Story with Data: Young Innovator Kenneth Graves




The National Science Foundation (NSF) has published the Smart & Connected Communities (S&CC) program solicitation, spanning the Directorates for Computer and Information Science and Engineering (CISE); Education and Human Resources (EHR); Engineering (ENG); Geosciences (GEO); and Social, Behavioral and Economic Sciences (SBE). The goal of the S&CC program solicitation is […]

Via NSF: Smart and Connected Communities (S&CC) Program Solicitation




A symposium on differential privacy will be held at Princeton University’s Institute for Advanced Study, next month. The event will bring together speakers including Helen Nissenbaum (Cornell Tech and NYU), Aaron Roth (University of Pennsylvania), Guy Rothblum (Weizmann Institute), Kunal Talwar (Google Brain), and Jonathan Ullman (Northeastern University). To learn more and register, please visit […]

From Our Community: Differential Privacy Symposium, November 12


On September 29th and 30th, stakeholders from across the Northeast and beyond joined the Hub at Drexel University in Philadelphia for “Enabling Seamless Data Sharing in Industry and Academia,” a cross-sector workshop put on by our community to tackle the challenges of sharing data head-on. In short “TED talk”-style presentations and […]

NEBDIH Data Sharing Workshop a Success: “I wish I could ...





Click here for a PDF version of this release. Click here for the press release from the National Science Foundation (NSF). The National Science Foundation (NSF) has announced $3.3 million in grants to researchers affiliated with the Northeast Big Data Innovation Hub. A common theme underlying the projects below is […]

Northeast Big Data Innovation Hub Awarded $3.3 million to Create ...


On May 23rd, the Obama Administration released the Federal Big Data Research and Development Strategic Plan, part of the Big Data Research and Development Initiative launched in 2012. As data science capabilities and opportunities continue to grow, the Plan provides a comprehensive overview and establishes guidance for developing and expanding […]

White House Releases Big Data Strategic Plan