Responsible Data Science


NSDC Data Science Flashcards – Data Science Ethics Card #6 – 5 V’s of Big Data

This NSDC Data Science Flashcards series will teach you about the importance of data ethics. This installment of the NSDC Data Science Flashcards series was created by Florence Hudson and Varalika Mahajan. Recordings were done by Lauren Close, Florence Hudson, and Emily Rothenberg. You can find these videos on the […]

Flashcard Intro Slide: What are the 5 V's of Big Data?

Flashcard Intro Slide: What is Algorithmic Fairness?

NSDC Data Science Flashcards – Data Science Ethics Card #5 – Algorithmic Fairness

This NSDC Data Science Flashcards series will teach you about the importance of data ethics. This installment of the NSDC Data Science Flashcards series was created by Florence Hudson and Varalika Mahajan. Recordings were done by Lauren Close, Florence Hudson, and Emily Rothenberg. You can find these videos on the […]


NSDC Data Science Flashcards – Data Science Ethics Card #4 – FAIR Principles Part 2

This NSDC Data Science Flashcards series will teach you about the importance of data ethics. This installment of the NSDC Data Science Flashcards series was created by Florence Hudson and Varalika Mahajan. Recordings were done by Lauren Close, Florence Hudson, and Emily Rothenberg. You can find these videos on the […]

Flashcard Intro Slide: What are the FAIR Data Principles?

Flashcard Intro Slide: What are the FAIR Data Principles?

NSDC Data Science Flashcards – Data Science Ethics Card #3 – FAIR Principles Part 1

This NSDC Data Science Flashcards series will teach you about the importance of data ethics. This installment of the NSDC Data Science Flashcards series was created by Florence Hudson and Varalika Mahajan. Recordings were done by Lauren Close, Florence Hudson, and Emily Rothenberg. You can find these videos on the […]


NSDC Data Science Flashcards – Data Science Ethics Card #2 – Intellectual Property

This NSDC Data Science Flashcards series will teach you about the importance of data ethics. This installment of the NSDC Data Science Flashcards series was created by Florence Hudson and Varalika Mahajan. Recordings were done by Lauren Close, Florence Hudson, and Emily Rothenberg. You can find these videos on the […]

Data Science Flashcard Intro Slide: What is Intellectual Property?

Flashcard Intro Slide: What is Privacy, Transparency, and Consent?

NSDC Data Science Flashcards – Data Science Ethics Card #1 – Privacy, Transparency, and Consent

This NSDC Data Science Flashcards series will teach you about the importance of data ethics. This installment of the NSDC Data Science Flashcards series was created by Florence Hudson and Varalika Mahajan. Recordings were done by Lauren Close, Florence Hudson, and Emily Rothenberg. You can find these videos on the […]


Improving Data Integrity Awareness in HPC Datasets using Sparsity Profiles

Guest post by Dr. Seung Woo Son, Associate Professor, University of Massachusetts, Lowell This Success Story is a report on the results of one of the awards in the Northeast Big Data Innovation Hub’s 2021 Seed Fund program. As scientists conduct analyses that rely on large-scale simulations to achieve breakthroughs […]

Seung Woo Son

Summer 2023 CIC Webinar Recap

The fourteenth session of the COVID Information Commons (CIC) webinar series took place on July 26th, 2023. In this forum, leading COVID-19 scientists presented their current research on the global pandemic.  Event moderators included Florence Hudson, Executive Director of the Northeast Big Data Innovation Hub at Columbia University and COVID Information Commons Principal Investigator […]

NEBD Hub Logo

Graphic for NEBDHub Inaugural Student Research Symposium

January 2023 NEBDHub Inaugural Student Research Symposium

Written by: Femi Johnson A recording of this event is available at the Northeast Big Data Hub’s YouTube channel. On January 27, 2023, the Northeast Big Data Innovation Hub (NEBDHub) held its 2023 Inaugural Student Research Symposium. This event highlighted the student-led research that was completed by undergraduate- and graduate-level […]


NEBD Hub Logo

Notice of Special Interest (NOSI): Optimization of Data Storage and Utilization for the Sequence Read Archive (SRA)

The purpose of this Notice of Special Interest (NOSI) is to inform the scientific community of the interest of NIGMS, NLM, and ODSS in supporting efficiency optimization and cost reduction for Sequence Read Archive (SRA) data storage and utilization. Research Objectives The SRA, hosted by the National Center for Biotechnology […]


Leveraging Data Science and Advanced Technologies. Conversation with Ms. Florence D. Hudson

WIT Virtual Voices, a series of webinars hosted by the World Information Transfer focusing on the SDGs to promote science-based news on health and environment. This online webinar from NEBDHub Executive Director, Florence Hudson, touches on the role of data science in developing advanced technologies.

World Information Transfer Logo

Using Data Science To Study Environmental Racism, Justice, And Policy

Guest post by Dr. Aunshul Rege, Temple University This Success Story is a report on the results of the Northeast Big Data Innovation Hub’s 2020 Seed Fund program. This project examined environmental injustice using a qualitative criminological lens. The project surveyed known case studies of environmental injustice in the United […]

Anshul Rege

Scatter plot

Researchers from NYU Tandon release 3-D data tracking human interactions outside of coronavirus hotspots

Study to set groundwork to build machine learning models that rapidly analyze how a virus spreads In April when New York City was under a strict lockdown, a team of 16 student researchers from New York University’s Tandon School of Engineering commenced a National Science Foundation Rapid Response Research (RAPID) […]


Water Data and Software Services to Support Discovery, Reproducibility, and Collaboration in the Water-Resources Domain and Beyond

Guest post by Emily Clark, Project Manager, CUAHSI The mission of the Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is to enable interdisciplinary collaboration in the water sciences, provide critical cyberinfrastructure, and promote water science education at all levels. CUAHSI’s services can be especially useful in […]

Splash drop

Announcing the Northeast Big Data Hub Seed Fund Program

The Northeast Big Data Hub is delighted to announce our Seed Fund program this month. Designed to promote collaboration in data science, the Seed Fund will encourage the cross-pollination of ideas, data and tools across disciplines and sectors including academia, industry, government, and communities. Funding provided through this program is intended […]

Seedlings

Drexel MRC logo

“Enabling Seamless Data Sharing in Industry and Academia” Workshop Report Released

Click here to access the report Data sharing challenges are extensive in cases involving industry and academia, and highlight the need for sharable, adaptable solutions. To that end, a report summarizing the proceedings and outputs of 2016’s Northeast Hub workshop on Data Sharing has recently been published. The workshop convened data science practitioners to […]


From Our Community: Differential Privacy Symposium, November 12

A symposium on differential privacy will be held at Princeton University’s Institute for Advanced Study, next month. The event will bring together speakers including Helen Nissenbaum (Cornell Tech and NYU), Aaron Roth (University of Pennsylvania), Guy Rothblum (Weizmann Institute), Kunal Talwar (Google Brain), and Jonathan Ullman (Northeastern University). To learn more and register, please visit […]

DP event

Data Sharing event

NEBDIH Data Sharing Workshop a Success: “I wish I could have gone to this workshop two years ago!”

On September 29th and 30th, stakeholders from across the Northeast and beyond joined the Hub at Drexel University in Philadelphia for “Enabling Seamless Data Sharing in Industry and Academia,” a cross-sector workshop put on by our community to tackle the challenges of sharing data head-on. In short “TED talk”-style presentations and […]


NEBD Hub logo

Northeast Big Data Innovation Hub Awarded $3.3 million to Create Solutions to Pressing Challenges in Health, Education and Data Sharing

Click here for a PDF version of this release. Click here for the press release from the National Science Foundation (NSF). The National Science Foundation (NSF) has announced $3.3 million in grants to researchers affiliated with the Northeast Big Data Innovation Hub. A common theme underlying the projects below is […]