Diversity, Equity, Inclusion, and Accessibility

The mission of the Northeast Big Data Innovation Hub is to build and strengthen partnerships. Great partnerships cannot rest on a foundation of inequality; they must work to advance equity and accessibility for all partners, of all backgrounds. That’s why building a diverse, equitable and inclusive community is central to the work of the Hub. The Northeast Big Data Innovation Hub is committed to advancing opportunities for those who have been underserved in the data science field, including from the perspective of racial and social equity and fairness, and for individuals with differing abilities.

As a mode of representation, data holds great power in individual and institutional decision-making. It both reveals disparities and needs within our communities and provides a path forward toward innovation and change. As a community of data science practitioners, we are committed to building diversity, equity, and inclusion in all aspects of our work, always mindful to leverage the power of data responsibly, and to improve accessibility to data and insights. Our focus on Responsible Data Science serves as a foundation for enabling data science that is secure, private, and ethical.

We realize our commitment to diversity, equity, inclusion, and accessibility through mutual collaboration, proactive outreach, and inclusivity by design. Our goal every day is to do better than we have done before.

Learn more below about some of our work to date and read our website accessibility statement.

DEIA and Data Science Resources

Race + Data Science Lecture Series

The Data Science Institute (DSI) at Columbia University hosts the Race + Data Science Lecture Series at regular intervals. This lecture series aims to celebrate and advance research in the areas of race and data, engineering, and computational science.

With this series of events, the DSI’s goal is to improve how we as data scientists and data-adjacent researchers speak about race. Through wide distribution of our speaker’s research, we envision that these talks can lead to the discovery of important resources about race, inequity, and emerging applications. This series is funded by support from the MacArthur Foundation and New America.

The People’s Speech Dataset

The People’s Speech Dataset is among the world’s largest English speech recognition corpus today that is licensed for academic and commercial usage under CC-BY-SA and CC-BY 4.0. It includes 30,000+ hours of transcribed speech in English languages with a diverse set of speakers. This open dataset is large enough to train speech-to-text systems and crucially is available with a permissive license. Just as ImageNet catalyzed machine learning for vision, the People’s Speech will unleash innovation in speech research and products that are available to users across the globe.

10 Women Who Pioneered the Tech Industry

Women programmed the very first computer, wrote the code that landed men on the moon, and came up with the idea for computer programs a century before the invention of computers. Thanks to the barrier-breaking work of women like Annie Easley, Carol Shaw, and Margaret Hamilton, today women work in every sector of the tech industry.

March 2022 Women in STEM Campaign

In honor of Women’s History Month, the NEBD Hub and the National Student Data Corps (NSDC) launched a month-long program in March 2022 called the Women in STEM campaign. This campaign leveraged social media and the NEBD Hub’s newsletters to share resources, opportunities, and information about how women can establish a successful career in data science. This campaign culminated with the March NSDC Career Panel, held on Friday, March 25th.

Upcoming DEIA and Data Science Events

DEIA and Data Science Professional Opportunities

2022 Data Science Internship Program, Massachusetts Life Sciences Center

Accepting rolling applications

2022 NSF Summit Student Application

Applications due August 1st.

AIM-AHEAD Research Fellowships

Applications due May 22nd

CISE Community Research Infrastructure (CCRI)

Proposals due June 23rd

Civic Innovation Challenge (CIVIC)

Stage 1 Deadline – May 5th
Stage 2 Deadline – February 1

HACU Scholarship Program

Applications due July 1st

Microsoft Research PhD Fellowship

Call for nominations closes June 7th

Pathways to Enable Open-Source Ecosystems (POSE)

Phase I Deadline – May 12th
Phase I Deadline – October 21st

Pilot Projects to Address Factors Contributing to Structural Racism in Public Health

Applications due June 1st

Research Fellows for On the Books: Jim Crow and Algorithms of Resistance Project

Applications due July 15th

RFI: NIH Prize Competition for Institutional Excellence in Diversity, Equity, Inclusion, and Accessibility

Comments requested by July 28th

SEEDS Grant Program, South Big Data Innovation Hub

Applications accepted through June 3rd

The S.E.E.D.S Grant Program: Southern Engagement and Enrichment in Data Science

Applications accepted through June 3rd.

DEIA and Data Science Career Opportunities

Additional DEIA Career Resources can be found on the National Student Data Corps (NSDC) Career Central page.

DEIA Success Stories


DEFLAB: Data Education and Feminism at Lafayette and Beyond

Guest post by Trent Gaugler, Lafayette College This Success Story is a report on the results of the Northeast Big Data Innovation Hub’s 2020 Seed Fund program. The goals of this Seed Fund project were to introduce students to the fundamentals of data science through socially relevant projects, to enhance Lafayette College students’ ability to […]

Garret Dash Nelson

Building Tools and Training for Public & Educational Use of Geospatial Big Data

Guest post by Garrett Dash Nelson, Norman B. Leventhal Map and Education Center at Boston Public Library This Success Story is a report on the results of the Northeast Big Data Innovation Hub’s 2020 Seed Fund program. The primary goal of this Seed Fund project was to create a gateway for adult library patrons and […]

Image of computer screen with code as seen through glasses lying on a table

Data Science in General Education

Guest post by Cathie LeBlanc, Ph.D., Plymouth State University This Success Story is a report on the results of the Northeast Big Data Innovation Hub’s 2020 Seed Fund program. Plymouth State University has a history of active faculty learning communities focused on various aspects of teaching. Our latest learning community, funded by the Northeast Big Data Innovation […]