Leadership Team


Meet some of the people behind the Hub. You can learn more about our leadership team below. To learn more about our collaborators and find out how you can get involved, visit the Collaborators page.

Steering Committee

The Steering Committee advances the mission of the Northeast Big Data Innovation Hub and acts as an oversight board.

Florence Hudson
Executive Director

Florence Hudson is Executive Director of the Northeast Big Data Innovation Hub hosted at the Data Science Institute at Columbia University. She is founder and CEO of FDHint, LLC, and Special Advisor for the NSF Cybersecurity Center of Excellence at Indiana University. She is a former IBM VP and Chief Technology Officer, Internet2 Senior VP and Chief Innovation Officer, and aerospace engineer at Grumman and NASA. Full bio available here.

LinkedIn
Jeannette Wing
Principal Investigator

Jeannette M. Wing is Avanessians Director of the Data Science Institute and Professor of Computer Science at Columbia University. She came to Columbia in July 2017 from Microsoft, where she served as Corporate Vice President of Microsoft Research, overseeing a global network of research labs. She is widely recognized for her intellectual leadership in computer science, particularly in trustworthy computing. Full bio available here.

LinkedIn
Jamie Callan

Jamie Callan is a Professor at the Language Technologies Institute, a graduate department in Carnegie Mellon’s School of Computer Science. He has a joint appointment in the School of Information Systems and Management within CMU’s Heinz College.His research and teaching focus on text-based information retrieval and analysis. His recent work develops advanced search engine architectures, architectures for large-scale distributed search, federated search of vertical search engines, adaptive information filtering, and a variety of text analysis/mining/analytics methods. Full bio available here.

Laura Dietz

Laura Dietz is an Assistant Professor at the University of New Hampshire, where she leads the lab for text retrieval, extraction, machine learning and analytics (TREMA). She is collaborating with researchers in Biogeochemistry, on data science for studying the impacts on humans on their environment. She organizes a tutorial/workshop series on Utilizing Knowledge Graphs in Text-centric Retrieval (KG4IR) and coordinated the TREC Complex Answer Retrieval Track. She received an NSF CAREER Award for utilizing fine-grained knowledge annotations in text understanding and retrieval. Previously, she was a research scientist at Mannheim University, University of Massachusetts Amherst, and graduated from the Max Planck Institute for Computer Science. Full bio available here.

Carsten Eickhoff

Carsten is an assistant professor of Medical and Computer Science at Brown University where he leads the Biomedical AI Lab, specializing in the development of data science and information retrieval techniques with the goal of improving patient safety, individual health and quality of medical care. Before coming to Brown, he studied artificial intelligence and machine learning at the University of Edinburgh, TU Delft and ETH Zurich. Carsten has authored more than 80 conference and journal articles on topics pertaining to automatic large-scale text processing and retrieval as well as information extraction from unstructured natural language resources. Aside from his academic endeavors, he is involved in several deep technology startups in the health sector that strive to translate technological innovation to improved safety and quality of life for patients. Full bio available here.

John Goodhue

John Goodhue is the Executive Director of the Massachusetts Green High Performance Computing Center, a data center and consortium that serves more than 20,000 researchers, students, and educators at Boston University, Harvard, MIT, Northeastern University, the University of Massachusetts, and other institutions in the Northeast. John’s interests include regional and national collaboration on infrastructure for research computing, and developing the next generation of computing professionals. His industry experience includes management and technology leadership positions at Cisco Systems, BBN Technologies, and several early stage startup companies. Full bio available here.

James Hendler

James Hendler is the Tetherless World Senior Constellation Professor of Computer and Cognitive Science, and Head of the Computer Science Department at Rensselaer Polytechnic Institute. He has authored over 200 technical papers in the areas of Semantic Web, artificial intelligence, agent-based computing and high performance processing. Full bio available here.

Co-Principal Investigators

James Hendler

James Hendler is the Tetherless World Senior Constellation Professor of Computer and Cognitive Science, and Head of the Computer Science Department at Rensselaer Polytechnic Institute. He has authored over 200 technical papers in the areas of Semantic Web, artificial intelligence, agent-based computing and high performance processing. Full bio available here.

Vasant Honavar

Dr. Vasant Honavar also represents the Northeast Big Data Hub, along with Florence Hudson, on the Big Data Innovation Hubs' National Coordination Committee. He received his Ph.D. in Computer Science and Cognitive Science in 1990 from the University of Wisconsin Madison, specializing in Artificial Intelligence. In September 2013, Honavar joined the faculty of Penn State University, where he currently serves as a Professor and Edward Frymoyer Chair of Information Science and Technology. He is also the founding Director of the Center for Big Data Analytics and Discovery Informatics and Associate Director of the Institute for Cyberscience. Full bio available here.

Andrew McCallum

Andrew McCallum is Professor and Director of the Information Extraction and Synthesis Laboratory in the School of Computer Science at University of Massachusetts Amherst. He has published over 250 papers in many areas of AI, including natural language processing, machine learning, data mining and reinforcement learning, and his work has received over 30,000 citations. Full bio available here.

Operations

Katie Naum
Manager of Operations

Katie Naum is passionate about responsible data science, sustainability, and  #scicomm that engages and inspires. A native of western New York and current Brooklynite, she holds a B.A. in Sustainable Development from Columbia University. Her interests include writing, hiking, and traveling.

Seed Fund Steering Committee

David Bader
Chair

David A. Bader is a Distinguished Professor in the Department of Computer Science and inaugural Director of the Institute for Data Science at New Jersey Institute of Technology. He is a Fellow of the IEEE, AAAS, and SIAM, and advises the White House, most recently on the National Strategic Computing Initiative (NSCI). His interests are at the intersection of high-performance computing and real-world applications, including cybersecurity, massive-scale analytics, and computational genomics. Full bio available here.

Florence Hudson
Ex Officio

Florence Hudson is Executive Director of the Northeast Big Data Innovation Hub hosted at the Data Science Institute at Columbia University. She is founder and CEO of FDHint, LLC, and Special Advisor for the NSF Cybersecurity Center of Excellence at Indiana University. She is a former IBM VP and Chief Technology Officer, Internet2 Senior VP and Chief Innovation Officer, and aerospace engineer at Grumman and NASA. Full bio available here.

Jenni Evans

Jenni L. Evans is the Director of Penn State’s Institute for Computational and Data Sciences (ICDS), Professor of Meteorology & Atmospheric Science and served as Centennial President of the American Meteorological Society (AMS) in 2019. Her research spans tropical climate, climate change, and hurricane lifecycles in the tropics and through extratropical transition, as well as the “music of hurricanes.” Full bio available here.

Josh Greenberg

Joshua M. Greenberg is director of the Alfred P. Sloan Foundation's Digital Information Technology program. Dr. Greenberg is an active member of the broader digital library and digital humanities communities and maintains research and teaching interests in the history and sociology of information technology, the dynamics of public engagement with expert knowledge, and the methodological implications of new digital technologies for research. Full bio available here.

Chris Hill

Chris Hill specializes in Earth and planetary computational science. He was a founding developer of the M.I.T. General Circulation Model (MITgcm), a numerical simulation tool used for a wide range of basic science and applied studies in planetary fluid dynamics, and helped launch the Earth System Modeling Framework (ESMF), a major open standard for creating multicomponent models of Earth system processes. Full bio available here.

Jill Jemison

Jill Jemison is Assistant Dean for Technology/CIO at the University of Vermont with responsibility for infrastructure, application development, support, educational technologies and AV at the Larner College of Medicine and the College of Nursing and Health Sciences. She was formerly chair of the Association of American Medical Colleges Group on Information Resources and faculty director of the GIR Leadership Institute. Full bio available here.

Daniel Lopresti

Dan Lopresti is a Professor of Computer Science and Engineering at Lehigh University, having recently completed 10 years as chair of the CSE Department, and director of Lehigh's Data X initiative. His research examines algorithmic and systems-related questions in pattern recognition, document analysis, and computer security. He is also the Vice Chair of the Computing Research Association’s CCC Council. Full bio available here.

Gideon Mann

Gideon Mann is the head of Data Science in the Office of the CTO at Bloomberg LP. At Bloomberg, he guides corporate strategy for machine learning, natural language processing (NLP) and information retrieval. He’s served as a founding member of both the Data for Good Exchange (D4GX), an annual conference on data science applications for social good, and the Shift Commission on Work, Workers and Technology. Full bio available here.

Renée Miller

Renée J. Miller is a University Distinguished Professor of Computer Science at Northeastern University. She is a Fellow of the Royal Society of Canada, Canada’s National Academy of Science, Engineering and the Humanities. She received the US Presidential Early Career Award for Scientists and Engineers (PECASE). Her work is focused on building practical data integration systems. Full bio available here.

Advisory Board

Our Advisory Board includes regional leaders from academia, industry, non-profit organizations and government, whose insights help further the Hub’s mission and who demonstrate sustained interest in engaging with the Hub in a voluntary role. Leaders of the following Thematic and Cross-Cutting Areas are indicated below.

Emeritus Leadership

Kathy McKeown
Founding Principal Investigator

Kathleen R. McKeown is the Henry and Gertrude Rothschild Professor of Computer Science at Columbia University, and was the founding Director of the Data Science Institute. Full bio available here.