Dartmouth Libraries

Computer Science

Resources for starting your research in computer science.

Multidisciplinary Data Sources

  • Amazon Web Services (AWS) Free Datasets
    Public Data Sets on AWS provides a centralized repository of public data sets that can be seamlessly integrated into AWS cloud-based applications.
  • Data.World
    Open, Secure, Social and Linked data--building the most meaningful, collaborative, and abundant data resource in the world.
  • Fact Extraction and VERification (FEVER) dataset
    Dataset of 200,000 true and false claims
  • GitHub Data Packaged Core Datasets
    Important, commonly-used datasets in high quality, easy-to-use & open form as data packages
  • Google Dataset Search
    Search for datasets and related data spread across multiple data repositories on the web.
  • Kaggle Datasets
    Open datasets on everything from government, health, and science to popular games and dating trends.
  • Meta AI: Datasets for Advancing AI Research
    Datasets gathered by Meta (Facebook).
  • MRAN (Microsoft R Application Network): Data Sources on the Web
    A list of data sources, collected and categorized, limited to those with a reasonably simple process for importing CSV files. Most of the data sets listed are free, but some are not.
    An (R!) after a source means that the data are already in R format or that R commands exist for importing the data directly.

Business

  • AIRBNB data
    Data behind the Inside Airbnb site is sourced from publicly available information from the Airbnb site. The data has been analyzed, cleansed and aggregated where appropriate to facilitate public discussion.
  • Open Trade Statistics
    Open Trade Statistics is an independent project that values reproducible research and provides tidy trade data. It is heavily inspired by R community values and open-license views on freedom. It focuses on commodities data and emphasizes data processing and reproducibility over data visualization.

Computer Networks

  • ANT Datasets
    Datasets from the ANT (Analysis of Network Traffic) Lab, whose goal is to improve the Internet by discovering new ways to understand network topology, traffic, use, and abuse.
  • CRAWDAD
    Community Resource for Archiving Wireless Data At Dartmouth. CRAWDAD began in 2004 at Dartmouth College as a place to share wireless network data with the research community. Its purpose was to enable access to data from real networks and real mobile users at a time when collecting such data was challenging and expensive. The archive has continued to grow since its inception, and starting in summer 2022 is being housed on IEEE DataPort.

Government: United States

  • ArchivaL Federal Reserve Economic Data (ALFRED)
    Allows you to retrieve vintage versions of economic data that were available on specific dates in history.
  • Data.gov
    The purpose of Data.gov is to increase public access to high value, machine readable datasets generated by the Executive Branch of the Federal Government.
  • Federal Reserve Archive (FRASER)
    Various types of publications that are primarily statistical in nature, including books, magazine series, and true statistical releases.
  • Federal Reserve Economic Data (FRED)
    Online database consisting of more than 148,000 economic data time series from 59 national, international, public, and private sources. FRED is created and maintained by the Research Department at the Federal Reserve Bank of St. Louis.
  • Federal Reserve Bank of New York
    Housing and mortgage data.
  • U.S. Census Bureau Research Guide
    Links to and explanations of U.S. Census data, plus mapping software.

Health & Medicine

  • Big Cities Health Inventory Data
    Access and analyze health data from 26 cities, for 34 health indicators, and across six demographic indicators.
  • Centers for Disease Control and Prevention
  • Child Health and Developmental Studies
    Data on how health and disease are passed on between generations--not just by genes, but also through social, personal, and environmental surroundings.
  • Dartmouth Atlas
    Uses Medicare data to provide information and analysis about national, regional, and local markets, as well as hospitals and their affiliated physicians.
  • Healthcare Cost and Utilization Project (HCUP)
    Largest collection of longitudinal hospital care data in the United States.
  • Healthcare Delivery Research Program Public Data
  • HealthData.gov
    Includes clinical care provider quality information, nationwide health service provider directories, databases of the latest medical and scientific knowledge, consumer product data, community health performance information, government spending data.
  • Human Mortality Database
    Detailed mortality and population data
  • Mammographic Image Analysis
    Mammographic Image Analysis Society (MIAS) database and the Digital Database for Screening Mammography (DDSM)
  • Medicare Provider Utilization and Payment Data: Physician and Other Supplier
    Information about services and procedures provided to Medicare beneficiaries by physicians and other healthcare professionals, with information about utilization, payment, and submitted charges organized by National Provider Identifier (NPI), Healthcare Common Procedure Coding System (HCPCS) code, and place of service.
  • National Cancer Institute Data Access System
    The Cancer Data Access System ("CDAS") is a website where you may request data recorded from various research studies. For some studies, you may also request images or biospecimens.
    CDAS provides extensive public documentation for each study, including a trial summary, an overview of the data collected, and a searchable database of research projects and publications.
    If you are interested in obtaining study data, you may begin a CDAS project for that study. All projects are reviewed by NCI trial leadership. Upon approval, you will be granted access to the requested data and/or materials for a limited period.
  • National Center for Health Statistics (NCHS)
    Data visualization, searchable statistics, and interactive queries on health and health care.
  • OpenNEURO
    Sharing neuroimaging data

Images and Movement

  • Annotated Image Dataset of Household Objects
    This data set contains two sets of pictures of household objects, created by the RoboFEI@Home team to develop object detection systems for a domestic robot.
    The first data set was created with objects from a local supermarket. Product brands are typical of Brazil. The second data set is composed of objects from the RoboCup@Home 2018 OPL competition.
    The data set contains the basic resources for the creation of custom object detection systems. Users can use the provided annotated images to train their own models and validate them in a set of test images.
  • The CIFAR-10 Dataset
    The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images per class. There are 50000 training images and 10000 test images.
    The dataset is divided into five training batches and one test batch, each with 10000 images. The test batch contains exactly 1000 randomly-selected images from each class. The training batches contain the remaining images in random order, but some training batches may contain more images from one class than another. Between them, the training batches contain exactly 5000 images from each class.
  • HMDB: a Large Human Motion Database
    HMDB is collected from various sources, mostly from movies, and a small proportion from public databases such as the Prelinger archive, YouTube and Google videos. The dataset contains 6849 clips divided into 51 action categories, each containing a minimum of 101 clips.
  • ImageNet
    ImageNet is an image database organized according to the WordNet hierarchy (currently only the nouns), in which each node of the hierarchy is depicted by hundreds and thousands of images. The project has been instrumental in advancing computer vision and deep learning research. The data is available for free to researchers for non-commercial use.
  • Indoor Scene Recognition
    Indoor scene recognition is a challenging open problem in high level vision. Most scene recognition models that work well for outdoor scenes perform poorly in the indoor domain. The main difficulty is that while some indoor scenes (e.g. corridors) can be well characterized by global spatial properties, others (e.g., bookstores) are better characterized by the objects they contain.
    More generally, to address the indoor scenes recognition problem we need a model that can exploit local and global discriminative information.
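
The figures quoted in the CIFAR-10 entry above can be cross-checked with a little arithmetic. The following is a minimal sketch that uses only the counts stated in that entry; the constant and function names are hypothetical, chosen for illustration, and nothing here downloads or reads the dataset itself.

```python
# Counts quoted in the CIFAR-10 entry above (names are illustrative only).
NUM_CLASSES = 10
IMAGES_PER_CLASS = 6000
TRAIN_BATCHES = 5
TEST_BATCHES = 1
IMAGES_PER_BATCH = 10000
TRAIN_IMAGES_PER_CLASS = 5000
TEST_IMAGES_PER_CLASS = 1000


def cifar10_summary():
    """Derive dataset totals from the per-class and per-batch figures."""
    total = NUM_CLASSES * IMAGES_PER_CLASS
    train = TRAIN_BATCHES * IMAGES_PER_BATCH
    test = TEST_BATCHES * IMAGES_PER_BATCH
    return {"total": total, "train": train, "test": test}


summary = cifar10_summary()
# The batch totals must add up to the full dataset ...
assert summary["train"] + summary["test"] == summary["total"]
# ... and the per-class splits must match the batch totals.
assert NUM_CLASSES * TRAIN_IMAGES_PER_CLASS == summary["train"]
assert NUM_CLASSES * TEST_IMAGES_PER_CLASS == summary["test"]
print(summary)  # {'total': 60000, 'train': 50000, 'test': 10000}
```

Because individual training batches may be unbalanced across classes, the per-class count of 5000 holds only across all five training batches taken together, not within any single batch.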

Language

  • COCA: Corpus of Contemporary American English
    The corpus contains more than one billion words of text (25+ million words each year, 1990-2019) from eight genres: spoken, fiction, popular magazines, newspapers, academic texts, and (with the March 2020 update) TV and movie subtitles, blogs, and other web pages. The corpus can be downloaded.
  • The Movie Corpus
    The Movie Corpus contains 200 million words of data from more than 25,000 movies from the 1930s to the present. It is a great resource for studying very informal language, at least as good as corpora of actual spoken English. In addition, the Movie Corpus is much larger than any other corpus of informal English.
  • WordNet
    WordNet® is a large lexical database of English. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a distinct concept. Synsets are interlinked by means of conceptual-semantic and lexical relations. The resulting network of meaningfully related words and concepts can be navigated with the browser. WordNet is also freely and publicly available for download. WordNet's structure makes it a useful tool for computational linguistics and natural language processing.

Music

  • FMA: A Dataset For Music Analysis
    A data dump of the Free Music Archive (FMA), an interactive library of high-quality, legal audio downloads.

Politics

  • FiveThirtyEight Data
    Data and code behind some of its articles and graphics.
    https://github.com/fivethirtyeight/data

Social Sciences

  • ICPSR
    Maintains a data archive of more than 250,000 files of research in the social and behavioral sciences. It hosts 21 specialized collections of data in education, aging, criminal justice, substance abuse, terrorism, and other fields.

Transportation

  • Berkeley Deep Drive
    Explore 100,000 HD video sequences covering over 1,100 hours of driving across many different times of day, weather conditions, and driving scenarios. Video sequences also include GPS locations, IMU data, and timestamps.
    https://arxiv.org/abs/1805.04687
  • Public transport networks for research
    Browse, visualize, & download curated public transport network data for 20+ cities.
    Data formats: GTFS, network edge lists, event lists, GeoJSON, SQLite databases.
  • Last Updated: May 12, 2025 9:10 AM
  • URL: https://researchguides.dartmouth.edu/cs
Subjects: Computer Science
Tags: COMP, computer science, computing, cosc, feldberg, physical sciences

  • Copyright © 2025 Trustees of Dartmouth College