Skip to main content

The Dataset Collection

The Dataset Collection consists of large data archives from both sites and individuals.



rss RSS

9,231
RESULTS


Show sorted alphabetically

Show sorted alphabetically

SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
YFCC Datasets
YFCC Datasets
collection
0
ITEMS
17
VIEWS
collection

eye 17

Part of an August 2021 download of roughly 40 % of the Flickr images referenced in the YFCC100M dataset.
Unsorted Datasets
Unsorted Datasets
collection
125
ITEMS
11,939
VIEWS
collection

eye 11,939

Unsorted Datasets
Screenshot Compilations
Screenshot Compilations
collection
0
ITEMS
61
VIEWS
collection

eye 61

Compilations of screenshots generated automatically or semi-automatically.
OpenStreetMap datasets
OpenStreetMap datasets
collection
4,636
ITEMS
26,872
VIEWS
by OpenStreetMap contributors
collection

eye 26,872

OpenStreetMap (OSM) is a collaborative project to create a free editable map of the world. What is available? Planet.osm in XML format (current and full history), dumped weekly Planet.osm in the custom Protocolbuffer Binary Format (PBF) (current and full history), dumped weekly Metadata of all changes (changesets) in XML format, dumped weekly All discussions in XML format, dumped weekly User contributed notes, dumped daily How do I search this collection? The items in this collection are...
Topics: openstreetmap, osm, maps, data, mapping, map, dumps
NIH Data Commons
NIH Data Commons
collection
10
ITEMS
1,399
VIEWS
collection

eye 1,399

The Data Commons Pilot Phase Consortium (DCPPC) is an NIH project to tackle the challenges of data-driven and data-intensive biomedical research: The data sets are too large to download There's minimal interoperability between and across data set providers Local compute capacity often is too limited to meet dynamic research needs These challenges are preventing biomedical data from reaching its full potential in basic research, clinical, and translational medicine. DCPPC aims to improve this...
MusicBrainz Data Dumps
MusicBrainz Data Dumps
collection
851
ITEMS
7,475
VIEWS
collection

eye 7,475

The MusicBrainz Database is built on the PostgreSQL relational database engine and contains all of MusicBrainz' music metadata. This data includes information about artists, release groups, releases, recordings, works, and labels, as well as the many relationships between them. The database also contains a full history of all the changes that the MusicBrainz community has made to the data. Core data Artists Name, sort name, IPI, aliases, type, begin and end dates, disambiguation comment, MBID...
Internet Census 2012
Internet Census 2012
collection
15
ITEMS
2,829
VIEWS
by Anonymous
collection

eye 2,829

Abstract While playing around with the Nmap Scripting Engine (NSE) we discovered an amazing number of open embedded devices on the Internet. Many of them are based on Linux and allow login to standard BusyBox with empty or default credentials. We used these devices to build a distributed port scanner to scan all IPv4 addresses. These scans include service probes for the most common ports, ICMP ping, reverse DNS and SYN scans. We analyzed some of the data to get an estimation of the IP address...
Imageboard Datasets
Imageboard Datasets
collection
0
ITEMS
94
VIEWS
collection

eye 94

A collection of datasets arranged around imageboards.
Harvard Dataverse
Harvard Dataverse
collection
1
ITEMS
149
VIEWS
collection

eye 149

Dumps of DISCOGS.ORG Metadata (2008-Present)
Dumps of DISCOGS.ORG Metadata (2008-Present)
collection
145
ITEMS
5,093
VIEWS
by DISCOGS.ORG
collection

eye 5,093

This is an unofficial mirror of the DISCOGS.ORG data collection, which is located at http://www.discogs.com/data/ . Discogs, short for discographies, is a website and database of information about audio recordings, including commercial releases, promotional releases, and bootleg or off-label releases. The Discogs servers, currently hosted under the domain name discogs.com, are owned by Zink Media, Inc., and are located in Portland, Oregon, USA. Discogs is one of the largest online databases of...
C.elegans behavioural database
C.elegans behavioural database
collection
0
ITEMS
0
VIEWS
collection

eye 0

This experiment is part of the C.elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org
Academic Torrents
Academic Torrents
collection
2,264
ITEMS
601,877
VIEWS
by ACADEMICTORRENTS.COM
collection

eye 601,877

Welcome to Academic Torrents! Making 14.15TB of research data available. We've designed a distributed system for sharing enormous datasets - for researchers, by researchers. The result is a scalable, secure, and fault-tolerant repository for data, with blazing fast download speeds.
Academic Data and Datasets
Academic Data and Datasets
collection
0
ITEMS
234
VIEWS
collection

eye 234

A collection of datasets and data related to academic issues.