RESOURCES
This page lists some SPARQL endpoints and large sources of data.
"Live" SPARQL Sources
These sources make their data available over SPARQL, possibly as well as a complete download.
U.S. Census: 1 billion triples of U.S. census data.
GovTrak: Around 10 million triples containing census data for U.S. locations (including lat/long), brief biographical data for all members of congress, and mainly data for federal legislation and voting records going back five years.
DBLP Bibliography Database: 15 million triples on computer science bibliographic data.
DBPedia: Describing 1.6 million Wikipedia articles.
my.opera.com: Almost 2.7 million triples of my.opera.com data.
RDF/XML and N3 Sources
These sources provide raw downloads of large sets of RDF/XML or N3 or otherwise expose a large amount of RDF data.
DAML: Geographic locations, WordNet dictionary, airports, companies, military information, CIA World Fact Book, etc.
OpenCyc: (250000 triples).
Uniprot RDF: Protein sequence and annotation data. An enormous amount of data.
French geography: Departments, Cantons, Communes (590000 triples).
RDF Book Mashup: Wraps several book-related APIs.
