perma_cc
perma_cc
collection
3,129,222
ITEMS
1.6B
VIEWS
collection

eye 1.6B

alexa_2007
alexa_2007
collection
7,636
ITEMS
1.7B
VIEWS
collection

eye 1.7B

this data is currently not publicly accessible.
alexa_2006
alexa_2006
collection
6,507
ITEMS
1B
VIEWS
collection

eye 1B

this data is currently not publicly accessible.
alexa_web_2009
alexa_web_2009
collection
3,080
ITEMS
676.9M
VIEWS
collection

eye 676.9M

this data is currently not publicly accessible.
alexa_web_2010
alexa_web_2010
collection
2,994
ITEMS
630.8M
VIEWS
collection

eye 630.8M

this data is currently not publicly accessible.
38_crawl
38_crawl
collection
1,387
ITEMS
408.6M
VIEWS
collection

eye 408.6M

this data is currently not publicly accessible.
Around The World Crawl
Around The World Crawl
collection
2,150
ITEMS
443.5M
VIEWS
collection

eye 443.5M

Data crawled by Sloan Foundation on behalf of Internet Archive
51_crawl
51_crawl
collection
1,138
ITEMS
428.2M
VIEWS
collection

eye 428.2M

this data is currently not publicly accessible.
52_crawl
52_crawl
collection
2,589
ITEMS
377.3M
VIEWS
collection

eye 377.3M

this data is currently not publicly accessible.
26_crawl
26_crawl
collection
1,466
ITEMS
266.5M
VIEWS
collection

eye 266.5M

this data is currently not publicly accessible.
35_crawl
35_crawl
collection
1,179
ITEMS
211.1M
VIEWS
collection

eye 211.1M

this data is currently not publicly accessible.
29_crawl
29_crawl
collection
1,568
ITEMS
207.9M
VIEWS
collection

eye 207.9M

this data is currently not publicly accessible.
Alexa Crawls EA
Alexa Crawls EA
collection
1,315
ITEMS
178M
VIEWS
collection

eye 178M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
Topic: crawldata
Alexa Crawls DY
Alexa Crawls DY
collection
1,326
ITEMS
176.7M
VIEWS
collection

eye 176.7M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
alexa_ed
alexa_ed
collection
1,185
ITEMS
178.3M
VIEWS
collection

eye 178.3M

this data is currently not publicly accessible.
Alexa Crawls DU
Alexa Crawls DU
collection
946
ITEMS
143.4M
VIEWS
collection

eye 143.4M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
32_crawl
32_crawl
collection
1,045
ITEMS
136.2M
VIEWS
collection

eye 136.2M

this data is currently not publicly accessible.
alexa_1999
alexa_1999
collection
243
ITEMS
157.1M
VIEWS
collection

eye 157.1M

this data is currently not publicly accessible.
Alexa Crawls DQ
Alexa Crawls DQ
collection
887
ITEMS
128.1M
VIEWS
collection

eye 128.1M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
Alexa Crawls DO
Alexa Crawls DO
collection
493
ITEMS
137.9M
VIEWS
collection

eye 137.9M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
alexa_dt
alexa_dt
collection
787
ITEMS
122.5M
VIEWS
collection

eye 122.5M

this data is currently not publicly accessible.
Alexa Crawls DS
Alexa Crawls DS
collection
919
ITEMS
122.3M
VIEWS
collection

eye 122.3M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
Alexa Crawls DR
Alexa Crawls DR
collection
759
ITEMS
126.3M
VIEWS
collection

eye 126.3M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
alexa_dw
alexa_dw
collection
958
ITEMS
129.8M
VIEWS
collection

eye 129.8M

this data is currently not publicly accessible.
alexa_dm
alexa_dm
collection
371
ITEMS
110.8M
VIEWS
collection

eye 110.8M

this data is currently not publicly accessible.
42_crawl
42_crawl
collection
384
ITEMS
132.1M
VIEWS
collection

eye 132.1M

this data is currently not publicly accessible.
web_el_2008
collection
1,705
ITEMS
142.9M
VIEWS
collection

eye 142.9M

This data is currently not publicly accessible.
alexa_dv
alexa_dv
collection
867
ITEMS
109.4M
VIEWS
collection

eye 109.4M

this data is currently not publicly accessible.
alexa_dn
alexa_dn
collection
421
ITEMS
107.1M
VIEWS
collection

eye 107.1M

this data is currently not publicly accessible.
Green Crawl
Green Crawl
collection
148
ITEMS
123M
VIEWS
collection

eye 123M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible.
National Library of Australia Crawl
collection
4,658
ITEMS
122.6M
VIEWS
collection

eye 122.6M

National Library of Australia crawl. This data is currently not publicly accessible.
web_el_2010
collection
1,022
ITEMS
101.3M
VIEWS
collection

eye 101.3M

This data is currently not publicly accessible.
Alexa EC
Alexa EC
collection
527
ITEMS
99.2M
VIEWS
collection

eye 99.2M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
36_crawl
36_crawl
collection
761
ITEMS
99.4M
VIEWS
collection

eye 99.4M

this data is currently not publicly accessible.
44_crawl
44_crawl
collection
471
ITEMS
92.8M
VIEWS
collection

eye 92.8M

this data is currently not publicly accessible.
Alexa Crawls DG
Alexa Crawls DG
collection
228
ITEMS
95.2M
VIEWS
collection

eye 95.2M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
Alexa Web 2008
Alexa Web 2008
collection
524
ITEMS
92M
VIEWS
collection

eye 92M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
bnf_2008
collection
715
ITEMS
98.8M
VIEWS
collection

eye 98.8M

this data is currently not publicly accessible.
39_crawl
39_crawl
collection
381
ITEMS
88M
VIEWS
collection

eye 88M

this data is currently not publicly accessible.
41_crawl
41_crawl
collection
414
ITEMS
73.8M
VIEWS
collection

eye 73.8M

this data is currently not publicly accessible.
nls_2009
collection
874
ITEMS
70.3M
VIEWS
collection

eye 70.3M

this data is currently not publicly accessible.
nls_2010
collection
972
ITEMS
64.2M
VIEWS
collection

eye 64.2M

this data is currently not publicly accessible.
alexa_dk
alexa_dk
collection
202
ITEMS
57.8M
VIEWS
collection

eye 57.8M

this data is currently not publicly accessible.
Alexa Crawls DE
Alexa Crawls DE
collection
138
ITEMS
51.2M
VIEWS
collection

eye 51.2M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
50_crawl
50_crawl
collection
410
ITEMS
50.1M
VIEWS
collection

eye 50.1M

this data is currently not publicly accessible.
nsdlweb
collection
91
ITEMS
53.4M
VIEWS
collection

eye 53.4M

this data is currently not publicly accessible.
Away from Keyboard: Aaron H. Swartz
collection
340
ITEMS
53.6M
VIEWS
collection

eye 53.6M

Aaron H. Swartz (November 8, 1986 – January 11, 2013) was an American computer programmer, writer, archivist, political organizer, and Internet activist. Swartz co-authored the "RSS 1.0" specification of RSS, and built the Web site framework web.py and the architecture for the Open Library. Swartz also focused on sociology, civic awareness and activism. In 2010 he was a member of the Harvard University Center for Ethics. He cofounded the online group Demand Progress (which recently...
JSTOR Early Journal Content
JSTOR Early Journal Content
collection
452,031
ITEMS
61.6M
VIEWS
collection

eye 61.6M

The JSTOR Early Journal Content is a selection of journal materials published prior to 1923 in the United States and prior to 1870 elsewhere. It includes discourse and scholarship in the arts and humanities, economics and politics, and in mathematics and other sciences - nearly 500,000 articles from more than 200 journals. It was uploaded to the Internet Archive in 2013. JSTOR Early Journal Content has been freely available at www.jstor.org since September 2011. Early Journal Content is updated...
Alexa Crawls DD
Alexa Crawls DD
collection
82
ITEMS
38M
VIEWS
collection

eye 38M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
43_crawl
43_crawl
collection
205
ITEMS
40.1M
VIEWS
collection

eye 40.1M

this data is currently not publicly accessible.
Alexa Crawls 2000
Alexa Crawls 2000
collection
62
ITEMS
39.2M
VIEWS
collection

eye 39.2M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
33_crawl
33_crawl
collection
308
ITEMS
42.5M
VIEWS
collection

eye 42.5M

this data is currently not publicly accessible.
nla_2008
collection
631
ITEMS
40.3M
VIEWS
collection

eye 40.3M

this data is currently not publicly accessible.
Alexa Sarah Crawl
Alexa Sarah Crawl
collection
93
ITEMS
34.1M
VIEWS
collection

eye 34.1M

Crawl data donated by Alexa Internet. This data is currently not publicly accessible
bnf_2007
collection
321
ITEMS
41M
VIEWS
collection

eye 41M

this data is currently not publicly accessible.
collection

eye 38.9M

Topics: bne, spain, web, 2013
nla_2009
collection
568
ITEMS
34.4M
VIEWS
collection

eye 34.4M

this data is currently not publicly accessible.
nl_sweden_2010
collection
309
ITEMS
35.3M
VIEWS
collection

eye 35.3M

this data is currently not publicly accessible.
bnf_2005
collection
265
ITEMS
30.5M
VIEWS
collection

eye 30.5M

this data is currently not publicly accessible.
nla_2007
collection
371
ITEMS
30.4M
VIEWS
collection

eye 30.4M

this data is currently not publicly accessible.
nla_2006
collection
384
ITEMS
29.7M
VIEWS
collection

eye 29.7M

this data is currently not publicly accessible.
27_crawl
27_crawl
collection
114
ITEMS
26.8M
VIEWS
collection

eye 26.8M

this data is currently not publicly accessible.
bnf_2006
collection
323
ITEMS
26.8M
VIEWS
collection

eye 26.8M

this data is currently not publicly accessible.
nla_2005
collection
175
ITEMS
24.9M
VIEWS
collection

eye 24.9M

this data is currently not publicly accessible.
web_clo
collection
109
ITEMS
22.2M
VIEWS
collection

eye 22.2M

Crawl performed by Internet Archive. This data is currently not publicly accessible.
web_el_2006
collection
345
ITEMS
21.7M
VIEWS
collection

eye 21.7M

This data is currently not publicly accessible.
Bowling Green State University 78rpm Collection
Bowling Green State University 78rpm Collection
collection
56,096
ITEMS
1.6M
VIEWS
collection

eye 1.6M

This collection of 78rpm records was generously donated to the Internet Archive by Bowling Green State University, to gain digital access to their great collection of records. The collection contains many unique recordings including jazz, children's and folk music. Link to donation item: https://archive.org/details/Bowling_Green_COL_1147
The Molly Astrid Organization
The Molly Astrid Organization
collection
336
ITEMS
11.7M
VIEWS
collection

eye 11.7M

Archive-It Partner 88: The Molly Astrid Organization. This data is currently not publicly accessible.
Usenet Archive
Usenet Archive
collection
79,450
ITEMS
2.1M
VIEWS
collection

eye 2.1M

Usenet is a worldwide distributed Internet discussion system. It was developed from the general purpose UUCP dial-up network architecture. Duke University graduate students Tom Truscott and Jim Ellis conceived the idea in 1979 and it was established in 1980. Users read and post messages (called articles or posts, and collectively termed news) to one or more categories, known as newsgroups. Usenet resembles a bulletin board system (BBS) in many respects, and is the precursor to Internet forums...
collection

eye 4.4M

The JSTOR Early Journal Content is a selection of journal materials published prior to 1923 in the United States and prior to 1870 elsewhere. It includes discourse and scholarship in the arts and humanities, economics and politics, and in mathematics and other sciences - nearly 500,000 articles from more than 200 journals. It was uploaded to the Internet Archive in 2013. JSTOR Early Journal Content has been freely available at www.jstor.org since September 2011. Early Journal Content is updated...
web_wkl
collection
203
ITEMS
13.9M
VIEWS
collection

eye 13.9M

Crawl performed by Internet Archive. This data is currently not publicly accessible.
Boston Public Library 78rpm Collection
Boston Public Library 78rpm Collection
collection
48,844
ITEMS
11.6M
VIEWS
collection

eye 11.6M

The Boston Public Library (BPL) sound collection includes hundreds of thousands of audio recordings in a variety of historical formats, including wax cylinders, 78 rpms, and LPs. The recordings span many genres, including classical, pop, rock, jazz, and opera – from 78s produced in the early 1900s to LPs from the 1980s. These recordings have never been circulated and were in storage for several decades, uncataloged and inaccessible to the public. By collaborating with the Internet Archive,...
30_crawl
30_crawl
collection
48
ITEMS
12.5M
VIEWS
collection

eye 12.5M

this data is currently not publicly accessible.
Data crawled by Institut national de l’audiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
Topic: crawldata
nlnz_2010
collection
167
ITEMS
10.9M
VIEWS
collection

eye 10.9M

this data is currently not publicly accessible.
collection

eye 12M

Crawl of the Ireland web domain, .ie, performed for the National Library of Ireland in 2007. This data is currently not publicly accessible.
Data crawled by Institut national de l’audiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
Topic: crawldata
David Rumsey Map Collection
David Rumsey Map Collection
collection
111,792
ITEMS
1.3M
VIEWS
collection

eye 1.3M

The David Rumsey Map Collection was founded in 1985 and went online in 1999 at davidrumsey.com. The collection includes over 200,000 rare 16th through 21st century maps of the entire World and parts of the Universe. 115,000 maps and related items are available online and 60,000 of those have been georeferenced. Rumsey has been donating the physical map collection and the digital database made from it to the David Rumsey Map Center at Stanford Libraries since 2008. The map center opened in...
Data crawled by National Endowment for the Humanities and JISC on behalf of Internet Archive from Fri Aug 08 00:17:40 PDT 2008 to Thu Jun 26 05:29:33 PDT 2008
Topic: crawldata
StationList-EoY07
StationList-EoY07
collection
350
ITEMS
8.5M
VIEWS
by Jeff Ubois
collection

eye 8.5M

Test crawl of station list
Leif Druedahl Collection
Leif Druedahl Collection
collection
48,714
ITEMS
4M
VIEWS
collection

eye 4M

Donation of Leif Druedahl's 78'er Klubben 78's. Collected primarily in Denmark and throughout Scandinavia. 102 boxes. Includes 78's, photographic negatives, and 5 boxes of letterpress printed cardboard record sleeves. Danish, German, and English popular and classical music. Link to donation item:  https://archive.org/details/leifdruedahl2018
Data crawled by Institut national de l’audiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
Topic: crawldata
Data crawled by Institut national de l’audiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
Topic: crawldata
Data crawled by Institut national de l’audiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
Topic: crawldata
Wiretapping and the National Security Agency
Wiretapping and the National Security Agency
collection
2,061
ITEMS
4.9M
VIEWS
collection

eye 4.9M

A collection of documents and web sites relating to wiretapping, encryption, and the National Security Agency
Topic: Wiretapping
Data crawled by Institut national de l’audiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
Topic: crawldata
Data crawled by Internet Archive on behalf of Internet Archive from Sun Aug 29 05:03:17 PDT 2004 to Sat Mar 05 01:09:59 PDT 2005
Topic: crawldata
Newspaper Archive
Newspaper Archive
collection
264
ITEMS
79,241
VIEWS
collection

eye 79,241

Data crawled by Internet Archive on behalf of Internet Archive from Wed Nov 07 07:11:52 PDT 2007 to Mon Jan 22 21:11:39 PDT 2007
Topic: crawldata
Data crawled by Internet Archive on behalf of Internet Archive from Tue May 01 06:35:30 PDT 2007 to Sun Oct 28 07:03:43 PDT 2007
Topic: crawldata
Data crawled by Internet Archive on behalf of Internet Archive from Fri Nov 01 06:23:33 PDT 2002 to Tue Nov 19 23:24:07 PDT 2002
Topic: crawldata
Data crawled by Institut national de l’audiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
Topic: crawldata
Data crawled by Alexa Internet on behalf of Alexa Internet from Fri Apr 01 01:18:02 PDT 2005 to Sat Mar 05 10:41:36 PDT 2005
Topic: crawldata
Circleville Herald Newspaper Archive
Circleville Herald Newspaper Archive
collection
19
ITEMS
25,346
VIEWS
collection

eye 25,346

Data crawled by Institut national de l’audiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
Topic: crawldata
38_crawl
web

eye 1.3M

Data crawled by on behalf of from Mon Feb 19 09:06:59 PDT 2007 to Mon Feb 19 10:11:22 PDT 2007
Topic: crawldata
38_crawl
web

eye 1.4M

Data crawled by on behalf of from Mon Feb 19 10:11:42 PDT 2007 to Mon Feb 19 11:14:00 PDT 2007
Topic: crawldata
Data crawled by Internet Archive on behalf of Internet Archive from Sat Sep 18 12:46:38 PDT 2004 to Thu May 05 09:34:36 PDT 2005
Topic: crawldata
Daniel McNeil Collection
Daniel McNeil Collection
collection
16,690
ITEMS
6M
VIEWS
collection

eye 6M

78rpm shellac discs donated by Daniel McNeil to the Archive of Contemporary Music and digitized by George Blood, LP for the Internet Archive. The collection of 22,359 ten- and twelve-inch seventy-eights is one of the first that ARC worked with from beginning to end, and what a pleasure. Rare for us, the discs were all carefully arranged on shelves by label, and then by manufacturer number. Even better, the weeks we spent packing up meant cake and coffee every day at 4. Here’s Mr. McNeil’s...
Source: 78
Data crawled by Alexa Internet on behalf of Alexa Internet from Sat Jun 02 21:47:31 PDT 2001 to Sun Jun 03 08:08:36 PDT 2001
Topic: crawldata