Navigation and service

The German National Library's web archive contains archival copies of websites on selected subjects, institutions or events. Our reading rooms also provide access to websites in the .DE domain which the Internet Archive archives, filters and makes available as a separate collection. The German National Library also contributes to the cooperative website collections of the International Internet Preservation Consortium (IIPC).

German National Library's web archive

The German National Library’s legal collection mandate includes the collection, indexing and archiving of websites. Using an automated process known as web harvesting, we create snapshots of the websites, index them in our catalogue and archive them in our web archive.

We collect websites according to specific formal and content-related criteria. In our web archive, you will find the websites of federal authorities and universities, blogs, topics such as history, literature and music, and websites for events such as the federal elections or the 500th anniversary of the Reformation in 2017. We create joint collections in collaboration with libraries obligated to maintain web archives at the regional level. One example of this is the Thuringia Web Archive.

Our web archive is structured by subject category and has a full-text search function. You can also access the content of our web archive through the catalogue.

On copyright grounds, it is usually only possible to access the collected websites in our reading rooms in Leipzig and Frankfurt am Main. However, certain web archive content for which we have the right holder’s consent can also be used outside the reading rooms.

Archiving German-language Twitter

On 20 February 2023, an initiative launched by the Science Data Center for Literature and the German National Library issued a call for a concerted effort to download as many German-language Tweets as possible from the Twitter archive. The goal was to create as complete an archive of German-language Tweets as possible using a crowdsourcing initiative. The German National Library has made archive servers available to facilitate permanent storage. More

"Sustainable archiving of social media data - Twitter and beyond"

Conference at the German National Library Frankfurt am Main on 19 and 20 March 2024

Archiving, cataloguing and providing dynamic data from social media present challenges which affect researchers, research institutions, libraries and archives in equal measure, and the best way to solve these problems is through collaboration and partnership. This requires wide-ranging efforts which would be impossible for a single data community or discipline. A conference at the German National Library Frankfurt am Main on 19 and 20 March 2024 will explore these questions. More

Call for Participation: Twitter Datasprint

Do you have research questions for which you are keen to analyse large volumes of German-language tweets? Are Twitter data of interest for your research in the humanities or social, natural or life sciences? Or do you have a passion for visualising social media data?

Then come to our two-day data sprint on 21 and 22 March 2024.

More

Frequently asked questions (FAQ)

How does the German National Library collect websites?

Selected websites are collected using an automated procedure (web harvesting). The harvester starts from a certain web address and stores the web page at this address along with all the linked content in the domain. This procedure creates an archival copy of the website which we refer to as snapshot.

How often is a website harvested?

As the content of websites is constantly changing, the archival copies have to be updated at regular intervals. We decide when and how often we harvest a website on an individual basis. At present, our standard procedure is to harvest the websites every six months.

Do I have to report my own website to the German National Library?

At present, we select websites according to specific formal and content-related criteria. This means you do not have to report your website to us.

Are Facebook, X etc. also harvested?

In individual cases, we also collect selected social media pages on certain topics or events. However, we do not systematically store the content of social networks at present.

Internet Archive’s web archive

In our reading rooms in Leipzig and Frankfurt am Main, we also offer a special service allowing you to access the Internet Archive operated by the organisation of that name in San Francisco. The snapshots of websites in the .DE domain which are stored in the Internet Archive form a separate collection that can be accessed here and searched using search terms or URLs.

Web archive of international thematic and event-based collections

The German National Library contributes to the cooperative collections of the International Internet Preservation Consortium (IIPC). The IIPC’s member organisations collate websites on globally relevant topics such as climate change, the refugee crisis and the coronavirus pandemic, and on events such as European elections and the Olympic Games. The result is an international perspective on the latest events and the way in which they are depicted on the internet. These thematic collections are freely accessible from anywhere in the world.

Contact

Online publication service

np-info@dnb.de

Phone +49 341 2271-282

Last changes: 20.12.2023
Short-URL: https://www.dnb.de/webarchiv

to the top