WARC-GPT: An Open-Source Tool for Exploring Web Archives Using AI | Library Innovation Lab

Open link in next tab

WARC-GPT: An Open-Source Tool for Exploring Web Archives Using AI | Library Innovation Lab

https://lil.law.harvard.edu/blog/2024/02/12/warc-gpt-an-open-source-tool-for-exploring-web-archives-with-ai/

Today we’re releasing WARC-GPT: an open-source, highly-customizable Retrieval Augmented Generation tool the web archiving community can use to explore the in...

WARC-GPT: An Open-Source Tool for Exploring Web Archives Using AI | Library Innovation Lab

Using WARC-GPT, you can ask specific questions in natural language against a collection of WARC files. Rather than relying on keyword searches and metadata filters to sort through search results, WARC-GPT provides a new starting point for search using multi-document full-text search with summarization to explore the contents of web archives. WARC-GPT lists the sources used to generate the response and relevant text excerpts, which you can use to verify the information provided and identify points of interest within a collection of web archives.