Based on a collaborative effort between the research project BodyCapital and the Bibliothèque Nationale de France (BnF), we present a two-step archiving process and analysis of audiovisual web content related to food and health history, investigating how audiovisuals have contributed to shaping our eating habits. The first step involved a web crawl with Heritrix, targeting 158 identified seed URLs compiled based on BnF science & technology lists and URLs identified by the research group. The crawl harvested 1,067,159 URLs. A content analysis identified 1,718 videos in our corpus. Content mapping and the identification of links to YouTube videos were performed, leading to the second step involving a focused collection of 34 YouTube channels harvesting 24,427 videos (2.4 TB) to be analyzed.
University of Strasbourg, France - ORCID: 0000-0003-4756-1844
University of Paris, France - ORCID: 0000-0002-9384-7360
University of Strasbourg, France - ORCID: 0009-0007-1730-2135
Chapter Title
Food, cooking and health in a selected corpus of websites and connected YouTube channels in France. Collecting and archiving the audiovisual web
Authors
Christian Bonah, Solène Lellinger, Caroline Sala
Language
English
DOI
10.36253/979-12-215-0413-2.24
Peer Reviewed
Publication Year
2024
Copyright Information
© 2024 Author(s)
Content License
Metadata License
Book Title
Exploring the Archived Web during a Highly Transformative Age
Book Subtitle
Proceedings of the 5th international RESAW conference, Marseille, June 2023
Editors
Sophie Gebeil, Jean-Christophe Peyssard
Peer Reviewed
Number of Pages
362
Publication Year
2024
Copyright Information
© 2024 Author(s)
Content License
Metadata License
Publisher Name
Firenze University Press
DOI
10.36253/979-12-215-0413-2
ISBN Print
979-12-215-0412-5
eISBN (pdf)
979-12-215-0413-2
eISBN (xml)
979-12-215-0414-9
Series Title
Proceedings e report
Series ISSN
2704-601X
Series E-ISSN
2704-5846