Paper by Jiarui Liu, Wenkai Li, Zhijing Jin, Mona Diab: “In an era of model and data proliferation in machine learning/AI especially marked by the rapid advancement of open-sourced technologies, there arises a critical need for standardized consistent documentation. Our work addresses the information incompleteness in current human-generated model and data cards. We propose an automated generation approach using Large Language Models (LLMs). Our key contributions include the establishment of CardBench, a comprehensive dataset aggregated from over 4.8k model cards and 1.4k data cards, coupled with the development of the CardGen pipeline comprising a two-step retrieval process. Our approach exhibits enhanced completeness, objectivity, and faithfulness in generated model and data cards, a significant step in responsible AI documentation practices ensuring better accountability and traceability…(More)”.
Automatic Generation of Model and Data Cards: A Step Towards Responsible AI
How to contribute:
Did you come across – or create – a compelling project/report/book/app at the leading edge of innovation in governance?
Share it with us at info@thelivinglib.org so that we can add it to the Collection!
About the Curator
Get the latest news right in you inbox
Subscribe to curated findings and actionable knowledge from The Living Library, delivered to your inbox every Friday
Related articles
artificial intelligence, DATA
The geography of AI compute: Mapping what is available and where
Posted in November 4, 2025 by Stefaan Verhulst
artificial intelligence, DATA
Experts find flaws in hundreds of tests that check AI safety and effectiveness
Posted in November 3, 2025 by Stefaan Verhulst
artificial intelligence, DATA
A home genome project: How a city learning cohort can create AI systems for optimizing housing supply
Posted in November 3, 2025 by Stefaan Verhulst