White Paper by the Aspen Institute: “…When people develop machine learning models for AI products and services, they iterate to improve performance.
What it means to “improve” a machine learning model depends on what you want the model to do, such as correctly transcribing an audio sample or generating a reliable summary of a long document.
Machine learning benchmarks function like standardized tests against which AI researchers and builders can score their work. Benchmarks allow us both to see whether different model tweaks improve performance on the intended task and to compare similar models against one another.
Some famous benchmarks in AI include ImageNet and the Stanford Question Answering Dataset (SQuAD).
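The mechanics described above can be sketched in a few lines: a benchmark is a frozen set of labeled examples plus a scoring rule, and any model scored on the same set can be compared. The "models" and examples below are hypothetical toy stand-ins, not part of the white paper.

```python
# Minimal sketch of a benchmark: a fixed set of labeled examples,
# a scoring function, and models compared on the same score.
# All names and data here are illustrative placeholders.
from typing import Callable, List, Tuple

# A benchmark is a frozen list of (input, expected_answer) pairs.
BENCHMARK: List[Tuple[str, str]] = [
    ("2 + 2", "4"),
    ("capital of France", "Paris"),
    ("3 * 3", "9"),
]

def score(model: Callable[[str], str]) -> float:
    """Fraction of benchmark items the model answers correctly (accuracy)."""
    correct = sum(1 for prompt, answer in BENCHMARK if model(prompt) == answer)
    return correct / len(BENCHMARK)

# Two toy "models": the tweaked version answers one more item correctly.
def baseline(prompt: str) -> str:
    return {"2 + 2": "4"}.get(prompt, "unknown")

def tweaked(prompt: str) -> str:
    return {"2 + 2": "4", "capital of France": "Paris"}.get(prompt, "unknown")

print(score(baseline))  # 1/3 of items correct
print(score(tweaked))   # 2/3 — the shared yardstick shows the tweak helped
```

Because both models are scored against the same fixed test set, the difference in scores is attributable to the model change rather than to a change in the evaluation, which is the point of a shared benchmark.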
Benchmarks are important, but their development and adoption have historically been somewhat arbitrary. The capabilities that benchmarks measure should reflect the public’s priorities for what AI tools should be and do.
We can build positive AI futures, ones that emphasize what the public wants out of these emerging technologies. As such, it’s imperative that we build benchmarks worth striving for…(More)”.