What is it about?

This work proposes characterizing the populations represented in the datasets used to train and test Natural Language Processing systems, in order to mitigate bias when those systems are deployed.

Why is it important?

With the widespread use of AI and machine learning systems that embed natural language processing, it is essential to know the characteristics of the training datasets if we are to mitigate bias when those systems are deployed.

Read the Original

This page is a summary of: Data Statements for Natural Language Processing: Toward Mitigating System Bias and Enabling Better Science, Transactions of the Association for Computational Linguistics, December 2018, The MIT Press. DOI: 10.1162/tacl_a_00041.