Exploring Fair Principles for Ethical AI

Published on April 24, 2023 by Iason Gabriel and Kevin McKee

Drawing from philosophy to identify fair principles for ethical AI

As artificial intelligence (AI) grows more capable and more deeply embedded in daily life, the need for ethical guidelines governing its use becomes pressing. Central questions include which values steer AI, whose values they are, and how they are chosen.

Principles shape how an AI system makes decisions, much as they guide human behavior. In a recent paper published in the Proceedings of the National Academy of Sciences, the authors explored how a philosophical concept known as the “veil of ignorance” can be applied to identify fair principles for AI behavior.

In the experiments, participants who reasoned behind the veil of ignorance were more inclined to choose principles that benefited the most disadvantaged individuals. This suggests that setting aside personal biases and focusing on fairness can lead to more equitable outcomes.

A Tool for Fairer Decision-Making

Aligning AI systems with human values has been a prominent goal for researchers. However, the diversity of human values raises the challenge of selecting the right principles to govern AI. Drawing inspiration from the veil of ignorance, a philosophical concept introduced by John Rawls, can offer a solution to this dilemma.

This approach encourages individuals to make decisions based on fairness rather than personal gain. By withholding information that could bias a decision, such as a person’s own position in the group, the veil of ignorance promotes impartiality in the decision-making process.

Maximize Productivity or Help the Most Disadvantaged?

In an online game, participants chose between two principles to guide an AI system’s behavior: maximizing productivity or helping the most disadvantaged group members. Those placed behind the veil of ignorance consistently favored the principle of helping the disadvantaged, indicating a preference for fairness.

Results showed that participants who were unaware of their position in the group were more likely to prioritize fairness over personal benefits when making choices. This underscores the role of impartial decision-making in achieving equitable outcomes.
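The contrast between the two principles can be made concrete with a toy sketch. Everything in it is hypothetical and chosen only for illustration: the `Member` class, the numbers, and the two selection rules are assumptions, not the game or the method used in the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Member:
    name: str
    holdings: int        # resources the member already has (made-up units)
    gain_if_helped: int  # extra group output if the AI helps this member (made-up)


def maximize_productivity(members):
    """Hypothetical rule: help whoever adds the most to the group total."""
    return max(members, key=lambda m: m.gain_if_helped)


def prioritize_disadvantaged(members):
    """Hypothetical rule: help whoever currently has the least."""
    return min(members, key=lambda m: m.holdings)


# Illustrative group: the best-off member is also the most productive one.
group = [
    Member("A", holdings=10, gain_if_helped=5),
    Member("B", holdings=6, gain_if_helped=3),
    Member("C", holdings=2, gain_if_helped=1),
]

print(maximize_productivity(group).name)     # A
print(prioritize_disadvantaged(group).name)  # C
```

In this made-up group the two principles pick different members, which is the kind of trade-off a person must weigh without knowing whether they would be A or C.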

Fairer Principles for AI

Creating AI systems that benefit all individuals requires a comprehensive understanding of ethical principles. The veil of ignorance can serve as a valuable tool in guiding the selection of principles that prioritize fairness and equity in AI decision-making.

Selecting principles for their fairness rather than for personal gain offers one way to align AI systems with human values. Further exploration and application of the veil of ignorance can contribute to the development of ethical AI systems that serve the greater good.

For more insights on DeepMind’s approach to safety and ethics, visit their website.

Paper Authors

Laura Weidinger*, Kevin McKee*, Richard Everett, Saffron Huang, Tina Zhu, Martin Chadwick, Christopher Summerfield, Iason Gabriel

*Laura Weidinger and Kevin McKee are joint first authors

Incorporating Human Values into AI