Auditing Black Box Models
- Where: FAT*
- When: 2018-02-23, 4-5PM EST, Vanderbilt Hall 204
- Who: Suresh Venkatasubramanian, Carlos Scheidegger, Charlie Marx
Models learned through machine learning can be hard to interpret. A model that takes many inputs and has many parameters might depend on those inputs in complicated ways. This makes it hard to know whether, for example,
- the model indirectly depends on protected attributes that its users would prefer it not use (for example, race via zipcode); a minimal sketch of this effect appears after this list.
- the model depends heavily on input attributes that domain knowledge suggests should not be important factors (for example, a system that predicts the output of a chemical reaction but appears to be influenced by the time at which the reaction is run).
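The first concern is easy to reproduce. The short sketch below (synthetic data and scikit-learn; the feature names, numbers, and model choice are illustrative assumptions, not tutorial material) trains a model that never sees the protected attribute, yet whose predictions still track it through a correlated proxy.

```python
# Synthetic illustration of indirect influence: the model never sees "race",
# but a correlated proxy ("zipcode") lets it reconstruct much of that signal.
# Feature names, numbers, and the model choice are illustrative assumptions.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 5000

race = rng.integers(0, 2, n)                      # hidden protected attribute
zipcode = 3 * race + rng.integers(0, 3, n)        # proxy correlated with race
income = rng.normal(50 + 10 * race, 5, n)         # also correlated with race
label = (income + rng.normal(0, 5, n) > 55).astype(int)

X = np.column_stack([zipcode, income])            # note: race is NOT an input
model = LogisticRegression(max_iter=1000).fit(X, label)
pred = model.predict(X)

# The predictions still track the protected attribute through the proxies.
print("P(pred = 1 | race = 0):", pred[race == 0].mean())
print("P(pred = 1 | race = 1):", pred[race == 1].mean())
```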
This tutorial will teach attendees to use a software library developed by the presenters, working through a sequence of simple examples in a Jupyter notebook. The presenters will focus on the current strengths and limitations of the method, and on how attendees can apply it to their own datasets.
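Roughly, the auditing idea developed in the papers below is to measure the indirect influence of a feature by obscuring it (removing its information from every column, including correlated proxies) and observing how much the black-box model's accuracy drops. The sketch below is a crude illustration of that idea, not the BlackBoxAuditing API: the function names, the simple quantile-based repair, and the assumption that the obscured feature takes a few discrete values are all choices made here for brevity.

```python
# A crude sketch of indirect-influence auditing: obscure one feature by
# removing its information from every column, then compare the black-box
# model's accuracy before and after. Not the BlackBoxAuditing API; names
# and the simple full "repair" below are illustrative assumptions.
import numpy as np

def repair_feature(col, group):
    """Map each value to the pooled value at its within-group quantile,
    so the repaired column carries (almost) no information about `group`."""
    repaired = col.astype(float)
    pooled_sorted = np.sort(col)
    for g in np.unique(group):
        idx = np.where(group == g)[0]
        ranks = col[idx].argsort().argsort()          # within-group ranks
        quantiles = ranks / max(len(idx) - 1, 1)      # ranks scaled to [0, 1]
        pooled_pos = (quantiles * (len(col) - 1)).astype(int)
        repaired[idx] = pooled_sorted[pooled_pos]
    return repaired

def indirect_influence(model, X, y, f):
    """Accuracy drop once column `f` is obscured everywhere in X.
    Assumes column `f` takes a small number of discrete values."""
    base_acc = (model.predict(X) == y).mean()
    group = X[:, f]
    X_obscured = np.column_stack(
        [repair_feature(X[:, j], group) for j in range(X.shape[1])])
    return base_acc - (model.predict(X_obscured) == y).mean()
```

On the synthetic data above, indirect_influence(model, X, label, 0) would report how much accuracy the model loses once zipcode, and everything correlated with it, is obscured; this is the kind of question the tutorial's method is designed to answer.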
Materials
- Get the talk slides, in PDF or PPTX.
- Follow the installation instructions.
- Get the Jupyter Notebook for the hands-on demo.
Papers
Suresh, Carlos, and Charlie are presenting the tutorial, but the techniques themselves were developed in collaboration with many other people; the work is described in these papers:
- Feldman et al., Certifying and Removing Disparate Impact, KDD 2015.
- Adler et al., Auditing Black-Box Models for Indirect Influence, Knowledge and Information Systems, 54(1):95–122, 2018.
BlackBoxAuditing repository
The source code for the BlackBoxAuditing library itself is currently hosted on a GitHub repository.