Blog posts

2020

Weekly Readings #17

12 minute read

Published:

Using $Q$ for imitation; differentiable decision trees and their application to RL; interactive explanations with Glass-Box.

Weekly Readings #16

11 minute read

Published:

Imitation by coaching; GAIL; human-centric vs robot-centric; DeepMimic.

Weekly Readings #15

12 minute read

Published:

Confident execution framework; explananda as differences; online decision tree induction; hybrid AI design patterns.

Weekly Readings #14

15 minute read

Published:

Integrating knowledge and machine learning; folk psychology and intentionality; soft decision trees; conceptual spaces.

Weekly Readings #13

14 minute read

Published:

Theory-of-mind as a general solution; factual and counterfactual explanation; semantic development in neural networks; cloning without action knowledge; intuition pumps.

2019

Weekly Readings #12

14 minute read

Published:

Goal hierarchies as rule sets; mutual information and auxiliary tasks for representation learning; model-based understanding.

Weekly Readings #11

11 minute read

Published:

Distillation and cloning; onboard swarm evolution; The Mind’s I chapters.

Weekly Readings #10

7 minute read

Published:

State representation learning in Atari; AI shortcuts and ethical debt; cloning swarms.

Weekly Readings #9

11 minute read

Published:

Model extraction; world models and representations; a MAS taxonomy.

Weekly Readings #8

16 minute read

Published:

State representation learning; emotions and qualitative regions for heuristic explanation; causal reasoning as a middle ground between statistics and mechanics; deep learning and neuroscientific discovery.

Weekly Readings #7

8 minute read

Published:

Meta-learning causal relations; decomposing explanation questions; misleading explanations; the critical influence of metrics.

Weekly Readings #6

9 minute read

Published:

Modelling other agents; DAGGER; evaluating feature importance visualisations; self, soul and circular ethics.

Weekly Readings #5

18 minute read

Published:

The theory of why-questions; fidelity versus accuracy; trees and programs as RL policies; partially-interpretable hybrids.

Weekly Readings #4

11 minute read

Published:

Decision trees for state space segmentation; lightweight manual labelling as a ‘seed’ for interpretability; the dangerous of homogenous distributed control; AI and the climate crisis.

Weekly Readings #3

8 minute read

Published:

This week didn’t involve very much reading since I focused instead on my practical investigation of the traffic coordination problem. Nonetheless, I encountered a variety of fascinating ideas.

Weekly Readings #2

23 minute read

Published:

Approximately three weeks in, I’m starting to work on a case study project that will allow me to explore some of the key ideas around multi-agent explainability – collision avoidance within a population of autonomous vehicles on road / track networks. As a result, more of my reading this week has focused specifically on the multi-agent context.

Weekly Readings #1

19 minute read

Published:

As it stands I’m precisely 13 days into my PhD, which means a lot of reading, and I thought I’d kick this blog off with a weekly rolling ‘diary’ of things I read, watch and otherwise consume which may have some influence on my PhD topic. Most of the papers have words pertaining to explanation in there, and that’s because I did a massive scrape of papers with that keyword. I figured that would be a reasonable start.

Start Here

3 minute read

Published:

A Go player makes a weak move that loses her the game. A company’s hiring policy appears to show gender biases. A person crashes their car. Our immediate question in each case is why? Our social norms, laws and fundamental ethical principles rely on the assumption that decisions and actions have explanations, that give insight into their context, causes and consequences.