Weekly Readings #26
Published:
Terminological quagmires; (mis)interpreting interpolation; interaction is paramount.
Published:
Imitation learning using value or reward.
Published:
Trustworthy AI; unifying imitation and policy gradient; soft decision trees; SRL with dimension specialisation.
Published:
Explanatory debugging; latent canonicalisations; the perceptual user interface; automatic curriculum learning.
Published:
Explanation-based tuning; saliency maps for vision-based policies; RL with differentiable decision trees.
Published:
Constraining embeddings with side information; latent actions; RL with abstract representations and models; fuzzy state prototypes; index-free imitation.
Published:
Rule-based regularisation; DeepSHAP for augmenting GAN training; image schemas as conceptual primitives; imitating DDPG with a fuzzy rule-based system.
Published:
Symbols and cognition; robust AI through hybridisation; causal modelling via RL interventions; environment as an engineered system.
Published:
Formalising interpretation and explanation; operationally-meaningful representations; Conceptual Spaces book.
Published:
Using $Q$ for imitation; differentiable decision trees and their application to RL; interactive explanations with Glass-Box.
Published:
Imitation by coaching; GAIL; human-centric vs robot-centric; DeepMimic.
Published:
Confident execution framework; explananda as differences; online decision tree induction; hybrid AI design patterns.
Published:
Integrating knowledge and machine learning; folk psychology and intentionality; soft decision trees; conceptual spaces.
Published:
Theory-of-mind as a general solution; factual and counterfactual explanation; semantic development in neural networks; cloning without action knowledge; intuition pumps.
Published:
Goal hierarchies as rule sets; mutual information and auxiliary tasks for representation learning; model-based understanding.
Published:
Distillation and cloning; onboard swarm evolution; The Mind’s I chapters.
Published:
State representation learning in Atari; AI shortcuts and ethical debt; cloning swarms.
Published:
Model extraction; world models and representations; a MAS taxonomy.
Published:
State representation learning; emotions and qualitative regions for heuristic explanation; causal reasoning as a middle ground between statistics and mechanics; deep learning and neuroscientific discovery.
Published:
Meta learning causal relations; decomposing explanation questions; misleading explanations; the critical influence of metrics.
Published:
Modelling other agents; DAGGER; evaluating feature importance visualisations; self, soul and circular ethics.
Published:
The theory of why-questions; fidelity versus accuracy; trees and programs as RL policies; partially-interpretable hybrids.
Published:
Decision trees for state space segmentation; lightweight manual labelling as a ‘seed’ for interpretability; the dangers of homogeneous distributed control; AI and the climate crisis.
Published:
This week didn’t involve very much reading since I focused instead on my practical investigation of the traffic coordination problem. Nonetheless, I encountered a variety of fascinating ideas.
Published:
Approximately three weeks in, I’m starting to work on a case study project that will allow me to explore some of the key ideas around multi-agent explainability – collision avoidance within a population of autonomous vehicles on road / track networks. As a result, more of my reading this week has focused specifically on the multi-agent context.
Published:
As it stands I’m precisely 13 days into my PhD, which means a lot of reading, and I thought I’d kick this blog off with a weekly rolling ‘diary’ of things I read, watch and otherwise consume which may have some influence on my PhD topic. Most of the papers contain words pertaining to explanation, because I did a massive scrape of papers with that keyword. I figured that would be a reasonable start.