First-year PhD student in the department of Engineering Mathematics at the University Bristol, supervised by Jonathan Lawry and Arthur Richards and supported by Thales. My project is tentatively entitled Explainable AI for Interacting Autonomous Agents. I am a graduate of the UK’s top-ranked general engineering degree, holder of an MSc in Machine Learning, and recipient of an RAEng Engineering Leaders Scholarship.
My PhD research explores how we might begin to understand and explain the policies of black-box autonomous agents, whose internal mechanisms and representations may be very different from our own, with a particular view to revealing the biases and flaws in their decision-making. Critical to this question are the putative trade-off between comprehensibility and performance of machine learning models, and the thorny relationship between correlation and causation in observed data whose generative origins are unknown.
I’m currently looking at using decision trees to ‘clone’ black-box policies and offer factual, counterfactual and narrative explanations of their behaviour. Check out this pre-print for some early thoughts and results.