OxRML · University of Oxfordest. 2023 · Oxford Internet Institute

The science of evaluating AI, and securing it for use where stakes are real.

Reasoning with Machines Lab is an Oxford research lab. We study how language models and agentic systems behave under pressure, and how to deploy them where the stakes are real.

OxRML-Reason · v1.4Q1 2026 release
GPT-5
71.4
Claude 4.5
68.2
Gemini 2.5 Pro
64.7
Llama 4 405B
52.1
Human baseline
87.3
Multi-step reasoning under adversarial perturbation. 6 domains, 12k tasks, n=3 seeds. CC-BY-4.0.
Benchmarks and EvaluationAI Safety and SecurityAgentic AI for ScienceHuman–AI InteractionMechanistic interpretabilityRed-teamingCapability elicitationBenchmark designBias & toxicityDomain-grounded agentsScientific discoveryAI governanceDistributional robustnessSelf-explanationReasoning under pressureBenchmarks and EvaluationAI Safety and SecurityAgentic AI for ScienceHuman–AI InteractionMechanistic interpretabilityRed-teamingCapability elicitationBenchmark designBias & toxicityDomain-grounded agentsScientific discoveryAI governanceDistributional robustnessSelf-explanationReasoning under pressure
01·What we do

Three research lines, asking whether we can trust what these systems do next.

01 / 03

Benchmarks and Evaluation

We work on the science of LLM evaluation: what benchmarks measure, where they mislead, and how to build ones that hold up.

  • Benchmark design
  • Statistical evaluation
  • Capability elicitation
  • Contamination audits
02 / 03

AI Safety and Security

We work on bias, toxicity, and agentic misalignment, and on the technical and governance tools that address them.

  • Mechanistic interpretability
  • Red-teaming
  • Agentic misalignment
  • Policy translation
03 / 03

Agentic AI for Science

Agentic systems for scientific work. We focus on keeping them reliable, transparent, and grounded in the domain.

  • Literature synthesis
  • Hypothesis generation
  • Evidence grounding
  • Domain transfer
02·Selected work

Recent publications

All publications
03·For industry

We help teams ship AI they can defend.

Two ways to work with us: third-party evaluation of your models and agents, or a focused engagement that turns one of our research outputs into a tool you own.

Evaluation
Pre-deployment audits, custom benchmarks, agentic red-teaming.
Co-build
We work with engineering partners to turn lab work into tools other teams can run.
Sectors
SaaS, public sector, financial services, healthcare.
Engagement
12–24 weeks · NDA-friendly · publishable outcomes negotiable.
04·The lab

DPhils, MScs, and visiting fellows.

Full team
Felix KronesFK
Felix Krones
DPhil Student
Djavan De ClercqDDC
Djavan De Clercq
DPhil Student
Andrew M. BeanAMB
Andrew M. Bean
DPhil Student
Yushi YangYY
Yushi Yang
DPhil Student
Harry MayneHM
Harry Mayne
DPhil Student
05·Lab notes

Recent activity

All notes
May 2026
Paper
Three OxRML papers accepted at ICML 2026 — including a Spotlight
April 2026
Conference
OxRML presenting at ICLR 2026
February 2026
Paper
New paper in Nature Medicine on LLMs as medical assistants
February 2026
Award
Ryan Othniel Kearns wins MSc Thesis Prize
December 2025
Conference
OxRML at NeurIPS 2025
© 2026 OxRML · University of Oxford
v.2026.05