Papers & Submissions

Academic research from the Failure-First program

The Failure-First research program produces peer-reviewed papers, preprints, and policy submissions documenting how embodied AI systems fail under adversarial pressure. Click any paper title to read the full text online.

Preprint

Silent Failures in Embodied AI

Venue: arXiv Preprint

Demonstrates that current AI safety operates exclusively at the text layer while embodied AI danger emerges at the action layer. Zero outright refusals across 63 FLIP-graded VLA traces.

Embodied AIVLA SafetyAction LayerPARTIAL Compliance

Draft

The Inverse Detectability-Danger Law

Venue: AIES 2026

Examines how embodied AI systems adopt injected decision criteria at inference time, producing context-dependent compliance patterns that undermine safety guarantees.

AI EthicsDecision InjectionEmbodied AISafety Evaluation

Citation

If you use our research, data, or methodology, please cite:

@article{wedd2026failurefirst,
  title={Failure-First Evaluation of Embodied AI Safety:
         Adversarial Benchmarking Across 227 Models},
  author={Wedd, Adrian},
  year={2026},
  note={Available at https://failurefirst.org}
}

See our citation guide for venue-specific formats.