Dylan Bouchard, PhD

Lead Applied Research Scientist, Thomson Reuters Labs

I'm an applied research scientist at Thomson Reuters Labs. My recent research has focused on AI safety — uncertainty quantification, hallucination detection, and bias and fairness in large language models. Previously, I led the AI Research program at CVS Health, where I authored the UQLM and LangFair open-source toolkits. I hold a PhD in Economics (Econometrics) from North Carolina State University.

Open-Source Projects

Select Publications

  1. D. Bouchard and M. S. Chauhan. Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble Scorers. Transactions on Machine Learning Research, 2025. [OpenReview]
  2. D. Bouchard, M. S. Chauhan, D. Skarbrevik, H.-K. Ra, V. Bajaj, and Z. Ahmad. UQLM: A Python Package for Uncertainty Quantification in Large Language Models. Journal of Machine Learning Research, 27(13):1–10, 2026. [JMLR]
  3. D. Bouchard. Bring Your Own Prompts: Use-Case-Specific Bias and Fairness Evaluation for LLMs. Proceedings of the LT-EDI Workshop at ACL, 2026 (to appear). [arXiv]
  4. D. Bouchard, M. S. Chauhan, V. Bajaj, and D. Skarbrevik. Fine-Grained Uncertainty Quantification for Long-Form Language Model Outputs: A Comparative Study. Under review, 2026. [arXiv]
  5. D. Bouchard, M. S. Chauhan, D. Skarbrevik, V. Bajaj, and Z. Ahmad. LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases. Journal of Open Source Software, 10(105):7570, 2025. [DOI]
  6. D. Bouchard. Is Escalation Worth It? A Decision-Theoretic Characterization of LLM Cascades. Under review, 2026. [arXiv]

Selected Talks

Service

Reviewer for NeurIPS, ICLR, ACL, ACM TIST, and ACM CHI.