RAI Lab University of New Brunswick
University of New Brunswick · Faculty of Computer Science

Responsible AI.

A research lab at the University of New Brunswick advancing trustworthy, interpretable, and accountable AI — from explainability and safety to LLM-powered systems and agentic AI.

Photo · UNB Fredericton Campus
— The Lab

The Responsible AI Research Lab (RAI) at the University of New Brunswick, led by Dr. Mahtab Sarvmaili, develops machine learning systems that are trustworthy, interpretable, and accountable. Our work spans the core pillars of responsible AI — explainability, privacy and safety, and security and robustness — and extends into LLM-powered systems and agentic AI, with applications in healthcare, law, and the public interest.

— 01 · Mission

Research that doesn't only work — but can be understood.

Modern machine learning models perform remarkably well, and remain largely opaque. We develop the methods, tools, and frameworks that open these systems up — so that practitioners, regulators, and the people affected by them can see how a prediction was made and decide whether it should be trusted.

— 02 · Foundations

Three pillars of Responsible AI.

Our research is structured around three foundational pillars that together define what it means for an AI system to be responsibly deployed in the real world.

PILLAR I

Explainability, Interpretability & Transparency

Building methods that surface why a model made a prediction — opening the black box so that experts, end users, and auditors can understand, scrutinize, and verify model behaviour.

PILLAR II

Privacy & Safety

Ensuring AI systems handle sensitive data responsibly, do not leak training information, and operate within boundaries that protect individuals and institutions from harm.

PILLAR III

Security & Robustness

Making models resilient against adversarial inputs, distribution shift, and deployment failures — so that systems perform reliably under the conditions of the real world, not just the lab.

— 04 · Featured Publications

Recent work.

2026
Reinforcement-learned unequal error protection for quantized semantic embeddings in bandwidth-constrained communication
Array · Elsevier
2025
Data-centric explanations: FEHAN and DICTA frameworks for local interpretability of text classifiers
ICLR 2025
2024
HDExplain: High-dimensional explanations for graph neural network predictions
NeurIPS Workshop
2024
Task-driven control and model editing for deployed large language models
Preprint
View all publications →
— 05 · Recent

Lab news.

2026 · May

RAI Lab launches at UNB Faculty of Computer Science, welcoming new students and partners.

2026 · April

Umair Akram and Dr. Sarvmaili publish work on reinforcement-learned unequal error protection for quantized semantic embeddings (Array, Elsevier).

2025 · November

Dr. Mahtab Sarvmaili joins UNB as Assistant Professor in the Faculty of Computer Science.

2025 · March

Paper on data-centric model explanation accepted at ICLR 2025.