Projects
GAA Simulation Platform
The world's first cross-industry adversarial training simulation platform. Replaces expensive human role-players with intelligent AI agents, enabling scalable, measurable professional training for hospitals, banks, insurers, and contact centers. Built by a team with both clinical expertise and deep AI engineering capability.
OSCE-Project
Medical dialogue evaluation framework inspired by Objective Structured Clinical Examinations. Simulates 64 patient personas and measures Empathy, Persuasion, and Safety.
FHIR-Agent
An LLM agent framework for interacting with FHIR databases, enabling structured clinical data retrieval and reasoning.
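As a rough illustration of what structured retrieval from a FHIR server can look like, the sketch below builds a standard FHIR REST search URL and unpacks the resources from a returned search Bundle. It is not taken from FHIR-Agent itself: the base URL and the function names `build_search_url` and `resources_from_bundle` are hypothetical, and only the `[base]/[ResourceType]?param=value` search pattern and the `Bundle.entry[].resource` layout come from the FHIR specification.

```python
from urllib.parse import urlencode


def build_search_url(base_url, resource_type, params):
    """Build a FHIR search URL of the form [base]/[ResourceType]?key=value&..."""
    query = urlencode(params)
    return f"{base_url.rstrip('/')}/{resource_type}?{query}"


def resources_from_bundle(bundle):
    """Return the resources carried in a FHIR search Bundle (a parsed JSON dict)."""
    return [entry["resource"] for entry in bundle.get("entry", []) if "resource" in entry]


# Hypothetical server address, used only for illustration.
BASE_URL = "https://fhir.example.org/baseR4"

url = build_search_url(BASE_URL, "Patient", {"family": "Smith", "birthdate": "ge1980-01-01"})

# A hand-written Bundle standing in for a server response.
sample_bundle = {
    "resourceType": "Bundle",
    "type": "searchset",
    "entry": [
        {"resource": {"resourceType": "Patient", "id": "p1"}},
        {"resource": {"resourceType": "Patient", "id": "p2"}},
    ],
}
patient_ids = [r["id"] for r in resources_from_bundle(sample_bundle)]
```

An agent tool built this way would hand the URL to an HTTP client and pass the parsed resources back to the model; authentication and paging are left out of the sketch.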
OSCE-AgentBeats Leaderboard
Public leaderboard for the OSCE evaluator within the AgentBeats challenge ecosystem. Compare medical agent performance across standardized clinical scenarios.
AgentBeats Security Arena
Adversarial security testing framework for AI agents. Evaluates robustness and safety against prompt injection and manipulation attacks.
OSCE Real-Time Voice
Real-time voice-based clinical examination system extending the OSCE framework with speech interaction for realistic doctor-patient dialogue evaluation.
Model Training & Distillation
End-to-end LLM fine-tuning and knowledge distillation pipelines for domain-specific medical AI models. Building smaller, faster, and more accurate clinical language models.
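For readers unfamiliar with knowledge distillation, the sketch below shows one common formulation: the student is trained to match the teacher's temperature-softened output distribution through a KL-divergence term. The project description does not say which loss these pipelines use, so this is an assumption for illustration only; the function names and the default temperature are hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax, shifted by the max for numerical stability."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by temperature squared."""
    teacher_probs = softmax(teacher_logits, temperature)
    student_probs = softmax(student_logits, temperature)
    kl = sum(
        p * math.log(p / q)
        for p, q in zip(teacher_probs, student_probs)
        if p > 0  # terms with zero teacher probability contribute nothing
    )
    return temperature ** 2 * kl


# Toy logits over three classes, invented for the example.
teacher = [4.0, 1.0, 0.5]
matching_student = [4.0, 1.0, 0.5]
diverging_student = [0.5, 1.0, 4.0]

loss_same = distillation_loss(matching_student, teacher)
loss_diff = distillation_loss(diverging_student, teacher)
```

The loss is zero when the student reproduces the teacher's logits and grows as the two distributions diverge. In practice this term is usually combined with an ordinary cross-entropy loss on ground-truth labels.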