A data-and-model framework for trustworthy medical deepfake detection, built around evidence-grounded reasoning, forgery localization, and a public demo stack.
| Asset | Details |
|---|---|
| Paper | ACL 2026 Main Conference |
| Dataset | MedForge-90K, covering CT, MRI, and X-ray with 19 lesion types |
| Model | MedForge-Reasoner on Hugging Face |
| Demo | Online detector Space for interactive testing |
A comprehensive 50K-image dataset for instruction-based medical image editing spanning three modalities and 23 disease types.
| Modality | Task | Diseases | Success | Failed |
|---|---|---|---|---|
| Chest X-ray | Add | 12 | 9,854 | 7,971 |
| Chest X-ray | Remove | 12 | 10,667 | 4,750 |
| Brain MRI | Add | 4 | 4,536 | 8,630 |
| Brain MRI | Remove | 4 | 4,355 | 6,949 |
| Fundus | Add | 7 | 18,505 | 3,162 |
| Fundus | Remove | 7 | 2,718 | 6,360 |
| Total | 23+ | 50,635 | 37,822 | |
Note: Full dataset will be released on Hugging Face upon paper acceptance.
A zero-shot detection framework for identifying LLM-generated text in specialized domains like medicine and law, using normalized entropy-based scoring and domain knowledge distillation.
| Metric | Improvement |
|---|---|
| AUROC | +14.4% vs. SOTA |
| Recall @ 0.1% FPR | +64.0% vs. SOTA |
| Zero-shot Capability | No training needed |