A compute-supported project from MiniMax that applies long-context, multimodal, and Agent capabilities to medical foundation model development and clinically verifiable workflows.
| Item | Details |
|---|---|
| Program | MiniMax Cowork Team Fellowship |
| Support | USD 4,500 compute grant |
| Direction | Medical foundation model development and clinically verifiable Agent workflows |
Highlights
MiniMax Cowork Team · Compute Grant · Medical Foundation Models
A data-and-model framework for trustworthy medical deepfake detection, built around evidence-grounded reasoning, forgery localization, and a public demo stack.
| Asset | Details |
|---|---|
| Paper | ACL 2026 Main Conference |
| Dataset | MedForge-90K, covering CT, MRI, and X-ray with 19 lesion types |
| Model | MedForge-Reasoner on Hugging Face |
| Demo | Online detector Space for interactive testing |
An open medical image editing dataset with 50,635 successful edits and 37,822 failed attempts across three modalities and 23 disease types.
| Modality | Task | Diseases | Success | Failed |
|---|---|---|---|---|
| Chest X-ray | Add | 12 | 9,854 | 7,971 |
| Chest X-ray | Remove | 12 | 10,667 | 4,750 |
| Brain MRI | Add | 4 | 4,536 | 8,630 |
| Brain MRI | Remove | 4 | 4,355 | 6,949 |
| Fundus | Add | 7 | 18,505 | 3,162 |
| Fundus | Remove | 7 | 2,718 | 6,360 |
| Total | | 23+ | 50,635 | 37,822 |
Open asset: Dataset, code, and paper are publicly available for medically grounded image editing research.
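The table gives raw counts only. As a small sketch, the per-row and overall success rates they imply can be derived as follows. The counts are copied from the table; the function and variable names are illustrative and not part of the released code.

```python
# Counts copied from the table above: (modality, task, success, failed).
ROWS = [
    ("Chest X-ray", "Add", 9854, 7971),
    ("Chest X-ray", "Remove", 10667, 4750),
    ("Brain MRI", "Add", 4536, 8630),
    ("Brain MRI", "Remove", 4355, 6949),
    ("Fundus", "Add", 18505, 3162),
    ("Fundus", "Remove", 2718, 6360),
]


def success_rate(success: int, failed: int) -> float:
    """Fraction of edit attempts that ended in a successful edit."""
    return success / (success + failed)


# Column totals, which should match the "Total" row of the table.
total_success = sum(row[2] for row in ROWS)
total_failed = sum(row[3] for row in ROWS)

for modality, task, ok, bad in ROWS:
    print(f"{modality:12s} {task:7s} {success_rate(ok, bad):.1%}")
print(f"{'Overall':20s} {success_rate(total_success, total_failed):.1%}")
```

The two sums reproduce the totals in the last row (50,635 and 37,822), which is a quick consistency check on the table.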
A zero-shot detection framework for identifying LLM-generated text in specialized domains like medicine and law, using normalized entropy-based scoring and domain knowledge distillation.
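The description does not spell out the scoring rule, so the following is only a generic illustration of what "normalized entropy" usually means, not the framework's actual method. It assumes a language model supplies one next-token probability distribution per position. The function names and the simple averaging step are assumptions made for this sketch.

```python
import math
from typing import Sequence


def normalized_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy of one distribution divided by its maximum, log(K).

    The result lies in [0, 1]: 0 for a one-hot distribution,
    1 for a uniform distribution over K outcomes.
    """
    k = len(probs)
    if k < 2:
        return 0.0
    entropy = -sum(p * math.log(p) for p in probs if p > 0.0)
    return entropy / math.log(k)


def mean_normalized_entropy(token_dists: Sequence[Sequence[float]]) -> float:
    """Average the per-token normalized entropy over a passage (assumed aggregation)."""
    if not token_dists:
        raise ValueError("need at least one token distribution")
    return sum(normalized_entropy(d) for d in token_dists) / len(token_dists)
```

Dividing by log(K) makes scores comparable across vocabularies of different sizes. How such a score is thresholded or combined with domain knowledge is left open here.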
| Metric | Improvement |
|---|---|
| AUROC | +14.4% vs. SOTA |
| Recall @ 0.1% FPR | +64.0% vs. SOTA |
| Zero-shot Capability | No training needed |
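For readers unfamiliar with the second metric, here is a minimal sketch of how recall at a fixed false-positive rate can be computed from detector scores. It assumes higher scores mean "more likely LLM-generated". The function name and the strict-inequality thresholding are choices made for this example, not details taken from the paper's evaluation code.

```python
from typing import Sequence


def recall_at_fpr(
    scores_pos: Sequence[float],
    scores_neg: Sequence[float],
    max_fpr: float,
) -> float:
    """Recall on positives at a threshold whose FPR on negatives is <= max_fpr.

    A sample is flagged when its score is strictly above the threshold.
    """
    if not scores_pos or not scores_neg:
        raise ValueError("both score lists must be non-empty")
    # Number of negatives we may flag without exceeding max_fpr (floored).
    allowed = int(max_fpr * len(scores_neg))
    neg_sorted = sorted(scores_neg, reverse=True)
    # Use the (allowed + 1)-th highest negative score as the threshold, so at
    # most `allowed` negatives lie strictly above it.
    threshold = neg_sorted[allowed] if allowed < len(neg_sorted) else float("-inf")
    flagged = sum(1 for s in scores_pos if s > threshold)
    return flagged / len(scores_pos)
```

At 0.1% FPR the threshold depends on only the top-scoring human-written samples, so a large negative set is needed for a stable estimate: with fewer than 1,000 negatives, `allowed` is zero.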
A GPU-accelerated legal-domain speech-to-text service delivered for Haiwen & Partners LLP (HK), built on Whisper Large-v2 and packaged as a production serving stack.
Highlights
Whisper Large-v2 · Docker + FastAPI · WER 7.8%
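Word error rate (WER) is the word-level edit distance between a reference transcript and the system output, divided by the number of reference words. The sketch below shows that standard definition. It does no text normalization (casing, punctuation), and it is not known how the 7.8% figure above was normalized or measured. The function name and sample sentences are illustrative.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # Levenshtein distance over words, keeping only one previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One of five reference words is dropped in the hypothesis.
print(word_error_rate("the court adjourned the hearing",
                      "the court adjourned hearing"))
```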
An autonomous trading agent built at AQUMON on a LangChain architecture, orchestrating market analysis, signal generation, decision-making, and execution monitoring into a single closed loop for programmatic Hong Kong equity trading.
Highlights
LangChain Agent · Futu OpenAPI · AQUMON (HK)
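The four stages named above (analysis, signal generation, decision, execution monitoring) can be pictured as one cycle of plain functions. The sketch below is a toy under loud assumptions: the momentum rule, the thresholds, the position limits, and every name in it are placeholders invented for illustration. It calls neither LangChain nor the Futu API, and it is not the strategy used at AQUMON.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Order:
    symbol: str
    side: str       # "BUY" or "SELL"
    quantity: int


def analyze(prices: List[float]) -> float:
    """Market analysis stand-in: last price minus the window mean."""
    return prices[-1] - sum(prices) / len(prices)


def generate_signal(momentum: float, band: float = 0.5) -> str:
    """Signal generation: map the momentum value to BUY / SELL / HOLD."""
    if momentum > band:
        return "BUY"
    if momentum < -band:
        return "SELL"
    return "HOLD"


def decide(symbol: str, signal: str, position: int,
           lot_size: int = 100, max_position: int = 300) -> Optional[Order]:
    """Decision step: apply position limits before emitting an order."""
    if signal == "BUY" and position + lot_size <= max_position:
        return Order(symbol, "BUY", lot_size)
    if signal == "SELL" and position >= lot_size:
        return Order(symbol, "SELL", lot_size)
    return None


def run_cycle(symbol: str, prices: List[float], position: int,
              execute: Callable[[Order], bool]) -> int:
    """One pass of the loop; returns the position after the cycle."""
    order = decide(symbol, generate_signal(analyze(prices)), position)
    if order is None:
        return position
    # Execution monitoring: update the position only on a confirmed fill.
    if not execute(order):
        return position
    delta = order.quantity if order.side == "BUY" else -order.quantity
    return position + delta
```

Passing `execute` as a callable keeps the broker integration out of the decision logic, so the loop can be exercised with a stub such as `lambda order: True` before any real order routing is attached.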
A production agent harness that parses, edits, and re-formats documents from natural-language instructions, built on the ReAct paradigm with Kimi-K2 as the core LLM. Designed as a self-contained, shippable agent rather than a notebook demo.
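The ReAct pattern alternates a reasoning step, a tool call, and an observation until the model decides it is done. Below is a minimal sketch of that loop for document editing. The LLM call is replaced by a hard-coded `scripted_policy`, and the two tools, the `"old=>new"` argument format, and all names are hypothetical stand-ins rather than the harness's real interface.

```python
from typing import Callable, Dict, List, Tuple

Step = Tuple[str, str, str]  # (thought, action, argument)


def tool_replace(doc: str, arg: str) -> str:
    """Edit tool: arg has the form 'old=>new' (hypothetical format)."""
    old, new = arg.split("=>", 1)
    return doc.replace(old, new)


def tool_strip(doc: str, _arg: str) -> str:
    """Re-format tool: remove trailing whitespace from every line."""
    return "\n".join(line.rstrip() for line in doc.splitlines())


TOOLS: Dict[str, Callable[[str, str], str]] = {
    "replace": tool_replace,
    "strip": tool_strip,
}


def scripted_policy(instruction: str, doc: str, history: List[Step]) -> Step:
    """Stand-in for the LLM: returns the next (thought, action, argument)."""
    plan = [
        ("Fix the misspelled word first.", "replace", "teh=>the"),
        ("Then remove trailing whitespace.", "strip", ""),
        ("Nothing left to do.", "finish", ""),
    ]
    return plan[min(len(history), len(plan) - 1)]


def react_loop(instruction: str, doc: str,
               policy: Callable[[str, str, List[Step]], Step],
               max_steps: int = 8) -> Tuple[str, List[Step]]:
    """Run thought -> action -> observation until 'finish' or the step cap."""
    history: List[Step] = []
    for _ in range(max_steps):
        thought, action, arg = policy(instruction, doc, history)
        if action == "finish":
            break
        if action in TOOLS:
            doc = TOOLS[action](doc, arg)
            observation = f"{action} applied"
        else:
            observation = f"unknown tool: {action}"
        history.append((thought, action, observation))
    return doc, history


result, trace = react_loop("Fix typos and tidy whitespace.",
                           "teh contract  \nteh clause\t", scripted_policy)
```

The `max_steps` cap and the "unknown tool" observation are the kind of guardrails that separate a shippable loop from a notebook demo: the agent cannot run indefinitely, and a bad tool name is fed back as an observation instead of raising.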