Page cover image

Medical Dataset

The Recurv-Medical-Dataset is a comprehensive resource of 67,299 high-quality question-answer pairs explicitly designed for training and fine-tuning medical AI models. Curated from trusted medical sources, this dataset focuses on real-world scenarios like anamnesis, diagnostics, and treatment recommendations. It sets a new benchmark for advancing conversational AI in the healthcare domain.


📈 Dataset Statistics

Feature

Value

Number of QA Pairs

67,299

Average Question Length

420

Average Answer Length

603


📜 Data Sources

Sourced from the most authoritative and trusted references in the Solana ecosystem:

  • PubMed and Open Access Journals

  • Clinical Practice Guidelines (WHO, CDC)

  • Medical Textbooks

  • EHR-Simulated Data

  • Peer-Reviewed Research Papers


🙌 Contributing

We welcome contributions to enhance Recurv-Medical-Dataset. You can:

  • Share feedback or suggestions on the Hugging Face Model Hub

  • Submit pull requests or issues for model improvement.

Last updated