Skip to content

fisapool/medvision-ai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MedVision AI

Project Purpose & Background

Medical imaging annotation and reporting are critical for research, education, and the development of AI models in healthcare. However, current workflows are often:

  • Manual and time-consuming: Annotating images and generating reports typically require significant expert effort.
  • Fragmented: Tools for annotation, detection, and reporting are often separate, making workflows inefficient.
  • Not easily accessible: Many solutions require specialized software or are not user-friendly for non-experts.

MedVision AI was created to address these issues by providing:

  • An all-in-one, web-based platform for image annotation, detection, and AI-powered reporting.
  • Instant switching between detection modes and persistent, shareable reports.
  • A modern, intuitive interface for rapid experimentation and demonstration.

Who is this for?

  • Researchers and engineers working on medical imaging AI
  • Educators and medical students / Medical Lab Radiologist learning about medical image analysis
  • Developers building or testing annotation/reporting tools

Note: Many medical students and trainees now use large language models (LLMs) such as ChatGPT, Claude, or Gemini to help generate radiology reports or explanations. While these tools can be helpful for learning and brainstorming, they have important limitations (see below).

Limitations of LLMs for Medical Reporting

  • Hallucinations: LLMs may generate plausible-sounding but factually incorrect or fabricated information.
  • Lack of clinical validation: Reports generated by LLMs are not reviewed or validated by medical experts.
  • No access to patient history: LLMs do not have access to real patient data, prior studies, or clinical context.
  • Not a substitute for expert review: LLM-generated reports should never be used for clinical decision-making or patient care.

Always consult a qualified healthcare professional for medical advice and diagnosis.

A modern web app for medical image annotation, detection, and reporting using AI.

Features

  • Upload or select demo medical images (e.g., CT scans)
  • Multiple detection modes: 2D bounding boxes, segmentation masks, points, 3D bounding boxes
  • Medical imaging report generation (AI-powered)
  • Persistent, collapsible report panel with copy/download
  • Caching for fast mode switching and consistent results
  • Modern, responsive UI with tooltips and loading indicators

Demo Image

  • The app includes a demo image: test_image.jpeg (CT scan)
  • This image is available as the first example image when the app loads
  • You can also upload your own images (JPG, PNG, WEBP)

Requirements

  • Node.js (v16 or newer recommended)
  • npm or yarn

Setup & Installation

  1. Clone the repository:

    git clone <your-repo-url>
    cd spatial-understanding-MV5
  2. Install dependencies:

    npm install
    # or
    yarn install
  3. Set up environment variables:

    • Copy .env.local.example to .env.local (if provided)
    • Add your Gemini API key or other required secrets to .env.local
  4. Run the app:

    npm run dev
    # or
    yarn dev

    The app will be available at http://localhost:5173 (or as shown in your terminal)

Usage

  • Select a demo image from the left sidebar, or upload your own
  • Choose a detection mode (2D bounding boxes, segmentation masks, etc.)
  • View results instantly thanks to caching
  • Generate a medical imaging report by clicking the Reporting button
  • Copy or download the report from the right panel

Development

  • Built with React, TypeScript, and Tailwind CSS
  • Uses Jotai for state management
  • Easily extensible for new detection/reporting features

Contributing

Pull requests and issues are welcome! Please open an issue to discuss major changes first.

License

Apache-2.0

Project Author

Created by DR Arif Fahmi, MBBS, MSU Malaysia graduate

Disclaimer

This application is for research and demonstration purposes only. It is not intended for clinical or diagnostic use. Do not use the outputs of this app to make medical decisions. Always consult a qualified healthcare professional for medical advice and diagnosis.

About

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published