System Architecture

This document provides a detailed overview of the Brain Tumor Monitoring System architecture, including components, data flow, and design decisions.

High-Level Architecture

The system follows a microservices architecture with the following key components:

  • Frontend Layer: React-based dashboard providing the real-time monitoring interface

  • API Gateway: FastAPI-based REST API handling all client requests

  • Monitoring Engine: Core monitoring logic for drift detection and feature extraction

  • Database Layer: PostgreSQL database storing predictions, features, and monitoring data

  • ML Pipeline: YOLOv8-based brain tumor detection and classification

  • Reporting System: Evidently AI-powered HTML report generation

Component Details

Frontend (React + TypeScript)

Technology Stack:

  • React 18+ with TypeScript

  • Tailwind CSS for styling

  • Chart.js for visualizations

  • React Router for navigation

  • Axios for API communication

Key Features:

  • Real-time dashboard updates

  • Interactive charts and graphs

  • Image upload interface

  • Report viewer

  • Responsive design

Architecture:

// Component structure
src/
├── components/          # Reusable UI components
│   ├── Dashboard.tsx    # Main dashboard
│   ├── Upload.tsx       # Image upload
│   └── Reports.tsx      # Report viewer
├── hooks/              # Custom React hooks
├── pages/              # Page components
├── types/              # TypeScript definitions
└── utils/              # Utility functions

Backend API (FastAPI)

Technology Stack:

  • FastAPI for REST API

  • SQLAlchemy for ORM

  • PostgreSQL for database

  • Pydantic for data validation

  • Uvicorn for ASGI server

API Structure:

backend/
├── src/
│   └── api.py          # Main FastAPI application
├── migrations/         # Database migrations
└── requirements.txt    # Python dependencies

Key Endpoints:

  • /health - Health checks

  • /predict - Image prediction

  • /monitoring/* - Monitoring endpoints

  • /patients/* - Patient management
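
To make the API structure concrete, the snippet below is a minimal sketch of how the /health and /predict endpoints could be wired up with FastAPI and Pydantic. The response fields and dummy return values are illustrative assumptions, not the actual contents of backend/src/api.py.

# Hypothetical sketch of the FastAPI application (illustrative only).
from fastapi import FastAPI, File, UploadFile
from pydantic import BaseModel

app = FastAPI(title="Brain Tumor Monitoring API")

class PredictionResponse(BaseModel):
    prediction_class: str
    prediction_confidence: float
    num_detections: int

@app.get("/health")
def health() -> dict:
    # Liveness probe used by the frontend and deployment health checks.
    return {"status": "ok"}

@app.post("/predict", response_model=PredictionResponse)
async def predict(file: UploadFile = File(...)) -> PredictionResponse:
    image_bytes = await file.read()
    # Placeholder: the real endpoint runs the YOLOv8 pipeline on image_bytes
    # and logs extracted features to the predictions_log table.
    return PredictionResponse(
        prediction_class="no_tumor",     # dummy values for illustration only
        prediction_confidence=0.0,
        num_detections=0,
    )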

Monitoring Engine

Core Components:

  • BrainTumorImageMonitor: Main orchestrator

  • DriftDetector: Statistical drift detection

  • FeatureExtractor: Image feature extraction

  • ReportGenerator: HTML report creation

Architecture:

monitoring/
└── core/
    ├── monitor.py           # Main monitor class, orchestrates monitoring logic
    ├── drift_detector.py    # Drift detection logic
    ├── feature_extractor.py # Feature extraction
    └── __init__.py          # Core monitoring package init

# Monitoring logic is now integrated into the backend (see backend/src/api.py)
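As a rough illustration of how these components fit together, the sketch below shows one possible orchestration flow. The method names (extract, check_drift, generate) and the drift-result shape are assumptions, not the actual interfaces of the monitoring package.

# Hypothetical orchestration flow inside BrainTumorImageMonitor (names are assumptions).
class BrainTumorImageMonitor:
    def __init__(self, feature_extractor, drift_detector, report_generator, db_session):
        self.feature_extractor = feature_extractor
        self.drift_detector = drift_detector
        self.report_generator = report_generator
        self.db_session = db_session

    def process(self, image, prediction):
        # 1. Extract statistical and tumor-specific features from the image.
        features = self.feature_extractor.extract(image, prediction)
        # 2. Persist the prediction and features (predictions_log table).
        self.db_session.add(features)
        self.db_session.commit()
        # 3. Compare recent features against the reference window.
        drift_result = self.drift_detector.check_drift(features)
        # 4. Generate an Evidently HTML report when drift is flagged.
        if drift_result.get("drift_detected"):
            self.report_generator.generate(drift_result)
        return drift_result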

Database Design

PostgreSQL Schema:

-- Main predictions table
CREATE TABLE predictions_log (
    id SERIAL PRIMARY KEY,
    timestamp TIMESTAMP NOT NULL,
    prediction_confidence FLOAT,
    prediction_class VARCHAR(50),
    num_detections INTEGER,
    model_version VARCHAR(50),
    processing_time_ms INTEGER,

    -- Image features
    image_width INTEGER,
    image_height INTEGER,
    image_channels INTEGER,
    image_size_bytes BIGINT,
    brightness_mean FLOAT,
    brightness_std FLOAT,
    contrast_mean FLOAT,
    contrast_std FLOAT,
    entropy FLOAT,
    skewness FLOAT,
    kurtosis FLOAT,
    mean_intensity FLOAT,
    std_intensity FLOAT,

    -- Tumor-specific features
    tumor_area_ratio FLOAT,
    tumor_detection_confidence FLOAT,
    num_tumors_detected INTEGER,
    largest_tumor_area FLOAT,
    tumor_density FLOAT,
    tumor_location_x FLOAT,
    tumor_location_y FLOAT,
    tumor_shape_regularity FLOAT
);

Indexes:

-- Performance indexes
CREATE INDEX idx_predictions_timestamp ON predictions_log(timestamp);
CREATE INDEX idx_predictions_class ON predictions_log(prediction_class);
CREATE INDEX idx_predictions_confidence ON predictions_log(prediction_confidence);
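
On the backend, this table is accessed through SQLAlchemy (see the Technology Stack above). A hypothetical ORM model covering a subset of the columns might look like the following; the class name and module location are assumptions.

# Hypothetical SQLAlchemy model for a subset of predictions_log columns.
from sqlalchemy import Column, DateTime, Float, Integer, String
from sqlalchemy.orm import declarative_base

Base = declarative_base()

class PredictionLog(Base):
    __tablename__ = "predictions_log"

    id = Column(Integer, primary_key=True)
    timestamp = Column(DateTime, nullable=False, index=True)
    prediction_confidence = Column(Float, index=True)
    prediction_class = Column(String(50), index=True)
    num_detections = Column(Integer)
    model_version = Column(String(50))
    processing_time_ms = Column(Integer)

    # Image features (subset; see the full schema above)
    brightness_mean = Column(Float)
    contrast_mean = Column(Float)
    entropy = Column(Float)

    # Tumor-specific features (subset)
    tumor_area_ratio = Column(Float)
    num_tumors_detected = Column(Integer)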

ML Pipeline

Technology Stack:

  • YOLOv8 for object detection

  • OpenCV for image processing

  • NumPy/Pandas for data manipulation

  • PyTorch for deep learning

Pipeline Components:

ml/
├── train.py           # Model training
├── predict.py         # Model inference
├── models.py          # Model definitions
├── configs/           # Configuration files
└── utils.py           # Utility functions

Training Process:

  1. Data Preparation: Image preprocessing and annotation

  2. Model Training: YOLOv8 training with custom dataset

  3. Validation: Model evaluation on test set

  4. Export: Model export for production
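
For reference, a minimal YOLOv8 training call with the Ultralytics API could look like the sketch below; the dataset path and hyperparameters are placeholders, not the project's actual settings.

# Minimal YOLOv8 training sketch (dataset path and hyperparameters are placeholders).
from ultralytics import YOLO

model = YOLO("yolov8n.pt")          # start from a pretrained checkpoint
model.train(
    data="ml/configs/data.yaml",    # placeholder dataset config
    epochs=50,
    imgsz=640,
    batch=16,
)
model.export(format="onnx")         # export for production (step 4 above)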

Inference Process:

  1. Image Preprocessing: Resize, normalize, format conversion

  2. Model Inference: YOLOv8 prediction

  3. Post-processing: NMS, confidence filtering

  4. Feature Extraction: Statistical and tumor-specific features
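
The inference and feature-extraction steps can be sketched as follows; the confidence threshold, file paths, and the subset of features computed here are simplified assumptions.

# Simplified inference + feature extraction sketch (paths and threshold are placeholders).
import cv2
from ultralytics import YOLO

model = YOLO("best.pt")                       # trained weights (placeholder path)
image = cv2.imread("scan.jpg")                # placeholder input image

results = model.predict(image, conf=0.25)     # YOLOv8 handles resizing, normalization, NMS
boxes = results[0].boxes

gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
features = {
    "num_tumors_detected": len(boxes),
    "tumor_detection_confidence": float(boxes.conf.max()) if len(boxes) else 0.0,
    "brightness_mean": float(gray.mean()),
    "brightness_std": float(gray.std()),
}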

Experiment Logging

Our system provides robust experiment logging and tracking capabilities to ensure reproducibility and facilitate hyperparameter optimization.

Key Tools and Components:

  • Hydra: Used for flexible configuration management. Training scripts (ml/train.py, ml/train_sweep.py) can be launched with different configurations using Hydra, allowing for easy experimentation with various hyperparameters and settings. The main configuration is stored in ml/configs/model/config.yaml.

  • Weights & Biases (wandb): Integrated for experiment tracking and logging. When enabled (via the --wandb flag or wandb_logging: True in config), all training runs log metrics, hyperparameters, and results to the wandb dashboard. This is implemented in ml/models.py and can be toggled in both CLI and config files.

  • Sweep: Hyperparameter sweeps are managed using a sweep configuration file (ml/configs/sweep.yaml) and the sweep training script (ml/train_sweep.py). The sweep setup allows for automated exploration of hyperparameter combinations (e.g., model type, batch size, epochs) using Bayesian optimization. The sweep can be run locally or in the cloud using the provided Dockerfile (dockerfiles/sweep.Dockerfile).

  • Docker Integration: The sweep Dockerfile (dockerfiles/sweep.Dockerfile) provides a reproducible environment for running sweeps and logging experiments, with environment variables for wandb project/entity and sweep configuration.
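
To make the integration concrete, the sketch below shows one way a Hydra-driven entry point with optional wandb logging could be structured; the config fields and project name are assumptions, not the actual contents of ml/configs/model/config.yaml or ml/train.py.

# Hypothetical Hydra + wandb entry point (config fields and project name are assumptions).
import hydra
import wandb
from omegaconf import DictConfig, OmegaConf

@hydra.main(config_path="configs/model", config_name="config", version_base=None)
def main(cfg: DictConfig) -> None:
    if cfg.get("wandb_logging", False):
        # Log the resolved Hydra config so every run is reproducible.
        wandb.init(
            project="brain-tumor-monitoring",   # placeholder project name
            config=OmegaConf.to_container(cfg, resolve=True),
        )
    # train_model(...) would be called here with values from cfg (see ml/models.py).

if __name__ == "__main__":
    main()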

How to Use:

  • To run a single experiment with logging:

    python ml/train.py --wandb
    
  • To run a sweep (hyperparameter search):

    wandb sweep ml/configs/sweep.yaml
    wandb agent <entity>/<project>/<sweep_id>
    
  • All experiment logs, metrics, and results are tracked in the wandb dashboard for easy comparison and analysis.

Note: Faust is not used in this codebase.

Distributed Computing

Our system supports distributed training to efficiently utilize multiple CPU cores or nodes, enabling faster, more scalable model training. This is especially important for large datasets or complex models.

Key Files and Components:

  • ml/train.py: The main entry point for training. It exposes a CLI (via Typer and Hydra) that allows you to specify the number of workers for distributed data loading and training.

  • ml/models.py: The train_model function accepts a num_workers argument, which controls the number of worker processes for data loading and distributed training.

  • Dockerfiles and configuration files: These support launching distributed jobs in both local and cloud environments.

How Distributed Training Works:

  • The number of workers can be set via command-line arguments or configuration files, allowing flexible scaling based on available resources.

  • The system can launch multiple processes for data loading and model training, leveraging PyTorch’s distributed and multiprocessing capabilities.

  • This design enables efficient use of cloud resources (e.g., GCP VMs) or local multi-core machines.

Example Usage:

python ml/train.py --num_workers 4

This command launches training with 4 worker processes for data loading and distributed computation.
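
Internally, the num_workers value is typically passed down to PyTorch data loading. A simplified sketch of that wiring is shown below; the dataset and function body are placeholders, not the actual train_model implementation in ml/models.py.

# Simplified sketch of how num_workers feeds into PyTorch data loading (placeholder data).
import torch
from torch.utils.data import DataLoader, TensorDataset

def train_model(num_workers: int = 4) -> None:
    dataset = TensorDataset(torch.randn(8, 3, 64, 64))   # placeholder tensors
    loader = DataLoader(
        dataset,
        batch_size=4,
        shuffle=True,
        num_workers=num_workers,   # parallel worker processes for data loading
    )
    for (batch,) in loader:
        pass  # the real loop runs YOLOv8 training steps here

train_model(num_workers=4)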

By supporting distributed computing, our architecture is designed for scalability and efficient resource utilization in both research and production settings.