Creating your own dataset of MRI images to train a CNN model

This involves several steps, including data collection, annotation, preprocessing, and augmentation. Here’s a step-by-step guide with a focus on preprocessing techniques:

Step-by-Step Guide to Creating an MRI Image Dataset

Step 1: Collect MRI Images

Sources: Collect MRI images from medical databases, hospitals, research collaborations, or publicly available datasets such as the NIH Clinical Center or Kaggle.

Ethics: Ensure you have the necessary permissions and ethical approvals for using medical images.

Step 2: Annotate the Data

Labeling: Annotate the images based on the diagnosis or regions of interest. This might involve labeling images as 'tumor', 'no tumor', or specific types of conditions.

Tools: Use tools such as LabelImg (labelImg on PyPI) for image annotation, or specialized medical imaging tools such as ITK-SNAP (SourceForge) or 3D Slicer.

Step 3: Organize the Data

Directory Structure: Organize images into directories based on their labels; a minimal sketch for building this layout follows the listing below.

    dataset/
      tumor/
        image1.png
        image2.png
      no_tumor/
        image1.png
        image2.png
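
If your raw downloads arrive in a single folder alongside a label file, a small script can build this layout. Below is a minimal sketch; the raw_downloads folder and a labels.csv file with filename and label columns are assumptions for illustration.

python

    import csv
    import os
    import shutil

    def organize_images(raw_dir, labels_csv, output_dir='dataset'):
        # Move each image into output_dir/<label>/ based on a CSV with columns: filename, label.
        with open(labels_csv, newline='') as f:
            for row in csv.DictReader(f):
                label_dir = os.path.join(output_dir, row['label'])
                os.makedirs(label_dir, exist_ok=True)
                shutil.move(os.path.join(raw_dir, row['filename']),
                            os.path.join(label_dir, row['filename']))

    organize_images('raw_downloads', 'labels.csv')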

Step 4: Preprocess the Data

Preprocessing is crucial for ensuring that your model receives clean and standardized data. Here are some common preprocessing techniques for MRI images:

A. Resizing

Resize all images to a fixed size (e.g., 128x128 or 224x224) to ensure uniformity. The example below uses the Pillow library (pillow on PyPI).

python

    from PIL import Image
    import os

    def resize_images(image_path, output_path, size=(128, 128)):
        # Create the output directory if it does not already exist.
        os.makedirs(output_path, exist_ok=True)
        for filename in os.listdir(image_path):
            if filename.endswith(".png"):
                img = Image.open(os.path.join(image_path, filename))
                img = img.resize(size)
                img.save(os.path.join(output_path, filename))

    resize_images('dataset/tumor', 'resized/tumor')
    resize_images('dataset/no_tumor', 'resized/no_tumor')

B. Normalization

Normalize pixel values to a range of 0 to 1 or standardize them to have zero mean and unit variance.

python

    import numpy as np
    from tensorflow.keras.preprocessing.image import ImageDataGenerator

    # Scale pixel values to the [0, 1] range.
    datagen = ImageDataGenerator(rescale=1./255)

Standardization (optional):

python

    # 'images' is a NumPy array of training images with shape (N, H, W, C).
    mean = np.mean(images, axis=(0, 1, 2, 3))
    std = np.std(images, axis=(0, 1, 2, 3))
    datagen = ImageDataGenerator(preprocessing_function=lambda x: (x - mean) / std)
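
The standardization snippet above assumes the training images have already been loaded into a single NumPy array named images; here is a minimal loading sketch under that assumption (the resized/ directories come from step A).

python

    import os
    import numpy as np
    from PIL import Image

    def load_images(image_dir, size=(128, 128)):
        # Load every PNG in a directory into one float32 array of shape (N, H, W, 1).
        arrays = []
        for filename in sorted(os.listdir(image_dir)):
            if filename.endswith(".png"):
                img = Image.open(os.path.join(image_dir, filename)).convert("L").resize(size)
                arrays.append(np.asarray(img, dtype=np.float32)[..., np.newaxis])
        return np.stack(arrays)

    images = np.concatenate([load_images('resized/tumor'), load_images('resized/no_tumor')])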


C. Data Augmentation

Use augmentation techniques to artificially increase the size of your dataset and improve model generalization.

python

    datagen = ImageDataGenerator(
        rescale=1./255,        # keep the 0-1 normalization from step B
        rotation_range=20,
        width_shift_range=0.2,
        height_shift_range=0.2,
        shear_range=0.2,
        zoom_range=0.2,
        horizontal_flip=True,
        fill_mode='nearest'
    )

D. Cropping and Padding

Crop or pad images to ensure consistent dimensions and focus on the region of interest.

python

    import numpy as np

    def crop_center(image, cropx, cropy):
        # Crop a centered cropy x cropx window out of the image.
        y, x = image.shape[:2]
        startx = x // 2 - (cropx // 2)
        starty = y // 2 - (cropy // 2)
        return image[starty:starty + cropy, startx:startx + cropx]

    def pad_image(image, target_size):
        # Zero-pad an (H, W, C) image up to target_size = (height, width).
        old_size = image.shape[:2]
        delta_w = target_size[1] - old_size[1]
        delta_h = target_size[0] - old_size[0]
        padding = ((delta_h // 2, delta_h - (delta_h // 2)),
                   (delta_w // 2, delta_w - (delta_w // 2)),
                   (0, 0))
        return np.pad(image, padding, mode='constant', constant_values=0)
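
A quick usage sketch for these helpers; the file path and sizes are placeholders chosen to match the earlier steps.

python

    from PIL import Image

    # Load one image as an (H, W, 1) array, crop the central 100x100 region,
    # then pad it back out to 128x128.
    img = np.asarray(Image.open('resized/tumor/image1.png').convert('L'))[..., np.newaxis]
    cropped = crop_center(img, 100, 100)
    padded = pad_image(cropped, (128, 128))
    print(padded.shape)  # (128, 128, 1)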

E. Histogram Equalization

Apply histogram equalization to improve the contrast of the images.

python

    import os
    import cv2

    def equalize_histogram(image_path, output_path):
        # Read each PNG as grayscale, equalize its histogram, and save the result.
        os.makedirs(output_path, exist_ok=True)
        for filename in os.listdir(image_path):
            if filename.endswith(".png"):
                img = cv2.imread(os.path.join(image_path, filename), cv2.IMREAD_GRAYSCALE)
                equ = cv2.equalizeHist(img)
                cv2.imwrite(os.path.join(output_path, filename), equ)

    equalize_histogram('dataset/tumor', 'equalized/tumor')
    equalize_histogram('dataset/no_tumor', 'equalized/no_tumor')

Step 5: Split the Data

Divide your dataset into training, validation, and test sets; a minimal splitting sketch follows the list below. A common split is:

70% for training

20% for validation

10% for testing
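
The sketch below performs such a split with the standard library only; the resized/ input layout and the split/ output directory names are assumptions carried over from the earlier steps.

python

    import os
    import random
    import shutil

    def split_dataset(source_dir, output_dir, ratios=(0.7, 0.2, 0.1), seed=42):
        # Copy images from source_dir/<label>/ into output_dir/{train,validation,test}/<label>/.
        random.seed(seed)
        for label in os.listdir(source_dir):
            files = sorted(os.listdir(os.path.join(source_dir, label)))
            random.shuffle(files)
            n_train = int(len(files) * ratios[0])
            n_val = int(len(files) * ratios[1])
            subsets = {
                'train': files[:n_train],
                'validation': files[n_train:n_train + n_val],
                'test': files[n_train + n_val:],
            }
            for subset, subset_files in subsets.items():
                subset_dir = os.path.join(output_dir, subset, label)
                os.makedirs(subset_dir, exist_ok=True)
                for filename in subset_files:
                    shutil.copy(os.path.join(source_dir, label, filename), subset_dir)

    split_dataset('resized', 'split')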

Step 6: Train Your CNN Model

Use TensorFlow/Keras to define and train your CNN model:

python

import tensorflow as tf
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Conv2D(32, (3, 3), activation='relu', input_shape=(128, 128, 1)),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(64, (3, 3), activation='relu'),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(128, (3, 3), activation='relu'),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(128, activation='relu'),
    layers.Dense(1, activation='sigmoid')
])

model.compile(optimizer='adam',
              loss='binary_crossentropy',
              metrics=['accuracy'])

# Training data flows through the augmenting generator defined in Step 4C.
train_generator = datagen.flow_from_directory(
    'split/train',          # training split created in Step 5
    target_size=(128, 128),
    color_mode='grayscale',
    batch_size=32,
    class_mode='binary'
)

# Validation data should only be rescaled, not augmented.
val_datagen = ImageDataGenerator(rescale=1./255)
validation_generator = val_datagen.flow_from_directory(
    'split/validation',     # validation split created in Step 5
    target_size=(128, 128),
    color_mode='grayscale',
    batch_size=32,
    class_mode='binary'
)

history = model.fit(train_generator, epochs=10, validation_data=validation_generator)

Step 7: Evaluate Your Model

Evaluate the model's performance on the test set:
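
The evaluation call below assumes a test_generator built from the held-out test split; here is a minimal sketch (the split/test path follows the split in Step 5 and, like the validation data, is only rescaled).

python

from tensorflow.keras.preprocessing.image import ImageDataGenerator

test_datagen = ImageDataGenerator(rescale=1./255)
test_generator = test_datagen.flow_from_directory(
    'split/test',           # test split created in Step 5
    target_size=(128, 128),
    color_mode='grayscale',
    batch_size=32,
    class_mode='binary',
    shuffle=False           # keep a stable order for evaluation
)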

python

test_loss, test_acc = model.evaluate(test_generator)

print(f'Test accuracy: {test_acc}')

By following these steps and using these preprocessing techniques, you can create a robust MRI image dataset for training a CNN model. Preprocessing helps to standardize the data, enhance features, and improve the overall performance of the model.

#AI #MachineLearning #Technology #DataScience #Python #DeepLearning #NeuralNetworks #ComputerVision #ImageRecognition #CV #ComputerGraphics #CNN #Model
