Image Manipulation Datasets (IMDS)

This Python package provides PyTorch-compatible dataset classes for common image manipulation datasets used in digital forensics and deepfake detection research.

Supported Datasets

- CASIA 2.0 - Forgery classification dataset with 4,795 images
- Defacto - Collection of manipulation datasets:
  - Copy/Move (~19,000 forgeries)
  - Splicing (~105,000 forgeries)
  - Inpainting (~25,000 forgeries)
- COVERAGE - Copy-move forgery database with similar but genuine objects
- IMD2020 - Real-life manipulated images from the Internet (2,010 images)

Installation

pip install image-manipulation-datasets
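
To confirm that the install worked, a quick check like the one below can help. It is a minimal sketch using only the standard library; `is_installed` is a hypothetical helper written for this example, not part of the package, and the module name `imds` is taken from the import in the Quick Start.

```python
import importlib.util


def is_installed(module_name):
    """Return True if the import system can locate the named top-level module."""
    return importlib.util.find_spec(module_name) is not None


if is_installed("imds"):
    print("imds is importable")
else:
    print("imds not found; try: pip install image-manipulation-datasets")
```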

Quick Start

from imds import casia
from torch.utils.data import DataLoader

# Load a dataset (CASIA 2.0 shown here)
dataset = casia.CASIA2(data_dir='data/CASIA2.0', split='train')
dataloader = DataLoader(dataset, batch_size=32, shuffle=True)

for images, masks in dataloader:
    # images: torch.Tensor of shape (batch_size, 3, H, W)
    # masks: torch.Tensor of shape (batch_size, 1, H, W)
    pass

Documentation

For comprehensive API documentation, usage examples, and advanced features, see:

📖 API Documentation (API_DOCUMENTATION.md)

The documentation includes:

- Complete API reference for all dataset classes
- Usage examples and common patterns
- Directory structure requirements for each dataset
- Performance optimization tips
- Error handling guidelines

Sample Quality

The datasets are not always clean. In COVERAGE, CASIA 2.0, and Defacto Splicing, some images and masks differ in size even though they have been verified as belonging together. For this reason, the dataset classes resize each mask to the size of its image, in the hope that the mask then lines up with the image. That alignment is unverified, since checking it would mean manually inspecting each of the more than 110,000 image-mask pairs.
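
To make the resizing step concrete, here is a dependency-free sketch of nearest-neighbour mask resizing on a nested list. It is an illustration only: `resize_mask_nearest` is a hypothetical helper, and the interpolation method the dataset classes actually use is not stated here. Nearest-neighbour sampling is chosen for the sketch because it keeps a binary mask binary.

```python
def resize_mask_nearest(mask, height, width):
    """Resize a 2D mask (list of rows) to height x width by nearest-neighbour sampling."""
    src_h = len(mask)
    src_w = len(mask[0])
    resized = []
    for y in range(height):
        # Map the target row back to a source row (floor of the scaled index).
        src_y = min(src_h - 1, (y * src_h) // height)
        row = []
        for x in range(width):
            # Same mapping for columns; values are copied, never blended.
            src_x = min(src_w - 1, (x * src_w) // width)
            row.append(mask[src_y][src_x])
        resized.append(row)
    return resized


# A 2x2 mask stretched to match a hypothetical 4x4 image.
small_mask = [[0, 1],
              [1, 0]]
full_mask = resize_mask_nearest(small_mask, 4, 4)
for row in full_mask:
    print(row)
```

Resizing makes the shapes agree, but it cannot show that the mask content is aligned with the image, which is why the alignment remains unverified.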
