AI-powered face detection and recognition in videos
Face Finder is a powerful tool that uses state-of-the-art AI models to find and track specific people in video files. It supports multiple reference images, consensus-based matching, frame export with bounding boxes, and comprehensive result analysis.
- High Accuracy: Uses DeepFace with ArcFace, Facenet512, and VGG-Face models
- Performance Optimized: Smart pre-filtering and embedding caching for fast processing
- Consensus Matching: Multiple reference images with configurable agreement thresholds
- Frame Export: Export detected frames with red bounding boxes around matched faces
- Progress Tracking: Real-time face count display and processing statistics
- Resume Support: Automatic checkpointing for processing large videos
- Multiple Output Formats: Console output, CSV export, and frame segments
- Highly Configurable: Extensive command-line options for fine-tuning
# Basic usage - find a person in a video
face-finder person.jpg video.mp4
# Use multiple reference images for better accuracy
face-finder ./reference_photos/ video.mp4 --consensus 0.7
# Export frames with bounding boxes
face-finder person.jpg video.mp4 --export-frames ./detected_faces/
Build and install Face Finder as a package for easy distribution:
# Install build tools
pip install build
# Build the package
./build.sh
# Install the package
pip install dist/face_finder-1.0.0-py3-none-any.whl
# Run from anywhere
face-finder person.jpg video.mp4
For development or when you want to modify the code:
# Clone/download the project
cd face-finder
# Create virtual environment (recommended)
python -m venv face-finder-env
source face-finder-env/bin/activate # On Windows: face-finder-env\Scripts\activate
# Install in development mode
pip install -e .
# Run the tool
face-finder person.jpg video.mp4
Run directly without installation:
# Install dependencies
pip install -r requirements.txt
# Run the script directly
python face_finder/cli.py person.jpg video.mp4
- Build the package (see Option 1)
- Copy the wheel file to your target machine:
scp dist/face_finder-1.0.0-py3-none-any.whl user@remote-machine:~/
- Install on the target machine:
# Create virtual environment (recommended)
python -m venv face-finder-env
source face-finder-env/bin/activate
# Install the package
pip install face_finder-1.0.0-py3-none-any.whl
# Run the tool
face-finder person.jpg video.mp4
- Python 3.8+
- 4GB+ RAM (more for large videos or many reference images)
- Storage: ~500MB for dependencies, plus space for video files and exported frames
- GPU: Optional (CPU-only mode is used by default for stability)
Find timestamps when a specific person appears in a video.
face-finder <reference_images> <video_file> [options]
python face_timestamp_finder.py <reference_images> <video_file> [options]
reference_images: One of the following:
- Path to a single reference image
- Path to a directory containing multiple reference images
- Comma-separated list of image file paths
video_file: Path to the video file to search
- --interval N: Check every Nth frame (default: 24). Lower values are more accurate but slower
- --tolerance X: Face matching sensitivity from 0.0-1.0 (default: 0.6). Lower values are stricter
- --fast: Use fast OpenCV pre-filtering (default: enabled)
- --slow: Disable fast pre-filtering for maximum accuracy
- --csv FILE: Export results to the specified CSV file (if not provided, a filename is auto-generated)
- --segments: Group consecutive detections into continuous appearance segments
- --no-resume: Disable resume functionality and start from the beginning (resume is enabled by default)
- --min-matches N: Minimum number of reference images that must match (default: 1)
- --consensus X: Fraction of reference images that must agree (0.0-1.0, default: 0.5)
- --strict: Enable strict mode: requires ALL reference images to match
- --max-width N: Maximum width for frame processing (default: 900). Lower = faster processing
- --clear-cache: Clear cached reference embeddings and recreate them
- --export-frames [DIR]: Export detected frames with red bounding boxes to a directory (default: face_output)
Package installation examples:
# Single reference image
face-finder person.jpg video.mp4
# Multiple reference images from directory
face-finder ./reference_photos/ video.mp4
# Group detections into continuous segments (better for long videos)
face-finder ./reference_photos/ video.mp4 --segments
# Reduce false positives with strict matching
face-finder ./reference_photos/ video.mp4 --strict
# Require consensus from multiple reference images
face-finder ./reference_photos/ video.mp4 --consensus 0.8 --min-matches 3
# Export results to custom CSV file
face-finder person.jpg video.mp4 --segments --csv segments.csv
# Faster processing with smaller frame size
face-finder ./reference_photos/ video.mp4 --max-width 640
# Maximum speed (lower quality but much faster)
face-finder ./reference_photos/ video.mp4 --max-width 480
# Clear embedding cache and recreate
face-finder ./reference_photos/ video.mp4 --clear-cache
# Export detection frames with bounding boxes (uses default "face_output" directory)
face-finder person.jpg video.mp4 --export-frames
# Export to custom directory
face-finder person.jpg video.mp4 --export-frames ./detected_faces/
# Combine frame export with segments and CSV
face-finder ./reference_photos/ video.mp4 --segments --csv results.csv --export-frames
Direct script examples:
# Single reference image
python face_timestamp_finder.py person.jpg video.mp4
# Multiple reference images from directory
python face_timestamp_finder.py ./reference_photos/ video.mp4
# Multiple specific reference images
python face_timestamp_finder.py person1.jpg,person2.jpg,person3.jpg video.mp4
# Check every frame for maximum accuracy
python face_timestamp_finder.py ./reference_photos/ video.mp4 --interval 1
# Use stricter matching
python face_timestamp_finder.py person.jpg video.mp4 --tolerance 0.4
# Force start from beginning (ignore any existing checkpoints)
python face_timestamp_finder.py person.jpg video.mp4 --no-resume
The tool provides two output modes: individual detections and grouped segments.
Console Output:
01:23:45 (83.25s) - Score: 87.5%
01:28:12 (88.12s) - Score: 92.1%
Results exported to: face_detections_video.csv
CSV Columns:
- timestamp_hms: Time in HH:MM:SS format (e.g., "01:23:45")
- timestamp_seconds: Precise time in seconds (e.g., 83.25)
- match_score_percent: Confidence score percentage (e.g., 87.5)
- consensus_percent: Percentage of reference images that agreed (e.g., 75.0)
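If you want to post-process results, the detections CSV can be read with standard tooling. Below is a minimal sketch, assuming pandas is installed and a file named face_detections_video.csv with the columns listed above:

import pandas as pd

# Load the per-frame detections CSV produced by the tool
df = pd.read_csv("face_detections_video.csv")

# Keep only high-confidence detections and list them in order
strong = df[df["match_score_percent"] >= 85].sort_values("timestamp_seconds")
for _, row in strong.iterrows():
    print(f'{row["timestamp_hms"]}  score={row["match_score_percent"]}%  '
          f'consensus={row["consensus_percent"]}%')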
Perfect for long videos where a person appears continuously across multiple frames.
Console Output:
01:23:45 - 01:25:30 (Duration: 00:01:45) - Avg Score: 89.2% - Detections: 8
01:28:12 - 01:28:45 (Duration: 00:00:33) - Avg Score: 91.5% - Detections: 4
Segments exported to: face_segments_video.csv
CSV Columns:
- start_time_hms/start_time_seconds: When the person first appears
- end_time_hms/end_time_seconds: When the person was last detected
- duration_hms/duration_seconds: How long they appeared continuously
- avg_score_percent: Average confidence across all detections in segment
- max_score_percent: Best confidence score in the segment
- avg_consensus_percent: Average consensus across all detections in segment
- max_consensus_percent: Best consensus score in the segment
- detection_count: Number of individual frames detected in this segment
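As a small example of working with these columns, the sketch below (standard library only, using the illustrative filename face_segments_video.csv) prints segments lasting at least ten seconds:

import csv

# Print segments where the person stays on screen for 10+ seconds
with open("face_segments_video.csv", newline="") as f:
    for row in csv.DictReader(f):
        if float(row["duration_seconds"]) >= 10:
            print(f'{row["start_time_hms"]} - {row["end_time_hms"]} '
                  f'({row["duration_hms"]}), avg score {row["avg_score_percent"]}%')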
Segment Logic:
- Consecutive detections (based on --interval) are grouped together
- A segment ends when the person is not detected in the next expected frame
- Even single-frame appearances count as segments
- Useful for understanding continuous presence rather than individual frame hits
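The grouping rule above can be illustrated with a short sketch. This is not the tool's internal code, just the same idea expressed in Python: detections whose frame numbers are no more than one interval apart belong to the same segment.

def group_into_segments(detected_frames, interval):
    # Illustrative only: a gap larger than `interval` frames ends a segment
    segments, current = [], []
    for frame in sorted(detected_frames):
        if current and frame - current[-1] > interval:
            segments.append((current[0], current[-1]))
            current = []
        current.append(frame)
    if current:
        segments.append((current[0], current[-1]))
    return segments

# Detections at every 24th frame with one gap
print(group_into_segments([0, 24, 48, 120, 144], interval=24))
# -> [(0, 48), (120, 144)]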
The --export-frames option allows you to save the actual video frames where faces are detected, complete with red bounding boxes around the detected faces.
- Automatic detection: Only frames with positive face matches are exported
- Bounding boxes: Red rectangles are drawn around all detected faces in the frame
- Original quality: Exported frames maintain the original video resolution (not downscaled)
- Smart naming: Files include frame number, timestamp, confidence score, and consensus percentage
detection_frame_00012345_01h23m45s_score87_consensus75.jpg
Where:
- 00012345: Frame number (8 digits, zero-padded)
- 01h23m45s: Timestamp in hours, minutes, seconds
- score87: Confidence score (rounded to the nearest integer)
- consensus75: Consensus percentage, i.e. how many reference images agreed (rounded to the nearest integer)
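If you need to recover this metadata programmatically, a regular expression works. The sketch below assumes the two-digit hour/minute/second fields shown in the example above:

import re

pattern = re.compile(
    r"detection_frame_(\d{8})_(\d{2})h(\d{2})m(\d{2})s_score(\d+)_consensus(\d+)\.jpg"
)

name = "detection_frame_00012345_01h23m45s_score87_consensus75.jpg"
m = pattern.match(name)
if m:
    frame, hh, mm, ss, score, consensus = m.groups()
    print(f"frame {int(frame)} at {hh}:{mm}:{ss}, score {score}%, consensus {consensus}%")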
# Basic frame export (uses default "face_output" directory)
face-finder person.jpg video.mp4 --export-frames
# Export with custom output directory
face-finder ./refs/ video.mp4 --export-frames /home/user/detected_faces/
# Combine with other options
face-finder person.jpg video.mp4 --export-frames --segments --csv results.csv
# High-quality frame export (slower but better quality)
face-finder ./refs/ video.mp4 --export-frames ./frames/ --max-width 1920 --slow
- Manual verification: Visually confirm detection accuracy
- Dataset creation: Build training datasets from video footage
- Evidence collection: Extract specific moments for documentation
- Content analysis: Study facial expressions or contexts
- Quality assessment: Evaluate detection performance across different conditions
The tool automatically resumes from where it left off if interrupted, making it perfect for processing very large video files.
- Automatic checkpoints: Progress is saved every 10 processed frames
- Smart validation: Only resumes if video file, interval, and tolerance settings match
- Crash recovery: Can resume after unexpected shutdowns, power failures, or interruptions
- No lost work: Preserves all detections found so far
- Auto-cleanup: Removes checkpoint files after successful completion
# Start processing a large video
python face_timestamp_finder.py ../refs/ large_video.mp4
# If interrupted, simply run the same command again to resume
python face_timestamp_finder.py ../refs/ large_video.mp4
# Force restart from beginning (ignoring checkpoints)
python face_timestamp_finder.py ../refs/ large_video.mp4 --no-resume
- Stored as hidden files: .face_finder_checkpoint_{hash}.json
- Contain video hash, progress, detections, and settings
- Automatically deleted on successful completion
- Safe to manually delete if you want to start fresh
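Because the exact checkpoint schema may change between versions, the safest way to inspect one is to load it and look. A minimal sketch:

import glob
import json

# List any checkpoint files in the working directory and show their top-level keys
for path in glob.glob(".face_finder_checkpoint_*.json"):
    with open(path) as f:
        checkpoint = json.load(f)
    print(path, "->", sorted(checkpoint.keys()))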
The tool uses multiple optimization strategies to dramatically speed up face detection:
- Fast mode (default): Uses OpenCV's Haar cascades to quickly detect if any face exists in a frame
- Only processes frames with faces: Skips expensive DeepFace analysis on frames without faces
- Typical speedup: 10-50x faster than processing every frame with DeepFace
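The idea behind this pre-filter can be sketched with OpenCV's bundled Haar cascade. This is an illustration of the approach, not the tool's exact code:

import cv2

# OpenCV ships a frontal-face Haar cascade; load it once
cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
)

def frame_has_face(frame_bgr):
    # Cheap check: if the cascade finds nothing, skip DeepFace for this frame
    gray = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2GRAY)
    faces = cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
    return len(faces) > 0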
- Pre-computed embeddings: Reference images are processed once at startup into facial embeddings
- Automatic caching: Embeddings are cached to disk and reused across runs with the same reference images
- Single frame processing: Each video frame only needs one embedding computation
- Vector comparison: Fast cosine distance calculation between embeddings
- Massive speedup: 5-10x faster than traditional image-to-image comparison
- Cache invalidation: Automatically detects when reference images change and rebuilds cache
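A rough sketch of the embedding comparison follows, assuming a recent deepface release where DeepFace.represent returns a list of dicts with an "embedding" field; the 0.6 threshold mirrors the default --tolerance and is illustrative only:

import numpy as np
from deepface import DeepFace

def embed(image_path, model_name="ArcFace"):
    reps = DeepFace.represent(img_path=image_path, model_name=model_name,
                              enforce_detection=False)
    return np.array(reps[0]["embedding"])

def cosine_distance(a, b):
    return 1.0 - np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))

ref = embed("person.jpg")    # reference embedding, computed once and cached
frame = embed("frame.jpg")   # one embedding per processed video frame
print("match" if cosine_distance(ref, frame) < 0.6 else "no match")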
- No temporary files: Frames are processed directly in memory without disk I/O
- Eliminated bottleneck: Removes file write/read overhead that was slowing down processing
- Additional speedup: 2-3x improvement from removing disk operations
- Automatic downscaling: Scales frames to a maximum width before processing (default: 900px)
- Maintains aspect ratio: Preserves image proportions while reducing processing load
- Configurable: Use --max-width to control the speed vs. accuracy trade-off
- 4K video (3840px width) → 900px = ~4x faster processing
- 1080p video (1920px width) → 900px = ~2x faster processing
- 720p video (1280px width) → 900px = ~1.4x faster processing
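As an illustration of this downscaling step (not the tool's exact code), a frame can be resized to a maximum width while keeping its aspect ratio:

import cv2

def downscale(frame, max_width=900):
    # Resize so width <= max_width, preserving the aspect ratio
    h, w = frame.shape[:2]
    if w <= max_width:
        return frame
    scale = max_width / w
    return cv2.resize(frame, (max_width, int(h * scale)),
                      interpolation=cv2.INTER_AREA)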
- For maximum speed: Use --max-width 480 with --interval 30
- Balanced speed/quality: Default settings (900px width, interval 24)
- High quality: Use --max-width 1280 with --slow mode and --interval 10
- Frame interval: Increase --interval for faster processing if you don't need to catch every appearance
- Multiple reference images: The embedding approach scales efficiently with more reference photos
The tool automatically caches reference image embeddings to speed up subsequent runs:
- Cache files: Stored as hidden .face_finder_embeddings_*.json files
- Smart invalidation: Cache is rebuilt when reference images are added, removed, or modified
- Instant startup: Cached embeddings load in seconds instead of minutes for large reference sets
- Manual clearing: Use --clear-cache to force recreation of embeddings
- No maintenance needed: Cache files are automatically managed and cleaned up
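One common way to implement this kind of invalidation is to derive the cache key from the reference files themselves. The sketch below is illustrative and not necessarily the scheme Face Finder uses:

import hashlib
from pathlib import Path

def reference_cache_key(reference_dir):
    # Hash every reference image's name and contents, so adding, removing,
    # or editing a photo changes the key and forces a cache rebuild
    digest = hashlib.sha256()
    for path in sorted(Path(reference_dir).glob("*")):
        if path.suffix.lower() in {".jpg", ".jpeg", ".png"}:
            digest.update(path.name.encode())
            digest.update(path.read_bytes())
    return digest.hexdigest()[:16]

print(reference_cache_key("./reference_photos/"))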
The consensus system helps ensure reliable detections when using multiple reference images:
# Require 70% of reference images to agree
face-finder ./refs/ video.mp4 --consensus 0.7
# Require at least 3 reference images to match
face-finder ./refs/ video.mp4 --min-matches 3
# Strict mode: ALL reference images must agree
face-finder ./refs/ video.mp4 --strict
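How these flags combine can be sketched as follows; passes_consensus is a hypothetical helper for illustration, taking one cosine distance per reference image for the face under test:

def passes_consensus(distances, tolerance=0.6, consensus=0.5,
                     min_matches=1, strict=False):
    # Count reference images whose distance beats the tolerance
    matches = sum(1 for d in distances if d < tolerance)
    if strict:
        return matches == len(distances)
    return matches >= min_matches and matches / len(distances) >= consensus

# Three of four reference images agree -> 75% consensus
print(passes_consensus([0.35, 0.42, 0.55, 0.71], consensus=0.7, min_matches=3))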
# Maximum speed (lower quality)
face-finder person.jpg video.mp4 --max-width 480 --interval 30
# Maximum accuracy (slower)
face-finder person.jpg video.mp4 --max-width 1920 --interval 1 --slow
# Balanced (recommended)
face-finder person.jpg video.mp4 --max-width 900 --interval 24
Slow processing:
- Reduce --max-width (e.g., 640 instead of 900)
- Increase --interval (process fewer frames)
- Use --fast mode (enabled by default)
Too many false positives:
- Decrease --tolerance (e.g., 0.4 or 0.5) for stricter matching
- Use more reference images with --consensus
- Try --strict mode for highest accuracy
Missing detections:
- Increase --tolerance (e.g., 0.7 or 0.8) to allow looser matches
- Use --slow mode for maximum accuracy
- Add more diverse reference images
Memory issues:
- Reduce --max-width significantly
- Process videos in smaller chunks
- Clear embedding cache with --clear-cache
Watch the progress bar for insights:
Processing frames: 45% |█████     | 450/1000 [02:15<02:30, 3.2frame/s] Faces: 2/4 match
- High face counts: Expect slower processing
- Low match ratios: May need better reference images
- "No faces detected": Fast processing, good performance