This project is designed to stream video from a FLIR camera to a virtual video device using v4l2loopback. It supports high frame rates (up to 60 FPS) and can overlay additional information such as sensor data and HMI elements onto the video stream. The project is optimized for performance using CUDA for image processing tasks.
This FLIR camera streaming application follows a modular architecture designed for high-performance real-time video processing and sensor data integration. The system is composed of several key modules:
```
┌───────────────┐
│  Camera Img   │◄─────────────────────────────────────────────────┐
└───────────────┘                                                  │
        │                                                          │
        ▼                                                          │
┌───────────────┐       ┌───────────────┐       ┌───────────────┐  │
│ BayerRG → RGB │──────→│  CYRA motion  │──────→│  KBM motion   │  │
└───────────────┘       └───────────────┘       └───────────────┘  │
        │                                                          │
        ▼                                                          │
┌───────────────┐       ┌───────────────┐       ┌───────────────┐  │
│  Merge Image  │◄──────│ Fisheye Dist. │◄──────│  Homography   │  │
└───────────────┘       └───────────────┘       └───────────────┘  │
        │                                                          │
        ▼                                                          │
┌───────────────┐       ┌───────────────┐       ┌───────────────┐  │
│  RGB → YUYV   │──────→│ Write to v4l2 │──────→│  Release Img  │──┘
└───────────────┘       └───────────────┘       └───────────────┘
```
- Command-line argument parsing and configuration
- Signal handling for graceful shutdown
- Orchestrates initialization of all subsystems
- Supports multiple operation modes: streaming, calibration, and testing
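The signal-handling piece above typically boils down to a flag that the streaming loop polls. A minimal sketch (the flag and handler names here are assumptions, not the project's actual identifiers):

```cpp
#include <atomic>
#include <csignal>

// Hypothetical shutdown flag polled by the streaming loop.
static std::atomic<bool> g_running{true};

static void handle_signal(int) {
    // Only set a flag: camera/V4L2 cleanup is not async-signal-safe,
    // so it happens in the main loop after the loop exits.
    g_running = false;
}

int main(int argc, char** argv) {
    std::signal(SIGINT, handle_signal);   // Ctrl+C
    std::signal(SIGTERM, handle_signal);  // kill / systemd stop

    // parse arguments, initialize camera, CUDA, V4L2 ...
    while (g_running) {
        // capture → process → write one frame
    }
    // release the camera, close the V4L2 device, join worker threads
    return 0;
}
```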
- Purpose: Core video capture and streaming functionality
- Key Components:
- FLIR camera initialization and control using Spinnaker SDK
- Real-time frame capture with configurable FPS (up to 60)
- Integration with V4L2 loopback device for video output
- Multi-threaded sensor data reception and processing
- Purpose: High-performance GPU-accelerated image processing
- Key Components:
- `CudaImageConverter`: Handles Bayer to RGB and RGB to YUYV format conversion (see the kernel sketch below)
- `CudaResolution`: Manages image scaling and resolution adjustments
- Kernel implementations for parallel processing on GPU
- Optimized for NVIDIA Jetson AGX Xavier (Compute Capability 7.2)
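To make the conversion step concrete, here is a simplified CUDA kernel for the RGB → YUYV packing stage (a sketch only; the actual `CudaImageConverter` kernels, color coefficients, and memory layout may differ). YUYV keeps one luma byte per pixel and shares one U/V pair between each two horizontally adjacent pixels, which is why a thread handles a pixel pair:

```cuda
// Each thread packs one horizontal pixel pair: 2 RGB pixels -> 4 YUYV bytes.
// BT.601 full-range coefficients are assumed here for illustration.
__global__ void rgb_to_yuyv(const unsigned char* rgb, unsigned char* yuyv,
                            int width, int height) {
    int pair = blockIdx.x * blockDim.x + threadIdx.x;  // pixel-pair index
    int row  = blockIdx.y * blockDim.y + threadIdx.y;
    if (pair >= width / 2 || row >= height) return;

    const unsigned char* p0 = rgb + (row * width + pair * 2) * 3;
    const unsigned char* p1 = p0 + 3;

    float y0 =  0.299f * p0[0] + 0.587f * p0[1] + 0.114f * p0[2];
    float y1 =  0.299f * p1[0] + 0.587f * p1[1] + 0.114f * p1[2];
    // Chroma is taken from the first pixel of the pair (averaging is also common).
    float u  = -0.169f * p0[0] - 0.331f * p0[1] + 0.500f * p0[2] + 128.0f;
    float v  =  0.500f * p0[0] - 0.419f * p0[1] - 0.081f * p0[2] + 128.0f;

    unsigned char* out = yuyv + (row * width + pair * 2) * 2;
    out[0] = (unsigned char)fminf(fmaxf(y0, 0.0f), 255.0f);
    out[1] = (unsigned char)fminf(fmaxf(u,  0.0f), 255.0f);
    out[2] = (unsigned char)fminf(fmaxf(y1, 0.0f), 255.0f);
    out[3] = (unsigned char)fminf(fmaxf(v,  0.0f), 255.0f);
}
```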
- Purpose: Image processing, calibration, and visual component rendering
- Key Components:
- `Fisheye`: Fisheye camera calibration and undistortion
- `Homography`: Perspective transformation for top-down view
- `Component System`: Modular visual overlay system (see the sketch below)
- `StreamImage`: Container for multiple visual components
- `LineComponent`: Renders driving lines and predictions
- `TextComponent`: Displays text information
- `ImageComponent`: Handles image overlays
- `RingBuffer`: Efficient circular buffer for image data
- `PIDGammaController`: Automatic exposure and gamma correction
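Conceptually, the component system boils down to an overlay interface plus a container that renders overlays in order. The sketch below uses hypothetical names (`Component`, `OverlayStack`, `SpeedText`); the project's actual `StreamImage` and `*Component` classes may look different:

```cpp
#include <memory>
#include <utility>
#include <vector>
#include <opencv2/opencv.hpp>

// Hypothetical base interface: every overlay knows how to draw itself.
struct Component {
    virtual ~Component() = default;
    virtual void render(cv::Mat& frame) = 0;
};

// A StreamImage-like container holding an ordered list of overlays.
class OverlayStack {
public:
    void add(std::unique_ptr<Component> c) {
        components_.push_back(std::move(c));
    }
    void renderAll(cv::Mat& frame) {
        for (auto& c : components_) c->render(frame);  // insertion order
    }
private:
    std::vector<std::unique_ptr<Component>> components_;
};

// Example overlay in the spirit of TextComponent.
struct SpeedText : Component {
    double kmh = 0.0;
    void render(cv::Mat& frame) override {
        cv::putText(frame, cv::format("%.1f km/h", kmh), {40, 80},
                    cv::FONT_HERSHEY_SIMPLEX, 1.5, {255, 255, 255}, 2);
    }
};
```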
- Purpose: Real-time sensor data collection and network communication
- Key Components:
- `SocketBridge`: UDP socket communication for sensor data
- `SensorAPI`: Unified interface for accessing sensor values
- `DataLogger`: CSV logging of sensor data with timestamps
- Multi-port listening (base port, base+1, base+2) for different data streams (see the sketch below)
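The multi-port pattern amounts to binding one UDP socket per data stream and reading each on its own thread. A minimal POSIX sketch (the real `SocketBridge` API and its error handling will differ):

```cpp
#include <arpa/inet.h>
#include <sys/socket.h>
#include <unistd.h>
#include <cstdint>
#include <thread>
#include <vector>

// Bind a UDP socket on the given address/port (error handling elided).
static int bind_udp(const char* ip, uint16_t port) {
    int fd = socket(AF_INET, SOCK_DGRAM, 0);
    sockaddr_in addr{};
    addr.sin_family = AF_INET;
    addr.sin_port   = htons(port);
    inet_pton(AF_INET, ip, &addr.sin_addr);
    bind(fd, reinterpret_cast<sockaddr*>(&addr), sizeof(addr));
    return fd;
}

int main() {
    const uint16_t base = 10086;  // the default -p value
    std::vector<std::thread> listeners;
    // base: lower system, base+1: control tower, base+2: FleetMQ latency
    for (uint16_t off = 0; off < 3; ++off) {
        listeners.emplace_back([fd = bind_udp("0.0.0.0", base + off)] {
            char buf[1500];
            for (;;) {
                ssize_t n = recv(fd, buf, sizeof(buf), 0);
                if (n > 0) { /* parse datagram, update sensor values */ }
            }
        });
    }
    for (auto& t : listeners) t.join();
}
```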
- Purpose: Vehicle motion prediction and kinematic modeling
- Key Components:
- `CYRA Model`: Constant Yaw Rate and Acceleration prediction model (see the sketch below)
- `Bicycle Model`: Vehicle dynamics for steering prediction
- State prediction for autonomous vehicle path planning
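The CYRA model propagates position, heading, and speed under a constant yaw rate and constant acceleration. A forward-Euler discretization (a simplified sketch; the project may integrate the model in closed form instead) looks like this:

```cpp
#include <cmath>

struct State {
    double x, y;    // position (m)
    double theta;   // heading (rad)
    double v;       // speed (m/s)
};

// Predict the state `horizon` seconds ahead under constant yaw rate
// `omega` (rad/s) and constant acceleration `a` (m/s^2).
State predict_cyra(State s, double omega, double a,
                   double horizon, double dt = 0.01) {
    for (double t = 0.0; t < horizon; t += dt) {
        s.x     += s.v * std::cos(s.theta) * dt;
        s.y     += s.v * std::sin(s.theta) * dt;
        s.theta += omega * dt;
        s.v     += a * dt;
    }
    return s;
}
```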
- Image Acquisition: FLIR camera captures raw Bayer format images via Spinnaker SDK
- GPU Processing: CUDA kernels convert Bayer → RGB → YUYV with hardware acceleration
- Component Rendering: Visual overlays (driving lines, text, HMI elements) are rendered onto the image
- Sensor Integration: Real-time sensor data is received via UDP and integrated into visual components
- Output Streaming: Processed frames are written to V4L2 loopback device for consumption by external applications
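The output step relies on v4l2loopback accepting raw frames via plain `write()` calls. A bare-bones sketch, assuming the device has already been created and configured for YUYV (e.g., by `run/init_v4l2`); the resolution and pacing here are illustrative only:

```cpp
#include <fcntl.h>
#include <unistd.h>
#include <cstdint>
#include <vector>

int main() {
    // Loopback device created beforehand (the project uses /dev/video16).
    int fd = open("/dev/video16", O_WRONLY);
    if (fd < 0) return 1;

    const int width = 640, height = 480;                  // illustrative only
    std::vector<uint8_t> frame(width * height * 2, 128);  // YUYV = 2 bytes/pixel

    // Each write() hands one complete frame to downstream readers
    // (ffplay, FleetMQ, ...).
    for (;;) {
        if (write(fd, frame.data(), frame.size()) < 0) break;
        usleep(1000000 / 60);  // pace at roughly 60 FPS
    }
    close(fd);
    return 0;
}
```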
- Multi-threading: Separate threads for camera capture, sensor data reception, and processing
- CUDA Acceleration: GPU-based image format conversion achieving ~8ms processing time
- Memory Management: Pinned CUDA memory for efficient CPU-GPU data transfer
- Circular Buffers: Lock-free ring buffers for high-throughput data handling
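As an example of the pinned-memory point above, a page-locked host buffer allows truly asynchronous host-to-device copies; with pageable memory, `cudaMemcpyAsync` silently degrades to a synchronous copy. A generic sketch (frame size and stream usage are illustrative):

```cuda
#include <cuda_runtime.h>
#include <cstdint>

int main() {
    const size_t frameBytes = 1920 * 1080 * 3;  // illustrative RGB frame

    // Page-locked (pinned) host buffer: DMA-capable, enables async copies.
    uint8_t* hostFrame = nullptr;
    cudaHostAlloc(reinterpret_cast<void**>(&hostFrame), frameBytes,
                  cudaHostAllocDefault);

    uint8_t* devFrame = nullptr;
    cudaMalloc(reinterpret_cast<void**>(&devFrame), frameBytes);

    cudaStream_t stream;
    cudaStreamCreate(&stream);

    // Overlaps with work on other streams; requires pinned host memory.
    cudaMemcpyAsync(devFrame, hostFrame, frameBytes,
                    cudaMemcpyHostToDevice, stream);
    cudaStreamSynchronize(stream);

    cudaFree(devFrame);
    cudaFreeHost(hostFrame);
    cudaStreamDestroy(stream);
    return 0;
}
```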
The system supports runtime calibration for:
- Fisheye Distortion: Automatic calibration using chessboard patterns
- Homography Transformation: Perspective correction for bird's-eye view
- Sensor Mapping: Configurable sensor data sources and display parameters
This modular architecture enables easy extension for new sensors, different camera types, and additional visual components while maintaining high performance for real-time applications.
- OpenCV
- v4l2loopback
- CUDA
- Spinnaker SDK
```
run/build
run/init_v4l2
run/flir_stream
```
Now the stream is available on `/dev/video16`. If you want to view it, open a new terminal and run:
```
ffplay /dev/video16
```
```
run/build
run/streamming start [-delay <time_ms>] [-hmi] [-p_hmi]
```
For the parameters, please refer to `run/streamming -h`. You can check the logs in `run/logs/`.
Once the stream is started, you can view it in the FleetMQ Web app by navigating to https://app.fleetmq.com and logging in with your credentials. The stream will be available under the "Streams" section.
```
run/streamming restart [-delay <time_ms>] [-hmi] [-p_hmi]
```
```
run/streamming stop
```
The output data will be saved in the `run/output/` directory.
You can also run the binary directly with the following parameters (defaults in parentheses):

- `-h` to display the help message
- `-d <device>` to specify the video device (`/dev/video16`)
- `-fps <fps>` to specify the frames per second (`60`)
- `-scale <scale>` to specify the scale of the frame size (`1`)
- `-delay <time_ms>` to specify the delay time in milliseconds (`0`)
- `-s` to add the sensor data to the video stream (`false`)
- `-hmi` to add HMI to the stream
- `-p_hmi` to add Prediction HMI to the stream
- `-ip <ip>` to bind the IP address for the sensor data (`0.0.0.0`)
- `-p <port>` to bind the UDP port for the sensor data (`10086`)
- `-log <logger>` to specify the logger file
- `-fc` to calibrate the fisheye camera
- `-fu <image>` to undistort the image
- `-hc` to calibrate the homography matrix
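For example, `./flir_stream -d /dev/video16 -fps 60 -s -hmi` streams at 60 FPS with sensor data and the HMI overlay enabled.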
If the parameter `-s` is added, the software will try to bind the IP address and port to receive sensor data. If the bind succeeds, the driving-line and velocity components will be displayed on the video.

The software listens on `<port>` for the lower system data, on `<port> + 1` for the control tower data, and on `<port> + 2` for the latency reported by the FleetMQ SDK.
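With the defaults (`-ip 0.0.0.0 -p 10086`), for example, the three sockets bind to ports 10086, 10087, and 10088.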
To display the components, you will need to save the fisheye and homography matrix in the execution directory. You can find the tutorial below.
For production testing, we capture 1000 images, convert them from Bayer to YUYV format, and write them directly to the video device `/dev/video16`. A timer measures the time taken to process one image (all the way from Bayer to YUYV).
The conversion is implemented with sequential processing, parallel processing, and CUDA processing. The average time taken to process one image (each test run three times and averaged) is as follows:
Note: The test device is NVIDIA Jetson AGX Xavier (32G).
| Processing Type | Time (ms) |
| --- | --- |
| Bayer to RGB only | 8.7 |
| Sequential | 47.6 |
| Parallel | 16.0 |
| CUDA (pure stream) | 8.0 |
| CUDA (with components) | 10.0 |
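The measurement itself can be reproduced with a `std::chrono` wrapper like the one below (a sketch; `convert_bayer_to_yuyv` is a hypothetical stand-in for whichever implementation is under test):

```cpp
#include <chrono>
#include <cstdio>

// Hypothetical stand-in for the conversion under test
// (sequential, parallel, or CUDA).
static void convert_bayer_to_yuyv() { /* ... */ }

int main() {
    using clock = std::chrono::steady_clock;
    const int iterations = 1000;  // matches the 1000-image production test

    const auto start = clock::now();
    for (int i = 0; i < iterations; ++i)
        convert_bayer_to_yuyv();
    const std::chrono::duration<double, std::milli> elapsed = clock::now() - start;

    std::printf("average per image: %.2f ms\n", elapsed.count() / iterations);
    return 0;
}
```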
- Save your images in `run/data/*.jpg`. They should be taken with the fisheye camera using a chessboard that has 10x7 vertices and a square size of 2.5 cm, printed on A4 paper.
- Run the executable with the parameter `./flir_stream -fc` in the `run` directory.
- The program will output the matrix with distortion coefficients in `fisheye_calibration.yaml`.
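Under the hood this corresponds to OpenCV's fisheye calibration. A standalone sketch of the same procedure (the project's `-fc` implementation and the YAML key names here are assumptions):

```cpp
#include <opencv2/opencv.hpp>
#include <vector>

int main() {
    const cv::Size pattern(10, 7);   // chessboard vertices from the README
    const float square = 0.025f;     // 2.5 cm squares

    // 3D chessboard corners on the z = 0 plane, reused for every image.
    std::vector<cv::Point3f> board;
    for (int r = 0; r < pattern.height; ++r)
        for (int c = 0; c < pattern.width; ++c)
            board.emplace_back(c * square, r * square, 0.0f);

    std::vector<std::vector<cv::Point3f>> objPoints;
    std::vector<std::vector<cv::Point2f>> imgPoints;
    cv::Size imageSize;

    std::vector<cv::String> files;
    cv::glob("run/data/*.jpg", files);
    for (const auto& f : files) {
        cv::Mat gray = cv::imread(f, cv::IMREAD_GRAYSCALE);
        std::vector<cv::Point2f> corners;
        if (cv::findChessboardCorners(gray, pattern, corners)) {
            imgPoints.push_back(corners);
            objPoints.push_back(board);
            imageSize = gray.size();
        }
    }

    cv::Mat K, D;  // 3x3 camera matrix and 4 fisheye distortion coefficients
    std::vector<cv::Mat> rvecs, tvecs;
    cv::fisheye::calibrate(objPoints, imgPoints, imageSize, K, D, rvecs, tvecs,
                           cv::fisheye::CALIB_RECOMPUTE_EXTRINSIC |
                           cv::fisheye::CALIB_FIX_SKEW);

    cv::FileStorage fs("fisheye_calibration.yaml", cv::FileStorage::WRITE);
    fs << "K" << K << "D" << D;  // key names are assumptions
    return 0;
}
```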
- Run `run/flir_stream -fu <image>` in the `run` directory, where `<image>` is the image you want to undistort.
- The program will output the undistorted image in the same directory.
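The equivalent OpenCV call is `cv::fisheye::undistortImage`. A standalone sketch, assuming the YAML key names used in the calibration sketch above:

```cpp
#include <opencv2/opencv.hpp>

int main(int argc, char** argv) {
    if (argc < 2) return 1;

    // Load the intrinsics produced by the calibration step
    // (the key names "K" and "D" are assumptions).
    cv::FileStorage fs("fisheye_calibration.yaml", cv::FileStorage::READ);
    cv::Mat K, D;
    fs["K"] >> K;
    fs["D"] >> D;

    cv::Mat distorted = cv::imread(argv[1]);
    cv::Mat undistorted;
    // Reuse K as the new camera matrix for simplicity; a scaled matrix
    // can be used to control the undistorted field of view.
    cv::fisheye::undistortImage(distorted, undistorted, K, D, K);
    cv::imwrite("undistorted.jpg", undistorted);
    return 0;
}
```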
- Save your points in `homography_points.yaml` with the following format:

  ```yaml
  %YAML:1.0
  ---
  # Format: [left front wheel x/y, right front wheel x/y, left front 50m x/y, right front 50m x/y]
  points: [ 767., 2047., 2303., 2047., 1023., 1365., 2047., 1365. ]
  ```

- Run the executable with the parameter `./flir_stream -hc` in the `run` directory.
- The program will output the homography matrix in `homography_calibration.yaml`.

Note: The points should be pixel coordinates from an undistorted image.
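With four source points, the homography computation reduces to `cv::getPerspectiveTransform`. A standalone sketch (the destination rectangle and output key name are assumptions; the project's `-hc` code may choose different target coordinates):

```cpp
#include <opencv2/opencv.hpp>
#include <vector>

int main() {
    // Read the four calibration points saved by the user.
    cv::FileStorage in("homography_points.yaml", cv::FileStorage::READ);
    std::vector<float> p;
    in["points"] >> p;  // [lfw x/y, rfw x/y, lf50m x/y, rf50m x/y]

    const std::vector<cv::Point2f> src = {
        {p[0], p[1]}, {p[2], p[3]}, {p[4], p[5]}, {p[6], p[7]}
    };
    // Illustrative bird's-eye target: wheels at the bottom edge, the
    // 50 m markers at the top, preserving left/right ordering.
    const std::vector<cv::Point2f> dst = {
        {1024.f, 2047.f}, {2048.f, 2047.f}, {1024.f, 0.f}, {2048.f, 0.f}
    };

    const cv::Mat H = cv::getPerspectiveTransform(src, dst);

    cv::FileStorage out("homography_calibration.yaml", cv::FileStorage::WRITE);
    out << "homography" << H;  // key name is an assumption
    return 0;
}
```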