Expand description
Scene understanding and AI-powered video analysis for OxiMedia.
oximedia-scene provides comprehensive scene understanding and intelligent video
analysis capabilities for the OxiMedia multimedia framework. This includes:
- Scene Classification: Classify scenes (indoor/outdoor, day/night, landscape, portrait)
- Object Detection: Lightweight patent-free object detection
- Activity Recognition: Recognize activities (walking, running, sports)
- Shot Composition: Analyze framing (rule of thirds, symmetry, leading lines)
- Semantic Segmentation: Segment image into semantic regions (sky, ground, people)
- Saliency Detection: Identify visually important regions
- Aesthetic Scoring: Rate aesthetic quality of frames
- Event Detection: Detect events in sports and live content
- Face Detection: Lightweight face detection (Haar cascades)
- Logo Detection: Detect brand logos and graphics
§Patent-Free Algorithms
All algorithms are carefully selected to be patent-free:
- HOG (Histogram of Oriented Gradients): Object detection
- Haar Cascades: Face detection
- Color Histograms: Scene classification
- Motion Histograms: Activity recognition
- Spectral Saliency: Attention prediction
- Graph-based Segmentation: Semantic regions
- Rule-based Composition: Framing analysis
§Modules
classify: Scene, content, and quality classificationdetect: Object, face, logo, and text detectionactivity: Activity and sports recognitioncomposition: Composition rules, balance, and depth analysissegment: Semantic and foreground/background segmentationsaliency: Saliency detection and attention predictionaesthetic: Aesthetic quality scoring and feature extractionevent: Event detection for sports and live contentfeatures: Feature extraction and descriptors
§Example
use oximedia_scene::classify::scene::SceneClassifier;
use oximedia_scene::detect::face::FaceDetector;
use oximedia_scene::composition::rules::CompositionAnalyzer;
// Example usage
let classifier = SceneClassifier::new();
let face_detector = FaceDetector::new();
let composition = CompositionAnalyzer::new();Re-exports§
pub use error::SceneError;pub use error::SceneResult;
Modules§
- action_
beat - Action beat detection and analysis for video scenes.
- activity
- Activity recognition from motion patterns.
- adaptive_
scene - Adaptive scene detection with complexity histogram analysis.
- aesthetic
- Aesthetic quality assessment.
- audio_
visual_ correlation - Audio-visual correlation module for detecting synchronisation between audio events and visual changes.
- camera_
motion - Camera motion estimation and classification.
- classification
- Scene classification: genre detection, mood estimation, and location tags.
- classify
- Scene and content classification.
- color_
temperature - Scene color temperature estimation and analysis.
- common
- Common types and utilities used across modules.
- complexity_
detector - Complexity histogram-based scene detection.
- composition
- Shot composition analysis.
- content_
moderation - Content moderation analysis for video frames.
- continuity_
check - Continuity checking for video scene sequences.
- crowd_
density - Crowd density estimation for video frames.
- depth_
of_ field - Depth-of-field computation for cinematographic scene analysis.
- detect
- Object, face, logo, and text detection.
- emotion_
recognition - Emotion recognition module for analyzing facial expressions.
- error
- Error types for scene understanding.
- event
- Event detection in video.
- face_
landmark - Facial landmark detection and geometric face analysis.
- features
- Feature extraction and descriptors.
- lighting_
analysis - Lighting analysis for video frames.
- location
- Location and setting analysis for scenes.
- mood
- Scene mood analysis.
- motion_
energy - Motion energy module for quantifying overall motion intensity per scene segment.
- object_
tracker - Multi-object tracking with Kalman filter state estimation.
- pacing
- Scene pacing analysis.
- saliency
- Saliency detection and attention prediction.
- scene_
boundary - Scene boundary detection: types, descriptors, and frame-based detector.
- scene_
captioning - Scene captioning module for generating natural-language descriptions from scene features.
- scene_
graph - Scene graph structure for hierarchical narrative representation.
- scene_
metadata - Scene-level metadata: tags, environmental descriptors, and keyword sets.
- scene_
score - Scene quality scoring: per-category scores and aggregate scene score.
- scene_
stats - Per-scene metrics and aggregate statistics across a sequence of scenes.
- scene_
tags - Automatic hierarchical scene tagging with confidence-scored labels.
- segment
- Image segmentation — foreground/background and semantic region detection.
- segmentation
- Scene segmentation: shot boundary detection, scene grouping, and transition types.
- shot_
type - Shot type classification for cinematographic framing analysis.
- storyboard
- Scene storyboard generation.
- summarization
- Scene summarization: key shot extraction and scene-level digests.
- temporal_
graph - Temporal graph module for connecting scene analysis results across time.
- text_
detect - Advanced text detection using edge density, stroke width analysis, and text classification.
- thumbnail_
selector - Thumbnail selector module for choosing the most visually representative frame per scene.
- transition
- Scene transition analysis.
- visual_
quality_ map - Per-region visual quality mapping.
- visual_
rhythm - Visual rhythm extraction and analysis.