OCR in Gemini and ChatGPT, New era of digitalization of papers
In 2025, the landscape of AI models isn’t just about generating text. it’s about understanding the world through language, vision , audio,...
Video streaming Tutorial of sending processed Opencv video to NGINX and distributing video from NGINX (broadcast) by HLS stream for a wider ...
In 2025, the landscape of AI models isn’t just about generating text. it’s about understanding the world through language, vision , audio,...
This post walks through a complete Python code that captures an RTSP stream from OBS studio and rtspSimpleServer, detects motion against a s...
I'll explore techniques for segmenting, mapping, and tracking changes in skin anomalies (e.g., moles, spots, vessels) using dermatologic...
Multimodal models can process images, texts, and audio, and generate also various output representations. This feels like magic with tremend...
Essential Python Libraries for Computer Vision and Video Processing Python has become a leading language for machine learning and also resea...
OpenCV C++ Tutorial: Resizing an Image Resizing an image is a fundamental operation in image processing and computer vision. In this tutori...
This tutorial explains how to read RTSP streams using OpenCV, installed via VCPKG, and includes examples in both C++ and Python . Capturin...