Advice
0 votes
0 replies
31 views

I tracked a 3DMM object in a video stream. The tracking model performs bundle adjustment and estimates camera intrinsics and object scale. However, neither the intrinsics nor the object size are ...
sophie's user avatar
  • 88
-10 votes
0 answers
105 views

I am working on a system that renders input fields on top of PDF forms using coordinates extracted from a document analysis tool. Current Setup For each form, we receive coordinates (X, Y, width, ...
Ranit Mondal's user avatar
Advice
0 votes
0 replies
21 views

I’m architecting a document processing pipeline for a real-time workflow. I have 5 document types, but they require two completely different extraction strategies. The Document for Dynamic Form: This ...
JS3's user avatar
  • 1,951
1 vote
0 answers
111 views

I am working on a task where I need to detect the four corners of each square in an image and highlight each corner as a single pixel. Each filter is intended to respond to a specific corner ...
Marcel Majhenic's user avatar
Advice
0 votes
1 reply
73 views

I’m working on a project where I want to use AI / computer vision to read MEP (Mechanical, Electrical, Plumbing) drawings or floor plans. My goal is to: Detect rooms and extract their labels (e.g., “...
Narmeen Zafar's user avatar
Tooling
0 votes
0 replies
37 views

I’m currently working on annotating a human pose dataset (specifically of people swimming) and I’m struggling to find a tool that fits my workflow. I’m looking for a click‑based labeling workflow, ...
Jan Lattenkamp's user avatar
Advice
0 votes
1 reply
74 views

I'm tracking a target from a UAV using a thermal camera. Detection is YOLOv8n running every N frames on an NPU, and I need something to hold the track between detections. What I've tried: Template ...
Klim's user avatar
  • 1
Advice
0 votes
1 reply
55 views

I am currently working on a project focused on automated computer vision-based behavior recognition for captive dolphins. I would like to ask about your experience using SLEAP for model training—...
蔡秀蘭's user avatar
Advice
2 votes
3 replies
118 views

I am trying to build a computer vision system to analyze classroom behavior from surveillance cameras. The goal is to automatically detect several behavioral indicators such as: - student attention - ...
AI助教_曾子昕's user avatar
1 vote
1 answer
155 views

I am building a custom Multi-Object Tracking (MOT) system using Python, OpenCV, and TensorFlow. My goal is to track people and perform real-time clothing recognition. To prevent ID switches when a ...
BestlabChill's user avatar
Advice
0 votes
1 reply
143 views

I am building a simple AI hand tracking application using MediaPipe and OpenCV in Python. The program reads frames from a webcam, processes them with MediaPipe Hands, and draws the hand landmarks on ...
WoW Sky's user avatar
Advice
0 votes
1 reply
58 views

I am experimenting with the V-JEPA model developed by Meta for video understanding. My goal is to analyze a live video stream of people attending a seminar and determine their engagement level (for ...
Harshitha Gangu's user avatar
Advice
0 votes
3 replies
118 views

import cv2
import pytesseract
import numpy as np
image_path = "elecBill.jpg"
img = cv2.imread(image_path)
# Resize (VERY IMPORTANT)
img = cv2.resize(img, None, fx=2, fy=2, interpolation=...
Sidharth Kumar's user avatar
Tooling
0 votes
0 replies
61 views

I am currently developing a hobbyist search and retrieval robot. I'm running an Astra Pro RGB-D camera on a Jetson Nano, with ROS Melodic and Ubuntu 18.04. My task is to create a robot that can detect ...
Jyhrie's user avatar
  • 1
Advice
2 votes
0 replies
141 views

I am working on a person recognition system for learning purposes. My goal is: maintain a small gallery of known people (multiple images per person); given a new query image, return the most similar ...
Shanthini M's user avatar
Tooling
1 vote
2 replies
85 views

(This is my first time posting, so feedback is welcome!) I am working on a depth-based vision system where I need to detect packages of a single type in such a way that I retrieve: Their center in ...
Vincenzo's user avatar
Best practices
0 votes
0 replies
99 views

I am working with the ODIR-5K (Ocular Disease Intelligent Recognition) dataset. The goal is multi-label classification of 8 ocular diseases (Normal, Diabetes, Glaucoma, Cataract, etc.). The Data ...
Nitish Kumar's user avatar
Advice
0 votes
3 replies
75 views

I am trying to use a custom stereo camera setup for depth estimation. The first step is to perform stereo calibration, for which I use a Charuco board and standard OpenCV functions (calibrateCamera, ...
Andrew's user avatar
  • 525
Tooling
1 vote
2 replies
31 views

I am currently working on a tool that takes in every webpage of a site, adds alt text to images missing it, and outputs the webpage as a PDF for archival purposes. For the generation of alt text, is ...
Joel Singh's user avatar
Best practices
2 votes
2 replies
47 views

I am working on a classroom attendance / seat occupancy visualization system and I am struggling with the system design rather than the detection model itself. Scenario A fixed-position surveillance ...
猪猪猪猪's user avatar
1 vote
2 answers
124 views

I use cv2.FlannBasedMatcher to detect some objects. I got good accuracy and would like to get the (x, y) of each group of points. What I have: What I'd like to get: Here is my function: def detect(self): ...
Ennjin's user avatar
  • 75
Advice
0 votes
4 replies
73 views

I have images of a chair seat with and without a surface defect known as sink marks (see example image below). My dataset is very small: 15 images of good chairs and only 3 images with sink mark ...
Optical_flow_lover's user avatar
0 votes
1 answer
60 views

Can we detect ChArUco markers after inverting their colors? My client wants a color-inverted ChArUco board. import os import numpy as np import cv2 # ----------------------------...
Rishmika Wijewardhana's user avatar
Advice
0 votes
1 reply
48 views

I have a sample dataset where prices stay stable for a few days/weeks/months, then fall or rise and stay at the new price for a few days/weeks/months. Basically, it looks like steps down and up when you ...
Shridhar Kulkarni's user avatar
Advice
3 votes
0 replies
189 views

I’m working on face liveness (anti-spoofing) detection intended for real-world mobile apps (Flutter), and I’m struggling to achieve reliable performance outside controlled datasets. What I’m trying to ...
Mr x's user avatar
  • 11
Advice
0 votes
1 reply
75 views

I have this data:
Name,X1,Y1,X2,Y2,X3,Y3,X4,Y4,,centroid_x,centroid_y,,area
R1-A,79,55,70,87,154,78,159,48,,115.5,67,,2486
R1-B,1108,23,1126,51,1197,44,1174,14,,1151.25,33,,2150.5
R1-C,2134,53,2183,...
Munsif Ali's user avatar
  • 8,244
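
The centroid and area columns in the rows above appear consistent with the mean of the four vertices and the shoelace formula, respectively. The following is a minimal sketch under that assumption; the function name `quad_centroid_and_area` is hypothetical and is not taken from the question. Note that the vertex mean is not the same as the area-weighted centroid for an irregular quadrilateral.

```python
def quad_centroid_and_area(points):
    """Return (vertex-mean centroid, shoelace area) for a polygon given as [(x, y), ...]."""
    n = len(points)
    # Centroid as the plain average of the vertices (assumed to match the data's columns).
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    # Shoelace formula: sum of cross products of consecutive vertices.
    twice_area = 0.0
    for i in range(n):
        x0, y0 = points[i]
        x1, y1 = points[(i + 1) % n]
        twice_area += x0 * y1 - x1 * y0
    return (cx, cy), abs(twice_area) / 2.0


# Row R1-A from the excerpt: X1,Y1 ... X4,Y4
r1_a = [(79, 55), (70, 87), (154, 78), (159, 48)]
centroid, area = quad_centroid_and_area(r1_a)
print(centroid, area)  # (115.5, 67.0) 2486.0
```

Both complete rows in the excerpt (R1-A and R1-B) reproduce their listed centroid and area values with this calculation.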
Tooling
0 votes
4 replies
164 views

I am always confused about when to use TensorFlow and when to use PyTorch, because both are used for the same tasks. Which one fits which situation? Some people have suggested ...
Shrinivas Nadager's user avatar
1 vote
0 answers
65 views

I am training instance segmentation on a custom COCO dataset using PyTorch Mask R-CNN + FPN for 83 categories (+background). What is the problem with the following setup, and why is the RPN head not ...
SavEng's user avatar
  • 11
Advice
0 votes
2 replies
64 views

I'm writing a program to track my card games. It uses a bird's-eye view camera to record the playing surface, and a YOLO model to classify the cards. I'm running into an issue figuring out where the ...
terrenana's user avatar
Best practices
0 votes
2 replies
154 views

I’m trying to extract tabular data from a scanned engineering document. The table contains: merged header cells, irregular row heights, irregular column widths, faint and broken borders, text inside ...
pragyan lamba's user avatar
0 votes
2 answers
188 views

I have a problem I'm struggling with and can't seem to find a solution: I want to get the area of the contours I see in this image/video frame: The problem is the contour is cut off at the right as ...
Jakob Leboerg's user avatar
1 vote
1 answer
121 views

I'm currently working on object detection to count how many objects are present in the frame. I have already successfully separated some of them. There are still some objects that are very close together ...
Exto Logia's user avatar
Tooling
0 votes
2 replies
87 views

I am trying to create a 3D model from overlapping aerial images and am looking for free software to use. My dataset includes 1,500 RGB images, a ground truth segmentation mask for each image, the ...
Fred1313's user avatar
0 votes
1 answer
126 views

I want to close the borders of objects. Some of the objects could be rectangles too. I already tried dilation and closing with 2 iterations, but the border still isn't completely closed. Here is ...
Exto Logia's user avatar
1 vote
0 answers
141 views

I want to count samples in the image and measure the length of each sample as I show below. But I am facing a big problem: when samples overlap, I cannot get an accurate count, for example ...
user31726648's user avatar
1 vote
2 answers
126 views

Problem: I need to check whether a small black-and-white icon (template) appears inside a large, colored dashboard photo. The icon in the photo may differ from the template in color, scale, small ...
Dũng Hoàng's user avatar
0 votes
1 answer
141 views

I am using the MixStyle methodology for domain adaptation, and it involves using a custom layer that is inserted after every encoder stage. However, it is causing VRAM to grow linearly, which causes ...
Vedant Dalimkar's user avatar
0 votes
0 answers
110 views

I am trying to do batch inference with YOLO11. I am working on a MacBook and I am running into this issue:
from ultralytics import YOLO
import numpy as np
# Load YOLO model
model = YOLO("yolo11s....
Ananda's user avatar
  • 3,310
0 votes
2 answers
253 views

I have a large number of scanned discs like this: Actual image is 600 DPI, 7400x7400, 48 bit TIFF. I want to convert them to a JPEG like this: Same DPI, but cropped, and rotated so that it is ...
shogged's user avatar
  • 270
1 vote
1 answer
133 views

I have to find the contours of boxes. Some boxes have a diagonal inside them. I tried removing the diagonals, but I don't think that is the answer. Here are the preprocessed images and the contour result. Only ...
Lee Minhyeung's user avatar
0 votes
0 answers
280 views

I am training a NeRF with Nerfstudio using drone imagery from a MicaSense Rededge-P camera. For each capture I have metadata: lat, lon, alt (WGS84 position); yaw, pitch, roll (from MicaSense DLS, ...
Evan Hammam's user avatar
2 votes
1 answer
242 views

I am trying to extract numbers from dotted LED-style digits (0–9) using Tesseract OCR in a MAUI/Xamarin app on Android and iOS, fully offline. My boss wants a local solution that works on mobile ...
boss's user avatar
  • 1,648
3 votes
2 answers
200 views

I'm trying to understand how scikit-image's local_binary_pattern() function works. Let's take the simplest setup: input is a grayscale image, radius = 1, n_points = 4, method = "uniform". ...
J.D.'s user avatar
  • 309
1 vote
1 answer
64 views

I’m using the GLASS defect detection model and want to replace its default wideresnet50 backbone with efficientnetv2_m in shell/run-custom.sh. However, when I run bash run-custom.sh I get the ...
aniaf's user avatar
  • 19
0 votes
1 answer
174 views

I'm using the Wan2.1-VACE video generation model, and during inference I encountered a RuntimeError related to mismatched tensor shapes in a torch.cat operation inside the vace_latent() function. From ...
范姜伯軒's user avatar
3 votes
1 answer
123 views

My intention is to create a charuco board object, which supports custom ids. Here is the code snippet being used. def __init__(self, squaresX=11, squaresY=8, squareLength=0.015, markerLength=0.011, ...
Tommy Llewellyn's user avatar
0 votes
1 answer
105 views

Background: Currently, I need to crop the head or tail of the grayscale image of the steel, but this image has problems such as uneven grayscale distribution. How can I find this cutting point? My ...
Mumu's user avatar
  • 49
-1 votes
1 answer
147 views

Background: I currently have many grayscale images of steel. Some of them have high brightness, while others have uneven brightness. As shown in the figure below, how can I better extract their ...
Mumu's user avatar
  • 49
2 votes
1 answer
96 views

I am trying to get OpenCV-python to recognize a checkerboard pattern from my thermal camera. I couldn't get that working. This is the thermal image and I realize the image is low resolution, but I can'...
준서이's user avatar
-1 votes
1 answer
79 views

I have a function which is supposed to return the rotation vector and translation vector (rvec and tvec) given some 3d points, some 2d points, and an intrinsics matrix def solvePnP(points_3d: list[...
Tommy Llewellyn's user avatar
