Tong Guo

In robotics, 3D bounding boxes for objects can be annotated without LiDAR by using stereo cameras, or monocular cameras combined with depth estimation techniques. These methods rely on visual data captured by cameras to infer an object's depth and position.
For example, imagine taking two photos of the same object from slightly different angles, like how our eyes see the world. By comparing these images, we can estimate the distance (depth) of the object from the camera, similar to how we perceive depth with two eyes. This approach is called stereo vision. It calculates the disparity between the two images to determine depth, which is then used to annotate 3D positions in the camera’s XYZ coordinate system.
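The disparity-to-depth relation above can be sketched in a few lines. This is a minimal illustration assuming a rectified stereo pair with a known focal length (in pixels) and baseline (in meters); the function name and the example values are hypothetical, not from any specific rig.

```python
def depth_from_disparity(disparity_px: float, focal_px: float, baseline_m: float) -> float:
    """Depth of a point from a rectified stereo pair: Z = f * B / d.

    disparity_px: horizontal pixel shift of the point between the two images
    focal_px:     focal length in pixels
    baseline_m:   distance between the two camera centers in meters
    """
    if disparity_px <= 0:
        raise ValueError("disparity must be positive for a visible point")
    return focal_px * baseline_m / disparity_px

# Example: a point shifted 40 px between views, f = 800 px, baseline = 0.5 m
print(depth_from_disparity(40.0, 800.0, 0.5))  # 10.0 (meters)
```

Note how depth is inversely proportional to disparity: nearby objects shift a lot between the two views, distant objects barely at all, which mirrors how our two eyes perceive depth.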
Alternatively, for monocular cameras (a single lens), depth estimation relies on machine learning models trained to predict the distance based on object size, perspective, and texture in the image. For instance, if you take a photo of a car, the model identifies its position based on known shapes and dimensions, estimating how far it is.
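Learned monocular models combine many cues, but the known-size cue mentioned above reduces to simple pinhole geometry and is worth seeing in isolation: an object of real height H that appears h pixels tall at focal length f (pixels) lies at roughly Z = f * H / h. This sketch shows only that geometric cue; real models learn far richer relationships, and the numbers here are illustrative.

```python
def distance_from_known_height(real_height_m: float, pixel_height: float, focal_px: float) -> float:
    """Estimate distance to an object of known real-world height.

    Under a pinhole camera model, apparent size shrinks linearly with
    distance, so Z = f * H / h.
    """
    if pixel_height <= 0:
        raise ValueError("pixel height must be positive")
    return focal_px * real_height_m / pixel_height

# A car roughly 1.5 m tall spanning 100 px in an image with f = 1000 px:
print(distance_from_known_height(1.5, 100.0, 1000.0))  # 15.0 (meters)
```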
To create 3D bounding boxes, we mark the corners of the object in the image, calculate the dimensions (length, width, height) in meters, and align them with the 3D coordinate system of the camera. This process often requires camera calibration to map pixel coordinates to real-world measurements accurately.
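The calibration step described above, mapping pixel coordinates to real-world measurements, can be sketched as back-projection through the camera intrinsics: given a pixel (u, v), its estimated depth Z, the focal lengths (fx, fy), and the principal point (cx, cy), the camera-frame position is X = (u - cx) * Z / fx and Y = (v - cy) * Z / fy. The intrinsic values in the example are hypothetical.

```python
def pixel_to_camera_xyz(u: float, v: float, z: float,
                        fx: float, fy: float,
                        cx: float, cy: float) -> tuple:
    """Back-project a pixel with known depth into the camera's XYZ frame.

    (fx, fy): focal lengths in pixels; (cx, cy): principal point.
    Returns (X, Y, Z) in meters. Applying this to each annotated corner,
    then taking differences along each axis, yields the box dimensions.
    """
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return (x, y, z)

# A box corner at pixel (840, 300), 10 m deep, fx = fy = 800,
# principal point at the image center (640, 360):
print(pixel_to_camera_xyz(840.0, 300.0, 10.0, 800.0, 800.0, 640.0, 360.0))
# (2.5, -0.75, 10.0)
```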
In summary, stereo vision mimics human binocular depth perception, while monocular depth estimation relies on learned visual cues to annotate 3D boxes. Both methods make it possible to measure object dimensions and positions without LiDAR.