Color/Render a 3D Point Cloud in Python 🎨
1. Point Clouds and Spherical Images
2. From (x,y,z) to (φ,θ,r)
3. Colorize a Point Cloud From a Spherical Image
4. Render a Spherical Image From a Coloured Point Cloud
Conclusion

Let’s use the powerful vectorization capabilities of NumPy to convert between 2D spherical images and 3D point clouds

Photo by Robert Katzki on Unsplash

Introduction

By harnessing the duality between 3D point clouds and 2D spherical coloured images, we can combine robust 3D reconstruction algorithms with powerful 2D object detection techniques.

To capture spherical coloured images and point clouds concurrently, specialized devices such as depth sensors or LiDAR scanners are commonly equipped with spherical cameras.

1. Point Clouds and Spherical Images

Let’s establish a precise definition of the objects we’re talking about:

A point cloud:

  • It provides a direct representation of the spatial layout of the scene
  • Each point is defined by its 3D spatial coordinates (x, y, z)
  • Additional attributes such as color, intensity, or other properties may be attached to each point
Coloured point cloud of a room — Image by the author

A spherical image:

  • It provides a panoramic view of the scene, capturing a large field of view in all directions around a central viewpoint. For instance, it can be produced by two back-to-back fisheye cameras (see my previous Medium article Understanding 360 images 🌎)
  • Each pixel is defined by its 2D angular coordinates (latitude φ, longitude θ) and corresponds to a particular viewing direction, just like a point on the Earth’s surface.
Coloured 360 image of a room — Image by the author

Hidden potential of 2D algorithms in the 3D realm

Object detection algorithms are frequently more robust in 2D than in 3D. Hence, the ability to switch to a 2D representation for efficient detection and segmentation, and then return to the 3D domain with the identified objects, holds tremendous promise.

With this in mind, Florent Poux, Ph.D., has recently showcased a clever maneuver: using Meta’s Segment Anything Model (SAM) for 3D point cloud segmentation.

In the upcoming sections, we will define the essential functions needed to transfer data back and forth between the 2D and 3D domains.

2. From (x,y,z) to (φ,θ,r)

This section converts global cartesian coordinates (x,y,z) into local spherical coordinates (φ,θ,r).

Angles φ and θ correspond, respectively, to the rows and columns of the 2D spherical image, while the radius r corresponds to the distance at which a point is observed, i.e., its depth.

Note: For the sake of simplicity, we will assume that the spherical coordinate system is centered at the origin of the reference frame. However, a translation/rotation can be applied to the point cloud before converting the coordinates, in order to render it from any desired pose (see my previous article Converting camera poses from OpenCV to OpenGL can be easy).

I strongly advise you to have a look at my Understanding 360 images 🌎 article, which explains the spherical coordinate system. The figure and equation below summarize all the essential elements.

Spherical coordinates with θ around the Y axis and φ around the X axis — Figure by the author from a previous article
Conversion between cartesian and spherical coordinates
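Assuming the convention of the figure above (θ around the Y axis, φ around the X axis, with an OpenCV-style frame where y points down), the conversion can be written as follows; adapt the axes to your own convention:

```latex
r = \sqrt{x^2 + y^2 + z^2}, \qquad
\theta = \operatorname{atan2}(x,\, z), \qquad
\varphi = \arcsin\!\left(\frac{y}{r}\right)
```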

Python code

To start with, we use the above equation to convert from cartesian to spherical coordinates. The input, xyz, is a floating-point NumPy array of shape (N,3) containing the point cloud’s vertices.
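Here is a minimal vectorized sketch of such a function, assuming the OpenCV-style frame above; the name cartesian_to_spherical and its exact signature are illustrative:

```python
import numpy as np

def cartesian_to_spherical(xyz: np.ndarray) -> np.ndarray:
    """Convert an (N,3) array of cartesian points into (N,3) spherical (phi, theta, r)."""
    x, y, z = xyz[:, 0], xyz[:, 1], xyz[:, 2]
    r = np.linalg.norm(xyz, axis=-1)                           # depth: distance to the origin
    theta = np.arctan2(x, z)                                   # longitude, in [-pi, pi]
    phi = np.arcsin(np.clip(y / np.maximum(r, 1e-12), -1, 1))  # latitude, in [-pi/2, pi/2]
    return np.stack((phi, theta, r), axis=-1)
```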

We then convert the (φ,θ) angles to actual pixel coordinates in the spherical image. The input parameters height and width correspond to the shape of the spherical image and are used to scale the coordinates.

Mapping from spherical coordinates (θ,φ) to normalized image coordinates (u,v)

Because the height of the image spans 180°, i.e., from top to bottom, while its width spans 360°, a spherical image always has an aspect ratio of 2, i.e., width = 2 × height.
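A sketch of this scaling step could look as follows; the (v, u) = (row, column) output order and the clamping/wrapping at the borders are my own choices:

```python
def spherical_to_pixel(phi: np.ndarray, theta: np.ndarray,
                       height: int, width: int) -> np.ndarray:
    """Map (phi, theta) angles to integer pixel coordinates (v, u) = (row, column)."""
    v = (phi / np.pi + 0.5) * height           # 180° of latitude spread over the rows
    u = (theta / (2 * np.pi) + 0.5) * width    # 360° of longitude spread over the columns
    v = np.clip(np.round(v).astype(int), 0, height - 1)   # clamp at the poles
    u = np.round(u).astype(int) % width                   # wrap around horizontally
    return np.stack((v, u), axis=-1)
```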

Combining the two previously defined methods makes it possible to project each 3D point of the point cloud onto the 2D spherical image.

Python code
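A possible combination of the two sketches above, returning both the pixel coordinates and the depths (the helper name is illustrative):

```python
def project_pcd_to_image(xyz: np.ndarray, height: int, width: int):
    """Project every 3D point onto the spherical image and return its depth."""
    phi, theta, r = cartesian_to_spherical(xyz).T
    vu = spherical_to_pixel(phi, theta, height, width)
    return vu, r
```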

3. Colorize a Point Cloud From a Spherical Image

This section aims at determining the vertex colours of a point cloud by leveraging the colour information present in a spherical image.

As is common with NumPy, we must plan our approach carefully when sampling our arrays to ensure optimal performance. Below are some examples of approaches that would lead to slow results:

  • Iterating over the (u,v) coordinates using a Python for-loop
  • Storing, for every pixel of the spherical image, a list containing the ids of the vertices projected onto that very pixel

A fast approach uses array indexing or array slicing on the image, with the (u,v) coordinates as indices.

Please note that this code doesn’t handle occlusions and colors all vertices, including hidden ones. This could easily be fixed by generating a binary mask indicating the visibility of each vertex, thus preventing it from being coloured when it shouldn’t be. The next section will explain how to generate a spherical depth map. A visibility mask can then be created by sampling the depth map at the (u,v) coordinates and checking whether the r values are within a suitable range relative to those depths.

Note: Rather than rounding the (u, v) coordinates, we can keep them as floating-point values and employ image interpolation for a more accurate color inference. However, plain array indexing is no longer applicable in this scenario. To deal with this, we could pre-enlarge the spherical image using interpolation, or use an interpolation scheme such as scipy.interpolate.RectBivariateSpline or scipy.interpolate.RegularGridInterpolator, as sketched below.
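For instance, a minimal sketch using scipy.interpolate.RegularGridInterpolator, which handles vector-valued (H, W, 3) data (the helper name is illustrative):

```python
from scipy.interpolate import RegularGridInterpolator

def sample_colors_bilinear(img: np.ndarray, vu: np.ndarray) -> np.ndarray:
    """Bilinearly sample an (H,W,3) image at floating-point (v,u) coordinates."""
    h, w = img.shape[:2]
    interpolator = RegularGridInterpolator(
        (np.arange(h), np.arange(w)),    # regular pixel grid
        img.astype(np.float64),
        method="linear", bounds_error=False, fill_value=0)
    return interpolator(vu)              # (N,3) interpolated colors
```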

Example

In this example, we colorize a point cloud using a spherical image with overlaid coloured segmentation masks. The ceiling is drawn in orange, the floor in green, and the door in blue.

For this point cloud of 250,000 points and its corresponding spherical image of 5,000×10,000 pixels, it takes about 15 ms to run.

(It took about 750 ms for 10 million points.)

Coloured 360 image of a room with superimposed segmentation masks of the floor, ceiling, and door — Image by the author
Point cloud of a room coloured from the spherical image above — Image by the author

Python code
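Here is a minimal sketch of the fast array-indexing approach, built on the hypothetical helpers from section 2 (occlusions are ignored, as noted above):

```python
def colorize_point_cloud(xyz: np.ndarray, img: np.ndarray) -> np.ndarray:
    """Return an (N,3) array of vertex colors sampled from the spherical image."""
    height, width = img.shape[:2]
    vu, _ = project_pcd_to_image(xyz, height, width)
    return img[vu[:, 0], vu[:, 1]]   # one vectorized read per vertex, no Python loop
```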

4. Render a Spherical Image From a Coloured Point Cloud

This section aims at rendering a spherical image by leveraging the colour information present in the vertices of the point cloud.

It’s interesting to note that this problem can be solved in the same way as the point cloud colorization above. This time, array indexing is used to set the colour of the pixels onto which the vertices are projected.

However, several vertices might end up falling on the same pixel. In this case, occlusion must be considered to determine which color should appear in the foreground. We could compare the depths of all the vertices within a pixel and choose the closest one. But it’s smarter to sort all the vertices by depth in descending order and let array indexing do the job for us. Indeed, the last occurrence of a pixel index during array indexing ultimately determines its final value; any previous occurrences will have been overwritten.

Likewise, we can obtain the depth map at no additional cost.
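A sketch of this depth-sorted rendering, again using the hypothetical helpers defined earlier; since the farthest points are written first, the last write per pixel comes from the closest vertex, and the depth map comes for free:

```python
def render_spherical_image(xyz: np.ndarray, colors: np.ndarray,
                           height: int, width: int):
    """Render a color image and a depth map from a colored point cloud."""
    vu, depth = project_pcd_to_image(xyz, height, width)
    order = np.argsort(depth)[::-1]     # sort the vertices far-to-near ...
    vu, depth, colors = vu[order], depth[order], colors[order]
    img = np.zeros((height, width, 3), dtype=colors.dtype)
    depth_map = np.zeros((height, width), dtype=depth.dtype)
    # ... so that the last write per pixel, i.e. the closest point, wins
    img[vu[:, 0], vu[:, 1]] = colors
    depth_map[vu[:, 0], vu[:, 1]] = depth
    return img, depth_map
```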

Example

In this example, we render a spherical coloured image and depth map from a coloured point cloud.

For this point cloud of 250,000 points, it takes about 30 ms to generate the two images below:

Some sampling artifacts are present in the rendered images. This is primarily due to the sparsity of the output image, as each vertex contributes to only a single pixel. The issue becomes more pronounced as the image resolution increases. To address this, potential solutions could be to decrease the resolution, apply inpainting to fill the holes, or use point splatting in a dedicated point cloud renderer.

Spherical coloured image and depth map rendered from a coloured point cloud — Image by the author

Conclusion

I hope you enjoyed reading this article and that it gave you more insights into coloring and rendering point clouds!

See more of my code on GitHub.
