Tampere University of Technology

TUTCRIS Research Portal

Video coding of dynamic 3D point cloud data

Research output: Contribution to journalArticleScientificpeer-review

Standard

Video coding of dynamic 3D point cloud data. / Schwarz, Sebastian; Sheikhipour, Nahid; Fakour Sevom, Vida; Hannuksela, Miska M.

In: APSIPA Transactions on Signal and Information Processing, 2019.

Research output: Contribution to journalArticleScientificpeer-review

Harvard

Schwarz, S, Sheikhipour, N, Fakour Sevom, V & Hannuksela, MM 2019, 'Video coding of dynamic 3D point cloud data', APSIPA Transactions on Signal and Information Processing. https://doi.org/10.1017/ATSIP.2019.24

APA

Schwarz, S., Sheikhipour, N., Fakour Sevom, V., & Hannuksela, M. M. (2019). Video coding of dynamic 3D point cloud data. APSIPA Transactions on Signal and Information Processing, [e31]. https://doi.org/10.1017/ATSIP.2019.24

Vancouver

Schwarz S, Sheikhipour N, Fakour Sevom V, Hannuksela MM. Video coding of dynamic 3D point cloud data. APSIPA Transactions on Signal and Information Processing. 2019. e31. https://doi.org/10.1017/ATSIP.2019.24

Author

Schwarz, Sebastian ; Sheikhipour, Nahid ; Fakour Sevom, Vida ; Hannuksela, Miska M. / Video coding of dynamic 3D point cloud data. In: APSIPA Transactions on Signal and Information Processing. 2019.

Bibtex - Download

@article{6a6529110c7e4345a0184202956a6bcd,
title = "Video coding of dynamic 3D point cloud data",
abstract = "Due to the increased popularity of augmented (AR) and virtual (VR) reality experiences, the interest in representing the real world in an immersive fashion has never been higher. Distributing such representations enables users all over the world to freely navigate in never seen before media experiences. Unfortunately, such representations require a large amount of data, not feasible for transmission on today's networks. Thus, efficient compression technologies are in high demand. This paper proposes an approach to compress 3D video data utilizing 2D video coding technology. The proposed solution was developed to address the needs of {"}tele-immersive{"} applications, such as VR, AR, or mixed reality with {"}Six Degrees of Freedom{"} capabilities. Volumetric video data is projected on 2D image planes and compressed using standard 2D video coding solutions. A key benefit of this approach is its compatibility with readily available 2D video coding infrastructure. Furthermore, objective and subjective evaluation shows significant improvement in coding efficiency over reference technology. The proposed solution was contributed and evaluated in international standardization. Although it is was not selected as the winning proposal, as very similar solution has been selected developed since then.",
keywords = "AR, Immersive media, Point cloud coding, Volumetric video",
author = "Sebastian Schwarz and Nahid Sheikhipour and {Fakour Sevom}, Vida and Hannuksela, {Miska M.}",
year = "2019",
doi = "10.1017/ATSIP.2019.24",
language = "English",
journal = "APSIPA Transactions on Signal and Information Processing",
issn = "2048-7703",
publisher = "Cambridge University Press",

}

RIS (suitable for import to EndNote) - Download

TY - JOUR

T1 - Video coding of dynamic 3D point cloud data

AU - Schwarz, Sebastian

AU - Sheikhipour, Nahid

AU - Fakour Sevom, Vida

AU - Hannuksela, Miska M.

PY - 2019

Y1 - 2019

N2 - Due to the increased popularity of augmented (AR) and virtual (VR) reality experiences, the interest in representing the real world in an immersive fashion has never been higher. Distributing such representations enables users all over the world to freely navigate in never seen before media experiences. Unfortunately, such representations require a large amount of data, not feasible for transmission on today's networks. Thus, efficient compression technologies are in high demand. This paper proposes an approach to compress 3D video data utilizing 2D video coding technology. The proposed solution was developed to address the needs of "tele-immersive" applications, such as VR, AR, or mixed reality with "Six Degrees of Freedom" capabilities. Volumetric video data is projected on 2D image planes and compressed using standard 2D video coding solutions. A key benefit of this approach is its compatibility with readily available 2D video coding infrastructure. Furthermore, objective and subjective evaluation shows significant improvement in coding efficiency over reference technology. The proposed solution was contributed and evaluated in international standardization. Although it is was not selected as the winning proposal, as very similar solution has been selected developed since then.

AB - Due to the increased popularity of augmented (AR) and virtual (VR) reality experiences, the interest in representing the real world in an immersive fashion has never been higher. Distributing such representations enables users all over the world to freely navigate in never seen before media experiences. Unfortunately, such representations require a large amount of data, not feasible for transmission on today's networks. Thus, efficient compression technologies are in high demand. This paper proposes an approach to compress 3D video data utilizing 2D video coding technology. The proposed solution was developed to address the needs of "tele-immersive" applications, such as VR, AR, or mixed reality with "Six Degrees of Freedom" capabilities. Volumetric video data is projected on 2D image planes and compressed using standard 2D video coding solutions. A key benefit of this approach is its compatibility with readily available 2D video coding infrastructure. Furthermore, objective and subjective evaluation shows significant improvement in coding efficiency over reference technology. The proposed solution was contributed and evaluated in international standardization. Although it is was not selected as the winning proposal, as very similar solution has been selected developed since then.

KW - AR

KW - Immersive media

KW - Point cloud coding

KW - Volumetric video

U2 - 10.1017/ATSIP.2019.24

DO - 10.1017/ATSIP.2019.24

M3 - Article

JO - APSIPA Transactions on Signal and Information Processing

JF - APSIPA Transactions on Signal and Information Processing

SN - 2048-7703

M1 - e31

ER -