Hostname: page-component-89b8bd64d-b5k59 Total loading time: 0 Render date: 2026-05-12T11:04:47.957Z Has data issue: false hasContentIssue false

Seam carving modeling for semantic video coding in security applications

Published online by Cambridge University Press:  06 August 2015

Marc Décombas*
Affiliation:
Institut Mines-Télécom, Télécom ParisTech, CNRS LTCI, Paris, France Laboratory MultiMedia – Thales Communications & Security, Gennevilliers, France
Younous Fellah
Affiliation:
Institut Mines-Télécom, Télécom ParisTech, CNRS LTCI, Paris, France
Fréderic Dufaux
Affiliation:
Institut Mines-Télécom, Télécom ParisTech, CNRS LTCI, Paris, France
Beatrice Pesquet-popescu
Affiliation:
Institut Mines-Télécom, Télécom ParisTech, CNRS LTCI, Paris, France
Francois Capman
Affiliation:
Laboratory MultiMedia – Thales Communications & Security, Gennevilliers, France
Erwann Renan
Affiliation:
Laboratory MultiMedia – Thales Communications & Security, Gennevilliers, France
*
Corresponding author: M. Décombas Email: marc.decombas@gmail.com

Abstract

In some security applications, it is important to transmit just enough information to take the right decisions. Traditional video codecs try to maximize the global quality, irrespective of the video content pertinence for certain tasks. To better maintain the semantics of the scene, some approaches allocate more bitrate to the salient information. In this paper, a semantic video compression scheme based on seam carving is proposed. The idea is to suppress non-salient parts of the video by seam carving. The reduced sequence is encoded with H.264/AVC while the seams are encoded with our approach. The main contributions of this paper are (1) an algorithm that segments the sequence into group of pictures, depending on the content, (2) a spatio-temporal seam clustering method, (3) an isolated seam discarding technique, improving the seam encoding, (4) a new seam modeling, avoiding geometric distortion and resulting in a better control of the seam shapes, and (5) a new encoder which reduces the overall bit-rate. A full reference object-oriented quality metric is used to assess the performance of the approach. Our approach outperforms traditional H.264/AVC intra encoding with a Bjontegaard's rate improvement between 7.02 and 21.77% while maintaining the quality of the salient objects.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Authors, 2015
Figure 0

Fig. 1. Architecture of the proposed semantic video coding using seam carving.

Figure 1

Fig. 2. Overall scheme of the proposed seam carving process (corresponding to the module “seam carving” in Fig. 1).

Figure 2

Fig. 3. Seam computation process to obtain from a frame a list of seams (corresponding to the module “seam computation” in Fig. 2).

Figure 3

Fig. 4. Group of picture (GOP) definition based on the content. Horizontal continuous lines represent the number of seams. Vertical dotted lines represent the GOP segmentation. Blue, resp. red, indicates horizontal, resp. vertical, seams.

Figure 4

Fig. 5. Spatio-temporal clustering process to obtain a list of spatio-temporal clusters of seams from a list of seams by GOP (corresponding to the module “spatio-temporal seam clustering” in Fig. 2).

Figure 5

Fig. 6. Seams reordering: on the left, seams before reordering; on the right, seams after reordering.

Figure 6

Fig. 7. Illustration of the spatio-temporal seam clustering.

Figure 7

Fig. 8. Illustration of seams in two consecutive frames of the Coastguard sequence. Arrows show examples of group of seams.

Figure 8

Fig. 9. Example of isolated seams for a frame of the Ducks sequence (left) and the Parkrun sequence (right). Isolated seams are shown by the red arrows.

Figure 9

Fig. 10. Illustration of two groups of seams with in red the border seams and in blue the inner seams within the group of seams.

Figure 10

Fig. 11. Group of seams modeling (corresponding to the module “group of seams modeling” in Fig. 2).

Figure 11

Fig. 12. Illustration of the original seams (left) and the modeled seams (right) for the Parkjoy sequence.

Figure 12

Fig. 13. Predictive model for the group of border seams and encoding (corresponding to the module “group of seams modeling” in Fig. 1).

Figure 13

Fig. 14. Evaluation protocol. Comparison of the decoded video with seam carving approach with the original video. SSIM_MASK is computed on the ground-truth binary mask.

Figure 14

Table 1. Value and influence of various parameters.

Figure 15

Fig. 15. Approximation of the number of suppressed seams for Parkjoy: evolution of the number of vertical seams (top) and horizontal seams (bottom).

Figure 16

Table 2. Comparison of the ratio of spatial resizing between different approaches.

Figure 17

Fig. 16. Visual seam modeling for Coastguard/Ducks/ParkJoy/ParkRun sequences. First column: sample original frame; second column: salient objects [39]; third column: initial seams; and fourth column: seams after the proposed modeling.

Figure 18

Fig. 17. Visual seam modeling for Coastguard at different times t = 27, 28, 29, 30. On first line, initial seams, on second line, the seams after the proposed modeling.

Figure 19

Table 3. Percentage of bitrate saved compared with H.264/AVC as a function of quantization parameter intra (QPI) (positive value means bitrate is decreased, negative value means bitrate is increased): (1) reduced video after the seam carving without seam coding, (2) reduced video after the seam carving with seam position encoded without modeling, (3) reduced video after the seam carving modeled without seam coding, and (4) our proposed approach with reduced video after the seam carving modeled and the seams modeled and encoded.

Figure 20

Table 4. Percentage of bitrate saving compared with H.264/AVC as a function of QP (positive value means bitrate is decreased, negative value means bitrate is increased): (1) reduced video after the seam carving without seam coding, (2) reduced video after the seam carving with seam position encoded without modeling, (3) reduced video after the seam carving modeled without seam coding, and (4) our proposed approach with reduced video after the seam carving modeled and the seams modeled and encoded.

Figure 21

Fig. 18. Illustration of the different processes on Coastguard sequence: (a) original image, (b) binary mask of reference, (c) saliency model, (d) image with proposed seam carving and seams in green, and (e) image reduced with proposed seam carving.

Figure 22

Fig. 19. Rate-distortion performance in full intra coding for Coastguard, Ducks, Parkrun, and Parkjoy, using the SSIM_MASK. In green, H.264/AVC and in red the proposed approach.

Figure 23

Table 5. Bjontegaard's scores (percentage and Delta SSIM_MASK) in full intra coding for Coastguard, Ducks, Parkrun, and Parkjoy, using the SSIM_MASK.

Figure 24

Fig. 20. Rate-distortion performance. Visual results for Parkrun in intra coding with a SSIM_MASK  =  0.83 and a bitrate saving of 23%.

Figure 25

Fig. 21. Rate-distortion performance. Visual results for Coastguard in intra coding with a bitrate of 488 Kbits/s.

Figure 26

Fig. 22. Rate-distortion performance in inter coding for Coastguard, Ducks, Parkrun, and Parkjoy, using the SSIM_MASK. In green, H.264/AVC and in red the proposed approach.

Figure 27

Table 6. Bjontegaard's scores in inter coding for Coastguard, Ducks, Parkrun, and Parkjoy, using the SSIM_MASK.