Table of contents for issues of ACM Transactions on Multimedia Computing, Communications, and Applications

Last update: Wed Apr 22 07:41:46 MDT 2026

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 1, Number 1, February, 2005

           Nicolas D. Georganas   Editorial: The birth of the ACM
                                  Transactions on Multimedia Computing,
                                  Communications and Applications
                                  (TOMCCAP)  . . . . . . . . . . . . . . . 1--2
           Lawrence A. Rowe and   
                    Ramesh Jain   ACM SIGMM Retreat report on future
                                  directions in multimedia research  . . . 3--13
                Ramesh Jain and   
           Thomas Plagemann and   
                 Ralf Steinmetz   Guest editorial: The International ACM
                                  Multimedia Conference 1993 --- ten years
                                  after  . . . . . . . . . . . . . . . . . 14--15
             Laura Teodosio and   
                  Walter Bender   Salient stills . . . . . . . . . . . . . 16--36
             A. L. N. Reddy and   
                 Jim Wyllie and   
           K. B. R. Wijayaratne   Disk scheduling in a multimedia I/O
                                  system . . . . . . . . . . . . . . . . . 37--59
        M. Cecelia Buchanan and   
             Polle T. Zellweger   Automatic temporal layout mechanisms
                                  revisited  . . . . . . . . . . . . . . . 60--88
       Dick C. A. Bulterman and   
                  Lynda Hardman   Structured multimedia authoring  . . . . 89--109
          Ketan Mayer-Patel and   
             Brian C. Smith and   
               Lawrence A. Rowe   The Berkeley software MPEG-1 video
                                  decoder  . . . . . . . . . . . . . . . . 110--125

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 1, Number 2, May, 2005

           Thomas Plagemann and   
            Prashant Shenoy and   
                  John R. Smith   Selected papers from the ACM Multimedia
                                  Conference 2003  . . . . . . . . . . . . 127--127
               Sang-Uok Kum and   
              Ketan Mayer-Patel   Real-time multidepth stream compression  128--150
                Wu-Chi Feng and   
                  Ed Kaiser and   
              Wu Chang Feng and   
              Mikael Le Baillif   Panoptes: scalable low-power video
                                  sensor networking technologies . . . . . 151--167
                Kingshy Goh and   
                  Beitao Li and   
                Edward Y. Chang   Semantics and feature discovery via
                                  confidence-based ensemble  . . . . . . . 168--189
            H. Harlyn Baker and   
                Nina Bhatti and   
             Donald Tanguay and   
                Irwin Sobel and   
                   Dan Gelb and   
            Michael E. Goss and   
        W. Bruce Culbertson and   
              Thomas Malzbender   Understanding performance in Coliseum,
                                  an immersive videoconferencing system    190--210

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 1, Number 3, August, 2005

                Brett Adams and   
           Svetha Venkatesh and   
                    Ramesh Jain   IMCE: Integrated media creation
                                  environment  . . . . . . . . . . . . . . 211--247
      Christian Poellabauer and   
                 Karsten Schwan   Flexible cross-domain event delivery for
                                  quality-managed multimedia applications  248--268
             Matthew Cooper and   
             Jonathan Foote and   
         Andreas Girgensohn and   
                    Lynn Wilcox   Temporal event clustering for digital
                                  photo collections  . . . . . . . . . . . 269--288
                   Keqiu Li and   
                      Hong Shen   Coordinated enroute multimedia object
                                  caching in transcoding proxies for tree
                                  networks . . . . . . . . . . . . . . . . 289--314

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 1, Number 4, November, 2005

                  Huahui Wu and   
              Mark Claypool and   
                 Robert Kinicki   Adjusting forward error correction with
                                  temporal scaling for TCP-friendly
                                  streaming MPEG . . . . . . . . . . . . . 315--337
                Jianfei Cai and   
                Xiangjun Li and   
                 Chang Wen Chen   Layered unequal loss protection with
                                  pre-interleaving for fast progressive
                                  image transmission over packet-loss
                                  channels . . . . . . . . . . . . . . . . 338--353
                Yi-Cheng Tu and   
              Jianzhong Sun and   
            Mohamed Hefeeda and   
                Sunil Prabhakar   An analytical study of peer-to-peer
                                  media streaming systems  . . . . . . . . 354--376

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 2, Number 1, February, 2006

             Michael S. Lew and   
                  Nicu Sebe and   
            Chabane Djeraba and   
                    Ramesh Jain   Content-based multimedia information
                                  retrieval: State of the art and
                                  challenges . . . . . . . . . . . . . . . 1--19
          Alberto Del Bimbo and   
                    Pietro Pala   Content-based retrieval of $3$D models   20--43
                  Huaxin Xu and   
                  Tat-Seng Chua   Fusion of AV features and external
                                  information sources for event detection
                                  in team sports video . . . . . . . . . . 44--67
               Dhiraj Joshi and   
              James Z. Wang and   
                         Jia Li   The Story Picturing Engine---a system
                                  for automatic text illustration  . . . . 68--89

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 2, Number 2, May, 2006

           Cees G. M. Snoek and   
             Marcel Worring and   
         Alexander G. Hauptmann   Learning rich semantics from news video
                                  archives by style analysis . . . . . . . 91--108
                 Guang Yang and   
                   Tony Sun and   
                Mario Gerla and   
             M. Y. Sanadidi and   
                  Ling-Jyh Chen   Smooth and efficient real-time video
                                  transport in the presence of wireless
                                  errors . . . . . . . . . . . . . . . . . 109--126
                    Xi Shao and   
              Changsheng Xu and   
          Namunu C. Maddage and   
                    Qi Tian and   
       Mohan S. Kankanhalli and   
                   Jesse S. Jin   Automatic summarization of music videos  127--148
        Viktor S. Wold Eide and   
     Ole-Christoffer Granmo and   
             Frank Eliassen and   
Jòrgen Andreas Michaelsen   Real-time video content analysis:
                                  QoS-aware application composition and
                                  parallel processing  . . . . . . . . . . 149--172

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 2, Number 3, August, 2006

    K. Selçuk Candan and   
          Augusto Celentano and   
                  Wolfgang Klas   Introduction to special issue on the use
                                  of context in multimedia information
                                  systems  . . . . . . . . . . . . . . . . 173--176
              Alfio Ferrara and   
           Luca A. Ludovico and   
         Stefano Montanelli and   
            Silvana Castano and   
                  Goffredo Haus   A Semantic Web ontology for
                                  context-based classification and
                                  retrieval of music resources . . . . . . 177--198
         Anne-Muriel Arigon and   
           Anne Tchounikine and   
               Maryvonne Miquel   Handling multiple points of view in a
                                  multimedia data warehouse  . . . . . . . 199--218
                Kanav Kahol and   
        Priyamvada Tripathi and   
              Troy Mcdaniel and   
              Laura Bratton and   
        Sethuraman Panchanathan   Modeling context in haptic perception,
                                  rendering, and visualization . . . . . . 219--240

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 2, Number 4, November, 2006

        Stephen R. Gulliver and   
              Gheorghita Ghinea   Defining user perception of distributed
                                  multimedia quality . . . . . . . . . . . 241--257
             Kartik Gopalan and   
                  Lan Huang and   
                  Gang Peng and   
            Tzi-Cker Chiueh and   
                   Yow-Jian Lin   Statistical admission control using
                                  delay distribution measurements  . . . . 258--281
                      H. Li and   
                      M. Li and   
                 B. Prabhakaran   Middleware for streaming $3$D
                                  progressive meshes over lossy networks   282--317
                Yoav Etsion and   
                Dan Tsafrir and   
              Dror G. Feitelson   Process prioritization using output
                                  production: Scheduling for multimedia    318--342
                Pablo Cesar and   
             Petri Vuorimaa and   
                  Juha Vierinen   A graphics architecture for high-end
                                  interactive television terminals . . . . 343--357
   Chitra L. Madhwacharyula and   
                 Marc Davis and   
            Philippe Mulhem and   
           Mohan S. Kankanhalli   Metadata handling: a video perspective   358--388

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 3, Number 1, February, 2007

           Pradeep K. Atrey and   
       Mohan S. Kankanhalli and   
                 John B. Oommen   Goal-oriented optimal subset selection
                                  of correlated multimedia streams . . . . ??
                Datong Chen and   
                   Jie Yang and   
              Robert Malkin and   
              Howard D. Wactlar   Detecting social interactions of the
                                  elderly in a nursing home environment    ??
                Rachel Heck and   
            Michael Wallick and   
               Michael Gleicher   Virtual videography  . . . . . . . . . . ??
               Ba Tu Truong and   
               Svetha Venkatesh   Video abstraction: a systematic review
                                  and classification . . . . . . . . . . . ??
              Changsheng Xu and   
          Namunu C. Maddage and   
                    Xi Shao and   
                        Qi Tian   Content-adaptive digital music
                                  watermarking based on music structure
                                  analysis . . . . . . . . . . . . . . . . ??
                 Wei-Qi Yan and   
           Mohan S. Kankanhalli   Multimedia simplification for optimized
                                  MMS synthesis  . . . . . . . . . . . . . ??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 3, Number 2, May, 2007

               Tiecheng Liu and   
                 John R. Kender   Computational approaches to temporal
                                  sampling of video sequences  . . . . . . 7:1--7:??
            Simon Moncrieff and   
           Svetha Venkatesh and   
                     Geoff West   Online audio background determination
                                  for complex audio environments . . . . . 8:1--8:??
               Chika Oshima and   
          Kazushi Nishimoto and   
                Norihiro Hagita   A piano duo support system for parents
                                  to lead children to practice musical
                                  performances . . . . . . . . . . . . . . 9:1--9:??
                 Xiaofei He and   
                   Deng Cai and   
                Ji-Rong Wen and   
                Wei-Ying Ma and   
               Hong-Jiang Zhang   Clustering and searching WWW images
                                  using link and page layout analysis  . . 10:1--10:??
              Byunghee Jung and   
               Junehwa Song and   
                   Yoonjoon Lee   A narrative-based abstraction framework
                                  for story-oriented video . . . . . . . . 11:1--11:??
                Ron Shacham and   
        Henning Schulzrinne and   
         Srisakul Thakolsri and   
              Wolfgang Kellerer   Ubiquitous device personalization and
                                  use: The next generation of IP
                                  multimedia communications  . . . . . . . 12:1--12:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 3, Number 3, August, 2007

             Herng-Yow Chen and   
                   Sheng-Wei Li   Exploring many-to-one speech-to-text
                                  correlation for Web-based language
                                  learning . . . . . . . . . . . . . . . . 13:1--13:??
                Surong Wang and   
            Manoranjan Dash and   
            Liang-Tien Chia and   
                         Min Xu   Efficient sampling of training set in
                                  large and noisy multimedia data  . . . . 14:1--14:??
               Suiping Zhou and   
                Wentong Cai and   
          Stephen J. Turner and   
                Bu-Sung Lee and   
                      Junhu Wei   Critical causal order of events in
                                  distributed virtual environments . . . . 15:1--15:??
                Chuanjun Li and   
                S. Q. Zheng and   
                 B. Prabhakaran   Segmentation and recognition of motion
                                  streams by similarity search . . . . . . 16:1--16:??
               David E. Ott and   
              Ketan Mayer-Patel   An open architecture for transport-level
                                  protocol coordination in distributed
                                  multimedia applications  . . . . . . . . 17:1--17:??
                  Ziad Sakr and   
           Nicolas D. Georganas   Robust content-based MPEG-4 XMT scene
                                  structure authentication and multimedia
                                  content location . . . . . . . . . . . . 18:1--18:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 3, Number 4, December, 2007

          Gheorghita Ghinea and   
            Chabane Djeraba and   
           Stephen Gulliver and   
             Kara Pernice Coyne   Introduction to special issue on
                                  eye-tracking applications in multimedia
                                  systems  . . . . . . . . . . . . . . . . 1:1--1:4
              Carlo Colombo and   
           Dario Comanducci and   
              Alberto Del Bimbo   Robust tracking and remapping of eye
                                  appearance with passive computer vision  2:1--2:20
                   Jun Wang and   
                  Lijun Yin and   
                    Jason Moore   Using geometric properties of
                                  topographic manifold to detect and track
                                  eyes for human-computer interaction  . . 3:1--3:20
              D. Agrafiotis and   
            S. J. C. Davies and   
             N. Canagarajah and   
                     D. R. Bull   Towards efficient context-specific video
                                  coding based on gaze-tracking analysis   4:1--4:15
             Thierry Urruty and   
              Stanislas Lew and   
           Nacim Ihadaddene and   
                Dan A. Simovici   Detecting eye fixations by projection
                                  clustering . . . . . . . . . . . . . . . 5:1--5:20
        Andrew T. Duchowski and   
      Arzu Çöltekin   Foveated gaze-contingent displays for
                                  peripheral LOD management, $3$D
                                  visualization, and stereo imaging  . . . 6:1--6:18
          Lester C. Loschky and   
              Gary S. Wolverton   How late can you update gaze-contingent
                                  multiresolutional displays without
                                  detection? . . . . . . . . . . . . . . . 7:1--7:10
              Norman Murray and   
               Dave Roberts and   
              Anthony Steed and   
               Paul Sharkey and   
             Paul Dickerson and   
                       John Rae   An assessment of eye-gaze potential
                                  within immersive virtual environments    8:1--8:17
         Dorothy Rachovides and   
           James Walkerdine and   
                 Peter Phillips   The conductor interaction method . . . . 9:1--9:23

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 4, Number 1, January, 2008

                Hangzai Luo and   
                   Yuli Gao and   
              Xiangyang Xue and   
                 Jinye Peng and   
                   Jianping Fan   Incorporating feature hierarchy and
                                  boosting to achieve more effective
                                  classifier training and concept-oriented
                                  video summarization and skimming . . . . 1:1--1:??
            Mohamed Hefeeda and   
                 Cheng-Hsin Hsu   Rate-distortion optimized streaming of
                                  fine-grained scalable video sequences    2:1--2:??
              Fulvio Babich and   
            Marco D'orlando and   
                Francesca Vatta   Video quality estimation in wireless IP
                                  networks: Algorithms and applications    3:1--3:??
           Phani S. Kotharu and   
                 B. Prabhakaran   Partial query resolution for animation
                                  authoring  . . . . . . . . . . . . . . . 4:1--4:??
              Alan T. S. Ip and   
             John C. S. Lui and   
                 Jiangchuan Liu   A revenue-rewarding scheme of providing
                                  incentive for cooperative proxy caching
                                  for media streaming systems  . . . . . . 5:1--5:??
                  Cha Zhang and   
                   Yong Rui and   
               Jim Crawford and   
                      Li-Wei He   An automated end-to-end lecture capture
                                  and broadcasting system  . . . . . . . . 6:1--6:??
        Giang Phuong Nguyen and   
                 Marcel Worring   Optimization of interactive
                                  visual-similarity-based search . . . . . 7:1--7:??
             Helmut Hlavacs and   
              Shelley Buchinger   Hierarchical video patching with optimal
                                  server bandwidth . . . . . . . . . . . . 8:1--8:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 4, Number 2, May, 2008

              Songqing Chen and   
               Shiping Chen and   
                Huiping Guo and   
                    Bo Shen and   
                 Sushil Jajodia   Achieving simultaneous distribution
                                  control and privacy protection for
                                  Internet media delivery  . . . . . . . . 9:1--9:??
                     Rui Li and   
                  Bir Bhanu and   
                     Anlei Dong   Feature synthesized EM algorithm for
                                  image retrieval  . . . . . . . . . . . . 10:1--10:??
                     Min Xu and   
              Changsheng Xu and   
                Lingyu Duan and   
               Jesse S. Jin and   
                     Suhuai Luo   Audio keywords generation for sports
                                  video analysis . . . . . . . . . . . . . 11:1--11:??
            Sunand Tullimas and   
               Thinh Nguyen and   
              Rich Edgecomb and   
               Sen-ching Cheung   Multimedia streaming using multiple TCP
                                  connections  . . . . . . . . . . . . . . 12:1--12:??
         Dian Tjondronegoro and   
        Yi-Ping Phoebe Chen and   
                    Adrien Joly   A scalable and extensible
                                  segment-event-object-based sports video
                                  retrieval system . . . . . . . . . . . . 13:1--13:??
           Roger Zimmermann and   
                Elaine Chew and   
           Sakire Arslan Ay and   
                    Moses Pawar   Distributed musical performances:
                                  Architecture and stream management . . . 14:1--14:??
             Cheng-Hsin Hsu and   
                Mohamed Hefeeda   On the accuracy and complexity of
                                  rate-distortion models for fine-grained
                                  scalable video sequences . . . . . . . . 15:1--15:??
                  Bing Wang and   
                 Jim Kurose and   
            Prashant Shenoy and   
                    Don Towsley   Multimedia streaming via TCP: an
                                  analytic performance study . . . . . . . 16:1--16:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 4, Number 3, August, 2008

               Tsungnan Lin and   
               Chiapin Wang and   
                  Po-Chiang Lin   A neural-network-based context-aware
                                  handoff algorithm for multimedia
                                  computing  . . . . . . . . . . . . . . . 17:1--17:??
           Ingmar S. Franke and   
         Sebastian Pannasch and   
            Jens R. Helmert and   
              Robert Rieger and   
                Rainer Groh and   
          Boris M. Velichkovsky   Towards attention-centered interfaces:
                                  an aesthetic evaluation of perspective
                                  with eye tracking  . . . . . . . . . . . 18:1--18:??
                   Chuan Wu and   
                 Baochun Li and   
                   Shuqiao Zhao   Exploring large-scale peer-to-peer live
                                  streaming topologies . . . . . . . . . . 19:1--19:??
                Ashvin Goel and   
             Charles Krasic and   
               Jonathan Walpole   Low-latency adaptive streaming over TCP  20:1--20:??
               Seung-Ho Lim and   
               Yo-Won Jeong and   
                    Kyu Ho Park   Data placement and prefetching with
                                  accurate bit rate control for
                                  interactive media server . . . . . . . . 21:1--21:??
                     Li Jie and   
                 James J. Clark   Video game design using an
                                  eye-movement-dependent model of visual
                                  attention  . . . . . . . . . . . . . . . 22:1--22:??
        Oleg V. Komogortsev and   
                  Javed I. Khan   Predictive real-time perceptual
                                  compression based on eye-gaze-position
                                  analysis . . . . . . . . . . . . . . . . 23:1--23:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 4, Number 4, October, 2008

                Pablo Cesar and   
       Dick C. A. Bulterman and   
     Luiz Fernando Gomes Soares   Introduction to special issue:
                                  Human-centered television --- directions
                                  in interactive digital television
                                  research . . . . . . . . . . . . . . . . 24:1--24:??
             Marian F. Ursu and   
             Maureen Thomas and   
                  Ian Kegel and   
              Doug Williams and   
               Mika Tuomola and   
            Inger Lindstedt and   
             Terence Wright and   
             Andra Leurdijk and   
            Vilmos Zsombori and   
              Julia Sussner and   
               Ulf Myrestam and   
                      Nina Hall   Interactive TV narratives:
                                  Opportunities, progress, and challenges  25:1--25:??
                  Bin Cheng and   
                  Lex Stein and   
                    Hai Jin and   
               Xiaofei Liao and   
                    Zheng Zhang   GridCast: Improving peer sharing for P2P
                                  VoD  . . . . . . . . . . . . . . . . . . 26:1--26:??
             Crysta Metcalf and   
              Gunnar Harboe and   
                 Joe Tullio and   
                Noel Massey and   
                 Guy Romano and   
            Elaine M. Huang and   
                  Frank Bentley   Examining presence and lightweight
                                  messaging in a social television
                                  experience . . . . . . . . . . . . . . . 27:1--27:??
          Renan G. Cattelan and   
             Cesar Teixeira and   
           Rudinei Goularte and   
Maria Da Graça C. Pimentel   Watch-and-comment as a paradigm toward
                                  ubiquitous interactive video editing . . 28:1--28:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 5, Number 1, October, 2008

            Brian P. Bailey and   
                  Nicu Sebe and   
                  Alan Hanjalic   Special section from the ACM Multimedia
                                  Conference 2007  . . . . . . . . . . . . 1:1--1:??
        Michael L. Gleicher and   
                       Feng Liu   Re-cinematography: Improving the
                                  camerawork of casual video . . . . . . . 2:1--2:??
                 Guo-Jun Qi and   
             Xian-Sheng Hua and   
                   Yong Rui and   
                Jinhui Tang and   
                    Tao Mei and   
                  Meng Wang and   
               Hong-Jiang Zhang   Correlative multilabel video annotation
                                  with temporal kernels  . . . . . . . . . 3:1--3:??
               Yinpeng Chen and   
                  Weiwei Xu and   
              Hari Sundaram and   
          Thanassis Rikakis and   
                  Sheng-Min Liu   A dynamic decision network framework for
                                  online media adaptation in stroke
                                  rehabilitation . . . . . . . . . . . . . 4:1--4:??
            Frederic Thouin and   
                    Mark Coates   Equipment allocation in video-on-demand
                                  network deployments  . . . . . . . . . . 5:1--5:??
              Prakash Kolan and   
                  Ram Dantu and   
        João W. Cangussu   Nuisance level of a voice call . . . . . 6:1--6:??
            Qing-Fang Zheng and   
                        Wen Gao   Constructing visual phrases for
                                  effective and efficient object-based
                                  image retrieval  . . . . . . . . . . . . 7:1--7:??
              Phillipa Gill and   
                   Liqi Shi and   
            Anirban Mahanti and   
                Zongpeng Li and   
                 Derek L. Eager   Scalable on-demand media streaming for
                                  heterogeneous clients  . . . . . . . . . 8:1--8:??
                Dawoon Jung and   
                Jaegeuk Kim and   
                Jin-Soo Kim and   
                    Joonwon Lee   ScaleFFS: a scalable log-structured
                                  flash file system for mobile multimedia
                                  systems  . . . . . . . . . . . . . . . . 9:1--9:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 5, Number 2, November, 2008

            Simon Moncrieff and   
           Svetha Venkatesh and   
                     Geoff West   Dynamic privacy assessment in a smart
                                  house environment using multimodal
                                  sensing  . . . . . . . . . . . . . . . . 10:1--10:??
                Brett Adams and   
                 Dinh Phung and   
               Svetha Venkatesh   Sensing and using social context . . . . 11:1--11:??
          Saraju P. Mohanty and   
             Bharat K. Bhargava   Invisible watermarking based on creation
                                  and robust insertion-extraction of image
                                  adaptive watermarks  . . . . . . . . . . 12:1--12:??
            Wai-Pun Ken Yiu and   
           Shueng-Han Gary Chan   Offering data confidentiality for
                                  multimedia overlay multicast: Design and
                                  analysis . . . . . . . . . . . . . . . . 13:1--13:??
            Minoru Nakayama and   
              Yosiyuki Takahasi   Estimation of certainty for responses to
                                  multiple-choice questionnaires using eye
                                  movements  . . . . . . . . . . . . . . . 14:1--14:??
              Frank Shipman and   
         Andreas Girgensohn and   
                    Lynn Wilcox   Authoring, viewing, and generating
                                  hypervideo: an overview of
                                  Hyper-Hitchcock  . . . . . . . . . . . . 15:1--15:??
                   Wenbo He and   
            Klara Nahrstedt and   
                        Xue Liu   End-to-end delay control of multimedia
                                  applications over multihop wireless
                                  links  . . . . . . . . . . . . . . . . . 16:1--16:??
                   Leon Pan and   
                 Chang N. Zhang   A criterion-based multilayer access
                                  control approach for multimedia
                                  applications and the implementation
                                  considerations . . . . . . . . . . . . . 17:1--17:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 5, Number 3, August, 2009

    K. Selçuk Candan and   
          Alberto Del Bimbo and   
            Carsten Griwodz and   
               Alejandro Jaimes   Introduction to the special section for
                                  the best papers of ACM Multimedia 2008   18:1--18:??
                Pablo Cesar and   
       Dick C. A. Bulterman and   
                Jack Jansen and   
               David Geerts and   
             Hendrik Knoche and   
                 William Seager   Fragment, tag, enrich, and send:
                                  Enhancing social sharing of video  . . . 19:1--19:??
                  H. Knoche and   
                    M. A. Sasse   The big picture on small screens
                                  delivering acceptable video quality in
                                  mobile TV  . . . . . . . . . . . . . . . 20:1--20:??
           Sebastien Mondet and   
                  Wei Cheng and   
            Geraldine Morin and   
           Romulus Grigoras and   
            Frederic Boudon and   
                  Wei Tsang Ooi   Compact and progressive plant models for
                                  streaming in networked virtual
                                  environments . . . . . . . . . . . . . . 21:1--21:??
                   Yong Wei and   
    Suchendra M. Bhandarkar and   
                        Kang Li   Client-centered multimedia content
                                  adaptation . . . . . . . . . . . . . . . 22:1--22:??
        G. S. V. S. Sivaram and   
       Mohan S. Kankanhalli and   
             K. R. Ramakrishnan   Design of multimedia surveillance
                                  systems  . . . . . . . . . . . . . . . . 23:1--23:??
                Xiaotao Liu and   
                Mark Corner and   
                Prashant Shenoy   SEVA: Sensor-enhanced video annotation   24:1--24:??
                  Bing Wang and   
                    Wei Wei and   
                  Zheng Guo and   
                    Don Towsley   Multipath live streaming via TCP:
                                  Scheme, performance and benefits . . . . 25:1--25:??
                 Mingzhe Li and   
              Mark Claypool and   
                 Robert Kinicki   Playout buffer and rate optimization for
                                  streaming over IEEE 802.11 wireless
                                  networks . . . . . . . . . . . . . . . . 26:1--26:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 5, Number 4, October, 2009

             Danielle Sauer and   
                  Yee-Hong Yang   Music-driven character animation . . . . 27:1--27:??
             Robert H. Deng and   
                  Yanjiang Yang   A study of content authentication in
                                  proxy-enabled multimedia delivery
                                  systems: Model, techniques, and
                                  applications . . . . . . . . . . . . . . 28:1--28:??
                Jongeun Cha and   
                Mohamad Eid and   
         Abdulmotaleb El Saddik   Touchable $3$D video system  . . . . . . 29:1--29:??
 Fabrício Benevenuto and   
            Tiago Rodrigues and   
           Virgilio Almeida and   
            Jussara Almeida and   
                     Keith Ross   Video interactions in online video
                                  social networks  . . . . . . . . . . . . 30:1--30:??
              Maike Erdmann and   
            Kotaro Nakayama and   
              Takahiro Hara and   
                 Shojiro Nishio   Improving the extraction of bilingual
                                  terminology from Wikipedia . . . . . . . 31:1--31:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 6, Number 1, February, 2010

            Niklas Carlsson and   
                 Derek L. Eager   Server selection in large-scale
                                  video-on-demand systems  . . . . . . . . 1:1--1:??
              Parag Agarwal and   
       Balakrishnan Prabhakaran   Blind robust watermarking of $3$D motion
                                  data . . . . . . . . . . . . . . . . . . 2:1--2:??
                        Bo Yang   DSI: a model for distributed multimedia
                                  semantic indexing and content
                                  integration  . . . . . . . . . . . . . . 3:1--3:??
        Marcus Nyström and   
              Kenneth Holmqvist   Effect of compressed offline foveated
                                  video on viewing behavior and subjective
                                  quality  . . . . . . . . . . . . . . . . 4:1--4:??
             Yuri V. Ivanov and   
                 C. J. Bleakley   Real-time H.264 video encoding in
                                  software with fast mode decision and
                                  dynamic complexity control . . . . . . . 5:1--5:??
            Mohamed Hefeeda and   
            Kianoosh Mokhtarian   Authentication schemes for multimedia
                                  streams: Quantitative analysis and
                                  comparison . . . . . . . . . . . . . . . 6:1--6:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 6, Number 2, March, 2010

                Zhenyu Yang and   
                  Wanmin Wu and   
            Klara Nahrstedt and   
           Gregorij Kurillo and   
                  Ruzena Bajcsy   Enabling multi-party $3$D tele-immersive
                                  environments with \em ViewCast . . . . . 7:1--7:??
                  Junwen Wu and   
               Mohan M. Trivedi   An eye localization, tracking and blink
                                  pattern recognition system: Algorithm
                                  and evaluation . . . . . . . . . . . . . 8:1--8:??
                   Xing Jin and   
                S.-H. Gary Chan   Detecting malicious nodes in
                                  peer-to-peer streaming by peer-based
                                  monitoring . . . . . . . . . . . . . . . 9:1--9:??
               Chih-Yi Chiu and   
              Hsin-Min Wang and   
                  Chu-Song Chen   Fast min-hashing indexing and robust
                                  spatio-temporal matching for detecting
                                  video copies . . . . . . . . . . . . . . 10:1--10:??
            Nabil J. Sarhan and   
       Mohammad A. Alsmirat and   
               Musab Al-Hadrusi   Waiting-time prediction in scalable
                                  on-demand video streaming  . . . . . . . 11:1--11:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 6, Number 3, August, 2010

              Changsheng Xu and   
         Eckehard Steinbach and   
     Abdulmotaleb El Saddik and   
                  Michelle Zhou   Introduction to the best papers of ACM
                                  Multimedia 2009  . . . . . . . . . . . . 12:1--12:??
              Zheng-Jun Zha and   
                Linjun Yang and   
                    Tao Mei and   
                  Meng Wang and   
                Zengfu Wang and   
              Tat-Seng Chua and   
                 Xian-Sheng Hua   Visual query suggestion: Towards
                                  capturing user intent in Internet image
                                  search . . . . . . . . . . . . . . . . . 13:1--13:??
                  Wei Jiang and   
           Courtenay Cotton and   
              Shih-Fu Chang and   
                  Dan Ellis and   
              Alexander C. Loui   Audio-visual atoms for generic video
                                  concept classification . . . . . . . . . 14:1--14:??
        Rodrigo De Oliveira and   
            Mauro Cherubini and   
                   Nuria Oliver   Looking at near-duplicate videos from a
                                  human-centric perspective  . . . . . . . 15:1--15:??
                    Hao Yin and   
                Xuening Liu and   
                Tongyu Zhan and   
                 Vyas Sekar and   
                   Feng Qiu and   
                 Chuang Lin and   
                  Hui Zhang and   
                          Bo Li   LiveSky: Enhancing CDN with P2P  . . . . 16:1--16:??
            Arthur G. Money and   
                    Harry Agius   ELVIS: Entertainment-Led VIdeo Summaries 17:1--17:??
           Steven C. h. Hoi and   
                    Wei Liu and   
                  Shih-Fu Chang   Semi-supervised distance metric learning
                                  for collaborative image retrieval and
                                  clustering . . . . . . . . . . . . . . . 18:1--18:??
          Namunu C. Maddage and   
               Khe Chai Sim and   
                     Haizhou Li   Word level automatic alignment of music
                                  and lyrics using vocal synthesis . . . . 19:1--19:??
               Bashar Qudah and   
                Nabil J. Sarhan   Efficient delivery of on-demand video
                                  streams to heterogeneous receivers . . . 20:1--20:??
    João V. P. Gomes and   
  Pedro R. M. Inácio and   
               Branka Lakic and   
     Mário M. Freire and   
    Henrique J. A. Da Silva and   
              Paulo P. Monteiro   Source traffic analysis  . . . . . . . . 21:1--21:??
               Susanne Boll and   
                  Jiebo Luo and   
                Ramesh Jain and   
                        Dong Xu   Call for papers: ACM Transactions on
                                  Multimedia Computing, Communications and
                                  Applications special issue on social
                                  media  . . . . . . . . . . . . . . . . . 22:1--22:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 6, Number 4, November, 2010

                 Ralf Steinmetz   Obituary to our dear friend Professor
                                  Dr. Nicolas D. Georganas, PhD  . . . . . 23:1--23:??
             Thomas Haenselmann   Foreword to the special issue on
                                  multimedia sensor fusion . . . . . . . . 24:1--24:??
               Xiangyu Wang and   
              Mohan Kankanhalli   MultiFusion: a boosting approach for
                                  multimedia fusion  . . . . . . . . . . . 25:1--25:??
              Girija Chetty and   
                  Matthew White   Multimedia sensor fusion for retrieving
                                  identity in biometric access control
                                  systems  . . . . . . . . . . . . . . . . 26:1--26:??
           Gerald Friedland and   
                Chuohao Yeo and   
                    Hayley Hung   Dialocalization: Acoustic speaker
                                  diarization and visual localization as
                                  joint optimization problem . . . . . . . 27:1--27:??
Abu Saleh Md Mahfujur Rahman and   
           M. Anwar Hossain and   
         Abdulmotaleb El Saddik   Spatial-geometric approach to physical
                                  mobile interaction based on
                                  accelerometer and IR sensory data fusion 28:1--28:??
                Zhenyu Yang and   
                  Wanmin Wu and   
            Klara Nahrstedt and   
           Gregorij Kurillo and   
                  Ruzena Bajcsy   Enabling multiparty $3$D tele-immersive
                                  environments with ViewCast . . . . . . . 29:1--29:??
            Damien Marshall and   
      Séamus Mcloone and   
              Tomás Ward   Optimizing consistency by maximizing
                                  bandwidth usage in distributed
                                  interactive applications . . . . . . . . 30:1--30:??
                    Long Vu and   
             Indranil Gupta and   
            Klara Nahrstedt and   
                      Jin Liang   Understanding overlay characteristics of
                                  a large-scale peer-to-peer IPTV system   31:1--31:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 7, Number 1, January, 2011

                Marek Meyer and   
          Christoph Rensing and   
                 Ralf Steinmetz   Multigranularity reuse of learning
                                  resources  . . . . . . . . . . . . . . . 1:1--1:??
            Samia Bouyakoub and   
             Abdelkader Belkhir   SMIL builder: an incremental authoring
                                  tool for SMIL Documents  . . . . . . . . 2:1--2:??
           M. Anwar Hossain and   
           Pradeep K. Atrey and   
         Abdulmotaleb El Saddik   Modeling and assessing quality of
                                  information in multisensor multimedia
                                  monitoring systems . . . . . . . . . . . 3:1--3:??
                 Jianke Zhu and   
           Steven C. H. Hoi and   
             Michael R. Lyu and   
                  Shuicheng Yan   Near-duplicate keyframe retrieval by
                                  semi-supervised learning and nonrigid
                                  image matching . . . . . . . . . . . . . 4:1--4:??
             Cheng-Hsin Hsu and   
                Mohamed Hefeeda   A framework for cross-layer optimization
                                  of video streaming in wireless networks  5:1--5:??
           Surendar Chandra and   
                       Xuwen Yu   An empirical analysis of serendipitous
                                  media sharing among campus-wide wireless
                                  users  . . . . . . . . . . . . . . . . . 6:1--6:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 7, Number 2, February, 2011

            Ajay Gopinathan and   
                    Zongpeng Li   Optimal layered multicast  . . . . . . . 7:1--7:??
             Cheng-Hsin Hsu and   
                Mohamed Hefeeda   Using simulcast and scalable video
                                  coding to efficiently control channel
                                  switching delay in mobile TV broadcast
                                  networks . . . . . . . . . . . . . . . . 8:1--8:??
                  Yohan Jin and   
       Balakrishnan Prabhakaran   Knowledge discovery from $3$D human
                                  motion streams through semantic
                                  dimensional reduction  . . . . . . . . . 9:1--9:??
                  Wei Cheng and   
              Wei Tsang Ooi and   
           Sebastien Mondet and   
           Romulus Grigoras and   
         Géraldine Morin   Modeling progressive mesh streaming:
                                  Does data dependency matter? . . . . . . 10:1--10:??
                  Susmit Bagchi   A fuzzy algorithm for dynamically
                                  adaptive multimedia streaming  . . . . . 11:1--11:??
             Cheng-Hsin Hsu and   
                Mohamed Hefeeda   Statistical multiplexing of
                                  variable-bit-rate videos streamed to
                                  mobile devices . . . . . . . . . . . . . 12:1--12:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 7, Number 3, August, 2011

                 Ralf Steinmetz   Editorial notice . . . . . . . . . . . . 13:1--13:??
            Pavel Korshunov and   
                  Wei Tsang Ooi   Video quality for face detection,
                                  recognition, and tracking  . . . . . . . 14:1--14:??
                 Pei-Yu Lin and   
               Jung-San Lee and   
                Chin-Chen Chang   Protecting the content integrity of
                                  digital imagery with fidelity
                                  preservation . . . . . . . . . . . . . . 15:1--15:??
      Reinier H. Van Leuken and   
              Remco C. Veltkamp   Selecting vantage objects for similarity
                                  indexing . . . . . . . . . . . . . . . . 16:1--16:??
                Wu-Chi Feng and   
                 Thanh Dang and   
             John Kassebaum and   
                     Tim Bauman   Supporting region-of-interest cropping
                                  through constrained compression  . . . . 17:1--17:??
              Qingzhong Liu and   
             Andrew H. Sung and   
                    Mengyu Qiao   Derivative-based audio steganalysis  . . 18:1--18:??
         Frederick W. B. Li and   
           Rynson W. H. Lau and   
                Danny Kilis and   
                 Lewis W. F. Li   Game-on-demand:: an online game engine
                                  based on geometry streaming  . . . . . . 19:1--19:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 7S, Number 1, 2011

      Shervin Shirmohammadi and   
                  Jiebo Luo and   
                   Jie Yang and   
         Abdulmotaleb El Saddik   Introduction to ACM Multimedia 2010 best
                                  paper candidates . . . . . . . . . . . . 20:1--20:??
    Subhabrata Bhattacharya and   
           Rahul Sukthankar and   
                   Mubarak Shah   A holistic approach to aesthetic
                                  enhancement of photographs . . . . . . . 21:1--21:??
                Shulong Tan and   
                  Jiajun Bu and   
                  Chun Chen and   
                     Bin Xu and   
                   Can Wang and   
                     Xiaofei He   Using rich social media information for
                                  music recommendation via hypergraph
                                  model  . . . . . . . . . . . . . . . . . 22:1--22:??
              Simone Milani and   
             Giancarlo Calvagno   A cognitive approach for effective
                                  coding and transmission of $3$D video    23:1--23:??
               Richang Hong and   
                  Meng Wang and   
             Xiao-Tong Yuan and   
                  Mengdi Xu and   
              Jianguo Jiang and   
              Shuicheng Yan and   
                  Tat-Seng Chua   Video accessibility enhancement for
                                  hearing-impaired users . . . . . . . . . 24:1--24:??
               Susanne Boll and   
                Ramesh Jain and   
                  Jiebo Luo and   
                        Dong Xu   Introduction to special issue on social
                                  media  . . . . . . . . . . . . . . . . . 25:1--25:??
               Yu-Ching Lin and   
              Yi-Hsuan Yang and   
                  Homer H. Chen   Exploiting online music tags for music
                                  emotion classification . . . . . . . . . 26:1--26:??
            Mohamad Rabbath and   
           Philipp Sandhaus and   
                   Susanne Boll   Automatic creation of photo books from
                                  stories in social media  . . . . . . . . 27:1--27:??
                 Weiming Hu and   
               Haiqiang Zuo and   
                      Ou Wu and   
                Yunfei Chen and   
             Zhongfei Zhang and   
                    David Suter   Recognition of adult images, videos, and
                                  web page bags  . . . . . . . . . . . . . 28:1--28:??
                  Yu-Ru Lin and   
   K. Selçcuk Candan and   
              Hari Sundaram and   
                     Lexing Xie   SCENT: Scalable compressed monitoring of
                                  evolving multirelational social networks 29:1--29:??
                 Jitao Sang and   
                  Changsheng Xu   Browse by chunks: Topic mining and
                                  organizing on web-scale social media . . 30:1--30:??
                Rongrong Ji and   
                    Yue Gao and   
               Bineng Zhong and   
                Hongxun Yao and   
                        Qi Tian   Mining flickr landmarks by modeling
                                  reconstruction sparsity  . . . . . . . . 31:1--31:??
          Michael I. Mandel and   
             Razvan Pascanu and   
                Douglas Eck and   
              Yoshua Bengio and   
             Luca M. Aiello and   
        Rossano Schifanella and   
                Filippo Menczer   Contextual tag inference . . . . . . . . 32:1--32:??
            Joan-Isaac Biel and   
            Daniel Gatica-Perez   VlogSense: Conversational behavior and
                                  social attention in YouTube  . . . . . . 33:1--33:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 7, Number 4, November, 2011

                      Anonymous   Table of Contents: Online Supplement
                                  Volume 7S, Number 1  . . . . . . . . . . 34:1--34:??
               Richang Hong and   
                Jinhui Tang and   
             Hung-Khoon Tan and   
              Chong-Wah Ngo and   
              Shuicheng Yan and   
                  Tat-Seng Chua   Beyond search: Event-driven
                                  summarization for Web videos . . . . . . 35:1--35:??
              Wen-Kuang Kuo and   
                     Kuo-Wei Wu   Traffic prediction and QoS transmission
                                  of real-time live VBR videos in WLANs    36:1--36:??
          Namunu C. Maddage and   
                     Haizhou Li   Beat space segmentation and octave scale
                                  cepstral feature for sung language
                                  recognition in pop music . . . . . . . . 37:1--37:??
                 Simone Santini   Efficient computation of queries on
                                  feature streams  . . . . . . . . . . . . 38:1--38:??
             Renato Verdugo and   
            Miguel Nussbaum and   
                Pablo Corro and   
        Pablo Nuñnez and   
                Paula Navarrete   Interactive films and coconstruction . . 39:1--39:??
    Shahram Ghandeharizadeh and   
               Shahin Shayandeh   Domical cooperative caching for
                                  streaming media in wireless home
                                  networks . . . . . . . . . . . . . . . . 40:1--40:??
    Shahram Ghandeharizadeh and   
               Shahin Shayandeh   Call for papers: Special issue on $3$D
                                  mobile multimedia  . . . . . . . . . . . 41:1--41:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 8, Number 1, January, 2012

                 Ralf Steinmetz   Editorial note and call for nominations:
                                  Nicolas D. Georganas best paper award    1:1--1:??
           Georghita Ghinea and   
              Oluwakemi Ademoye   The sweet smell of success: Enhancing
                                  multimedia applications with olfaction   2:1--2:??
            Mohamed Hefeeda and   
                 Cheng-Hsin Hsu   Design and evaluation of a testbed for
                                  mobile TV networks . . . . . . . . . . . 3:1--3:??
                  Yu-Ru Lin and   
              Hari Sundaram and   
        Munmun De Choudhury and   
               Aisling Kelliher   Discovering multirelational structure in
                                  social media streams . . . . . . . . . . 4:1--4:??
                   Xu Cheng and   
                 Jiangchuan Liu   Exploring interest correlation for
                                  peer-to-peer socialized video sharing    5:1--5:??
                    Tao Mei and   
                  Lusong Li and   
             Xian-Sheng Hua and   
                     Shipeng Li   ImageSense: Towards contextual image
                                  advertising  . . . . . . . . . . . . . . 6:1--6:??
              Lauro Snidaro and   
           Ingrid Visentini and   
              Gian Luca Foresti   Fusing multiple video sensors for
                                  surveillance . . . . . . . . . . . . . . 7:1--7:??
            Jiun-Long Huang and   
            Shih-Chuan Chiu and   
                  Man-Kwan Shan   Towards an automatic music arrangement
                                  framework using score reduction  . . . . 8:1--8:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 8s, Number 1, February, 2012

                 Ralf Steinmetz   Editorial note . . . . . . . . . . . . . 9:1--9:??
                 Dongyu Liu and   
                     Fei Li and   
                    Bo Shen and   
                  Songqing Chen   Building an efficient transcoding
                                  overlay for P2P streaming to
                                  heterogeneous devices  . . . . . . . . . 10:1--10:??
                Zhijie Shen and   
               Roger Zimmermann   ISP-friendly P2P live streaming: a
                                  roadmap to realization . . . . . . . . . 11:1--11:??
               Xiaosong Lou and   
                      Kai Hwang   Quality of data delivery in peer-to-peer
                                  video streaming  . . . . . . . . . . . . 12:1--12:??
                   Chuan Wu and   
                 Baochun Li and   
                   Shuqiao Zhao   Diagnosing network-wide P2P live
                                  streaming inefficiencies . . . . . . . . 13:1--13:??
                   Chuan Wu and   
                Zongpeng Li and   
                Xuanjia Qiu and   
              Francis C. M. Lau   Auction-based P2P VoD streaming:
                                  Incentives and optimal scheduling  . . . 14:1--14:??
              Tieying Zhang and   
                Xueqi Cheng and   
                Jianming Lv and   
                 Zhenhua Li and   
                    Weisong Shi   Providing hierarchical lookup service
                                  for P2P--VoD systems . . . . . . . . . . 15:1--15:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 8, Number 2, May, 2012

                      Anonymous   Table of Contents: Online Supplement
                                  Volume 8S, Number 1  . . . . . . . . . . 16:1--16:??
              Fadi Dornaika and   
                 James H. Elder   Image registration for foveated
                                  panoramic sensing  . . . . . . . . . . . 17:1--17:??
                  Xin Zhang and   
          Tomás Ward and   
          Séamus Mcloone   Comparison of predictive contract
                                  mechanisms from an information theory
                                  perspective  . . . . . . . . . . . . . . 18:1--18:??
               Dan R. Olsen and   
                 Derek Bunn and   
              Trent Boulter and   
                    Robert Walz   Interactive television news  . . . . . . 19:1--19:??
         Grenville Armitage and   
                    Amiel Heyde   REED: Optimizing first person shooter
                                  game server discovery using network
                                  coordinates  . . . . . . . . . . . . . . 20:1--20:??
                Xiaobai Liu and   
              Shuicheng Yan and   
              Tat-Seng Chua and   
                        Hai Jin   Image label completion by pursuing
                                  contextual decomposability . . . . . . . 21:1--21:??
                    Yi Chen and   
      Abhidnya A. Deshpande and   
         Ramazan S. Aygüun   Sprite generation using sprite fusion    22:1--22:??
             Ming-Fang Weng and   
                 Yung-Yu Chuang   Collaborative video reindexing via
                                  matrix factorization . . . . . . . . . . 23:1--23:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 8, Number 2S, September, 2012

           Mohan S. Kankanhalli   Introduction to special issue on
                                  multimedia security  . . . . . . . . . . 31:1--31:??
              Jonathan Weir and   
                  Weiqi Yan and   
           Mohan S. Kankanhalli   Image hatching for visual cryptography   32:1--32:??
                    Jian Li and   
                Hongmei Liu and   
                 Jiwu Huang and   
                     Yun Q. Shi   Reference index-based H.264 video
                                  watermarking scheme  . . . . . . . . . . 33:1--33:??
                 Xifeng Gao and   
              Caiming Zhang and   
                  Yan Huang and   
                   Zhigang Deng   A robust high-capacity
                                  affine-transformation-invariant scheme
                                  for watermarking $3$D geometric models   34:1--34:??
                   Rui Yang and   
                 Zhenhua Qu and   
                     Jiwu Huang   Exposing MP3 audio forgeries using frame
                                  offsets  . . . . . . . . . . . . . . . . 35:1--35:??
                   Hui Feng and   
                 Hefei Ling and   
                  Fuhao Zou and   
                  Weiqi Yan and   
                   Zhengding Lu   A collusion attack optimization strategy
                                  for digital fingerprinting . . . . . . . 36:1--36:??
                Amit Sachan and   
              Sabu Emmanuel and   
           Mohan S. Kankanhalli   Aggregate licenses validation for
                                  digital rights violation detection . . . 37:1--37:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 8, Number 3, July, 2012

              Haakon Riiser and   
              Tore Endestad and   
             Paul Vigmostad and   
            Carsten Griwodz and   
            Pâl Halvorsen   Video streaming using a location-based
                                  bandwidth-lookup service for bitrate
                                  planning . . . . . . . . . . . . . . . . 24:1--24:??
              Victor Valdes and   
               Jose M. Martinez   Automatic evaluation of video summaries  25:1--25:??
                Xinmei Tian and   
                Dacheng Tao and   
                       Yong Rui   Sparse transfer learning for interactive
                                  video search reranking . . . . . . . . . 26:1--26:??
                  Xin Zhang and   
       Tomás E. Ward and   
          Séamus Mcloone   An information-based dynamic
                                  extrapolation model for networked
                                  virtual environments . . . . . . . . . . 27:1--27:??
                Linjun Yang and   
                    Bo Geng and   
              Alan Hanjalic and   
                 Xian-Sheng Hua   A unified context model for web image
                                  retrieval  . . . . . . . . . . . . . . . 28:1--28:??
                Paul Patras and   
              Albert Banchs and   
                  Pablo Serrano   A control theoretic scheme for efficient
                                  video transmission over IEEE 802.11e
                                  EDCA WLANs . . . . . . . . . . . . . . . 29:1--29:??
                Xinglei Zhu and   
                  Chang W. Chen   A joint layered scheme for reliable and
                                  secure mobile JPEG-2000 streaming  . . . 30:1--30:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 8, Number 3s, September, 2012

        Daniel Gatica-Perez and   
                   Gang Hua and   
              Wei Tsang Ooi and   
            Pål Halvorsen   Introduction to the special section of
                                  best papers of ACM Multimedia 2011 . . . 38:1--38:??
                  Wanmin Wu and   
               Ahsan Arefin and   
           Gregorij Kurillo and   
              Pooja Agarwal and   
            Klara Nahrstedt and   
                  Ruzena Bajcsy   CZLoD: a psychophysical approach for
                                  $3$D tele-immersive video  . . . . . . . 39:1--39:??
                Rongrong Ji and   
                Felix X. Yu and   
              Tongtao Zhang and   
                  Shih-Fu Chang   Active query sensing: Suggesting the
                                  best query view for mobile visual search 40:1--40:??
      Shervin Shirmohammadi and   
            Mohamed Hefeeda and   
              Wei Tsang Ooi and   
               Romulus Grigoras   Introduction to special section on $3$D
                                  mobile multimedia  . . . . . . . . . . . 41:1--41:??
                 Yanwei Liu and   
                    Song Ci and   
                   Hui Tang and   
                     Yun Ye and   
                     Jinxia Liu   QoE-oriented $3$D video transcoding for
                                  mobile streaming . . . . . . . . . . . . 42:1--42:??
                 Shujie Liu and   
                 Chang Wen Chen   A novel $3$D video transcoding scheme
                                  for adaptive $3$D video transmission to
                                  heterogeneous terminals  . . . . . . . . 43:1--43:??
               Hoda Roodaki and   
       Mahmoud Reza Hashemi and   
          Shervin Shirmohammadi   A new methodology to derive objective
                                  quality assessment metrics for scalable
                                  multiview $3$D video coding  . . . . . . 44:1--44:??
                Ahmed Hamza and   
                Mohamed Hefeeda   Energy-efficient multicasting of
                                  multiview $3$D videos to mobile devices  45:1--45:??
                    Shu Shi and   
            Klara Nahrstedt and   
                   Roy Campbell   A real-time remote rendering system for
                                  interactive mobile graphics  . . . . . . 46:1--46:??
                   Wei Guan and   
                   Suya You and   
                 Ulrich Newmann   Efficient matchings and mobile augmented
                                  reality  . . . . . . . . . . . . . . . . 47:1--47:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 8, Number 4, November, 2012

                  TOMCCAP-STAFF   Table of contents: Online supplement
                                  volume 8, number 2s, online supplement
                                  volume 8, number 3s  . . . . . . . . . . 48:1--48:??
                 Ralf Steinmetz   Editorial  . . . . . . . . . . . . . . . 49:1--49:??
                Xiaobai Liu and   
              Shuicheng Yan and   
                  Bin Cheng and   
                Jinhui Tang and   
             Tat-Sheng Chua and   
                        Hai Jin   Label-to-region with continuity-biased
                                  bi-layer sparsity priors . . . . . . . . 50:1--50:??
               Ork De Rooij and   
                 Marcel Worring   Efficient targeted search using a focus
                                  and context video browser  . . . . . . . 51:1--51:??
          Gheorghita Ghinea and   
              Oluwakemi Ademoye   User perception of media content
                                  association in olfaction-enhanced
                                  multimedia . . . . . . . . . . . . . . . 52:1--52:??
                Ryan Spicer and   
                  Yu-Ru Lin and   
           Aisling Kelliher and   
                  Hari Sundaram   NextSlidePlease: Authoring and
                                  delivering agile multimedia
                                  presentations  . . . . . . . . . . . . . 53:1--53:??
                    Heng Qi and   
                   Keqiu Li and   
               Yanming Shen and   
                       Wenyu Qu   Object-based image retrieval with kernel
                                  on adjacency matrix and local combined
                                  features . . . . . . . . . . . . . . . . 54:1--54:??
                 Guangda Li and   
                  Meng Wang and   
                   Zheng Lu and   
               Richang Hong and   
                  Tat-Seng Chua   In-video product annotation with Web
                                  information mining . . . . . . . . . . . 55:1--55:??
            Ajay Gopinathan and   
                    Zongpeng Li   Algorithms for stochastic optimization
                                  of multicast content delivery with
                                  network coding . . . . . . . . . . . . . 56:1--56:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 9, Number 1, February, 2013

              Mark Hendrikx and   
          Sebastiaan Meijer and   
       Joeri Van Der Velden and   
                Alexandru Iosup   Procedural content generation for games:
                                  a survey . . . . . . . . . . . . . . . . 1:1--1:??
                   Dong Liu and   
              Shuicheng Yan and   
               Rong-Rong Ji and   
             Xian-Sheng Hua and   
               Hong-Jiang Zhang   Image retrieval with query-adaptive
                                  hashing  . . . . . . . . . . . . . . . . 2:1--2:??
              Yan-Tao Zheng and   
              Shuicheng Yan and   
              Zheng-Jun Zha and   
                   Yiqun Li and   
             Xiangdong Zhou and   
              Tat-Seng Chua and   
                    Ramesh Jain   GPSView: a scenic driving route planner  3:1--3:??
               Wengang Zhou and   
                Houqiang Li and   
                  Yijuan Lu and   
                        Qi Tian   SIFT match verification by geometric
                                  coding for large-scale partial-duplicate
                                  web image search . . . . . . . . . . . . 4:1--4:??
            Jong-Seung Park and   
                    Ramesh Jain   Identification of scene locations from
                                  geotagged images . . . . . . . . . . . . 5:1--5:??
               Yichuan Wang and   
                Ting-An Lin and   
             Cheng-Hsin Hsu and   
                        Xin Liu   Region- and action-aware virtual world
                                  clients  . . . . . . . . . . . . . . . . 6:1--6:??
       Naghmeh Khodabakhshi and   
                Mohamed Hefeeda   Spider: a system for finding $3$D video
                                  copies . . . . . . . . . . . . . . . . . 7:1--7:??
              Austin Abrams and   
                   Robert Pless   Web-accessible geographic integration
                                  and calibration of webcams . . . . . . . 8:1--8:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 9, Number 1s, October, 2013

                 Ralf Steinmetz   Editorial note . . . . . . . . . . . . . 31:1--31:??
            Klara Nahrstedt and   
            Rainer Lienhart and   
                 Malcolm Slaney   Introduction to the special section on
                                  the 20th anniversary of the ACM
                                  International Conference on Multimedia   32:1--32:??
                 Baochun Li and   
                   Zhi Wang and   
             Jiangchuan Liu and   
                      Wenwu Zhu   Two decades of Internet video streaming:
                                  a retrospective view . . . . . . . . . . 33:1--33:??
                Zixia Huang and   
            Klara Nahrstedt and   
                 Ralf Steinmetz   Evolution of temporal multimedia
                                  synchronization principles: a historical
                                  viewpoint  . . . . . . . . . . . . . . . 34:1--34:??
       Dick C. A. Bulterman and   
                Pablo Cesar and   
Rodrigo Laiola Guimarães   Socially-aware multimedia authoring:
                                  Past, present, and future  . . . . . . . 35:1--35:??
                  Lei Zhang and   
                       Yong Rui   Image search-from thousands to billions
                                  in 20 years  . . . . . . . . . . . . . . 36:1--36:??
               Lawrence A. Rowe   Looking forward 10 years to multimedia
                                  successes  . . . . . . . . . . . . . . . 37:1--37:??
                Prashant Shenoy   Multimedia systems research: The first
                                  twenty years and lessons for the next
                                  twenty . . . . . . . . . . . . . . . . . 38:1--38:??
                    Kien A. Hua   Online video delivery: Past, present,
                                  and future . . . . . . . . . . . . . . . 39:1--39:??
        Viswanathan Swaminathan   Are we in the middle of a video
                                  streaming revolution?  . . . . . . . . . 40:1--40:??
                 Philip A. Chou   Advances in immersive communication: (1)
                                  Telephone, (2) Television, (3)
                                  Teleportation  . . . . . . . . . . . . . 41:1--41:??
                  Shih-Fu Chang   How far we've come: Impact of 20 years
                                  of multimedia information retrieval  . . 42:1--42:??
            Wolfgang Effelsberg   A personal look back at twenty years of
                                  research in multimedia content analysis  43:1--43:??
                  Alan Hanjalic   Multimedia retrieval that matters  . . . 44:1--44:??
                   Matthew Turk   Over twenty years of eigenfaces  . . . . 45:1--45:??
                  Brian Whitman   Care and scale: Fifteen years of music
                                  retrieval  . . . . . . . . . . . . . . . 46:1--46:??
           Richard Szeliski and   
               Noah Snavely and   
                Steven M. Seitz   Navigating the worldwide community of
                                  photos . . . . . . . . . . . . . . . . . 47:1--47:??
                Elisabeth Andre   Exploiting unconscious user signals in
                                  multimodal human-computer interaction    48:1--48:??
                  Hari Sundaram   Experiential media systems . . . . . . . 49:1--49:??
Ioannis (Yiannis) Kompatsiaris and   
        Wenjun (Kevin) Zeng and   
                   Gang Hua and   
                 Liangliang Cao   Introduction to the special section of
                                  best papers of ACM multimedia 2012 . . . 50:1--50:??
                   Heng Liu and   
                    Tao Mei and   
                Houqiang Li and   
                  Jiebo Luo and   
                     Shipeng Li   Robust and accurate mobile visual
                                  localization and its applications  . . . 51:1--51:??
                   Zhi Wang and   
                  Wenwu Zhu and   
              Xiangwen Chen and   
                 Lifeng Sun and   
             Jiangchuan Liu and   
               Minghua Chen and   
                   Peng Cui and   
                  Shiqiang Yang   Propagation-based social-aware
                                  multimedia content distribution  . . . . 52:1--52:??
                 Jitao Sang and   
                  Changsheng Xu   Social influence analysis and
                                  application on multimedia sharing
                                  websites . . . . . . . . . . . . . . . . 53:1--53:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 9, Number 2, May, 2013

              Juan M. Silva and   
            Mauricio Orozco and   
                Jongeun Cha and   
     Abdulmotaleb El Saddik and   
                 Emil M. Petriu   Human perception of haptic-to-video and
                                  haptic-to-audio skew in multimedia
                                  applications . . . . . . . . . . . . . . 9:1--9:??
          Chidansh A. Bhatt and   
           Pradeep K. Atrey and   
           Mohan S. Kankanhalli   A reward-and-punishment-based approach
                                  for concept detection using adaptive
                                  ontology rules . . . . . . . . . . . . . 10:1--10:??
        Fawaz A. Alsulaiman and   
                 Nizar Sakr and   
     Julio J. Valdés and   
         Abdulmotaleb El Saddik   Identity verification based on
                                  handwritten signatures with haptic
                                  information using genetic programming    11:1--11:??
               Qianni Zhang and   
               Ebroul Izquierdo   Multifeature analysis and semantic
                                  context learning for image
                                  classification . . . . . . . . . . . . . 12:1--12:??
              Zhen Wei Zhao and   
             Sameer Samarth and   
                  Wei Tsang Ooi   Modeling the effect of user interactions
                                  on mesh-based P2P VoD streaming systems  13:1--13:??
                  Yang Yang and   
                    Yi Yang and   
                  Heng Tao Shen   Effective transfer tagging from image to
                                  video  . . . . . . . . . . . . . . . . . 14:1--14:??
              Zhen Wei Zhao and   
                  Wei Tsang Ooi   APRICOD: an access-pattern-driven
                                  distributed caching middleware for fast
                                  content discovery of noncontinuous media
                                  access . . . . . . . . . . . . . . . . . 15:1--15:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 9, Number 3, June, 2013

                      Anonymous   Call for papers: Multiple sensorial
                                  (MulSeMedia) multi-modal media: Advances
                                  and applications . . . . . . . . . . . . 15:1--15:??
                    Tao Mei and   
               Lin-Xie Tang and   
                Jinhui Tang and   
                 Xian-Sheng Hua   Near-lossless semantic video
                                  summarization and its applications to
                                  video analysis . . . . . . . . . . . . . 16:1--16:??
       Oluwakemi A. Ademoye and   
              Gheorghita Ghinea   Information recall task impact in
                                  olfaction-enhanced multimedia  . . . . . 17:1--17:??
                 Lo-Yao Yeh and   
                Jiun-Long Huang   A conditional access system with
                                  efficient key distribution and
                                  revocation for mobile pay-TV systems . . 18:1--18:??
             Ruchira Naskar and   
       Rajat Subhra Chakraborty   A generalized tamper localization
                                  approach for reversible watermarking
                                  algorithms . . . . . . . . . . . . . . . 19:1--19:??
           Jonathan Doherty and   
               Kevin Curran and   
                  Paul Mckevitt   A self-similarity approach to repairing
                                  large dropouts of streamed music . . . . 20:1--20:??
            Edmond S. L. Ho and   
           Jacky C. P. Chan and   
                Taku Komura and   
                   Howard Leung   Interactive partner control in close
                                  interactions for real-time applications  21:1--21:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 9, Number 4, August, 2013

                 Ralf Steinmetz   Editorial: Reviewers . . . . . . . . . . 22:1--22:??
               Kazuya Sakai and   
               Wei-Shinn Ku and   
                 Min-Te Sun and   
               Roger Zimmermann   Privacy preserving continuous multimedia
                                  streaming in MANETs  . . . . . . . . . . 23:1--23:??
                  Jian Dong and   
                  Bin Cheng and   
               Xiangyu Chen and   
              Tat-Seng Chua and   
              Shuicheng Yan and   
                        Xi Zhou   Robust image annotation via simultaneous
                                  feature and sample outlier pursuit . . . 24:1--24:??
         Arantxa Villanueva and   
              Victoria Ponz and   
        Laura Sesma-Sanchez and   
                 Mikel Ariz and   
                Sonia Porta and   
                  Rafael Cabeza   Hybrid method based on topography for
                                  robust detection of iris center and eye
                                  corners  . . . . . . . . . . . . . . . . 25:1--25:??
                    Bo Wang and   
               Jinqiao Wang and   
                     Hanqing Lu   Exploiting content relevance and social
                                  relevance for personalized ad
                                  recommendation on Internet TV  . . . . . 26:1--26:??
          Kazi Masudul Alam and   
Abu Saleh Md Mahfujur Rahman and   
         Abdulmotaleb El Saddik   Mobile haptic e-book system to support
                                  $3$D immersive reading in ubiquitous
                                  environments . . . . . . . . . . . . . . 27:1--27:??
              Tam V. Nguyen and   
                     Si Liu and   
                Bingbing Ni and   
                    Jun Tan and   
                   Yong Rui and   
                  Shuicheng Yan   Towards decrypting attractiveness via
                                  multi-modality cues  . . . . . . . . . . 28:1--28:??
                Jinhui Tang and   
                 Qiang Chen and   
                  Meng Wang and   
              Shuicheng Yan and   
              Tat-Seng Chua and   
                    Ramesh Jain   Towards optimizing human labeling for
                                  interactive image tagging  . . . . . . . 29:1--29:??
            Bogdan Carbunar and   
            Rahul Potharaju and   
             Michael Pearce and   
        Venugopal Vasudevan and   
                Michael Needham   A framework for network aware caching
                                  for video on demand systems  . . . . . . 30:1--30:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 10, Number 1, December, 2013

                  Zechao Li and   
                   Jing Liu and   
                  Meng Wang and   
              Changsheng Xu and   
                     Hanqing Lu   Enhancing news organization for
                                  convenient retrieval and browsing  . . . 1:1--1:??
                Peter Knees and   
                  Markus Schedl   A survey of music similarity and
                                  recommendation from music context data   2:1--2:??
              Yi-Liang Zhao and   
                 Qiang Chen and   
              Shuicheng Yan and   
              Tat-Seng Chua and   
                   Daqing Zhang   Detecting profilable and overlapping
                                  communities with user-generated
                                  multimedia contents in LBSNs . . . . . . 3:1--3:??
           Gaurav Bhatnagar and   
          Q. M. Jonathan Wu and   
               Pradeep K. Atrey   Secure randomized image watermarking
                                  based on singular value decomposition    4:1--4:??
                Luntian Mou and   
               Tiejun Huang and   
              Yonghong Tian and   
              Menglin Jiang and   
                        Wen Gao   Content-based copy detection through
                                  multimodal feature representation and
                                  temporal pyramid matching  . . . . . . . 5:1--5:??
               Xiangyu Chen and   
                  Yadong Mu and   
                Hairong Liu and   
              Shuicheng Yan and   
                   Yong Rui and   
                  Tat-Seng Chua   Large-scale multilabel propagation based
                                  on efficient sparse graph construction   6:1--6:??
           Michael E. Houle and   
               Vincent Oria and   
            Shin'ichi Satoh and   
                     Jichao Sun   Annotation propagation in image
                                  databases using similarity graphs  . . . 7:1--7:??
             Anupama Mallik and   
             Hiranmay Ghosh and   
          Santanu Chaudhury and   
                   Gaurav Harit   MOWL: an ontology representation
                                  language for Web-based multimedia
                                  applications . . . . . . . . . . . . . . 8:1--8:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 10, Number 2, February, 2014

                Yunhua Deng and   
               Rynson W. H. Lau   Dynamic load balancing in distributed
                                  virtual environments using heat
                                  diffusion  . . . . . . . . . . . . . . . 16:1--16:??
                  James She and   
              Jon Crowcroft and   
                     Hao Fu and   
                       Flora Li   Convergence of interactive displays with
                                  smart mobile devices for effective
                                  advertising: a survey  . . . . . . . . . 17:1--17:??
           Ekaterina Gonina and   
           Gerald Friedland and   
            Eric Battenberg and   
       Penporn Koanantakool and   
           Michael Driscoll and   
        Evangelos Georganas and   
                   Kurt Keutzer   Scalable multimedia content analysis on
                                  parallel platforms using Python  . . . . 18:1--18:??
           Surendar Chandra and   
              John Boreczky and   
               Lawrence A. Rowe   High performance many-to-many intranet
                                  screen sharing with DisplayCast  . . . . 19:1--19:??
                 Ya-Lin Lee and   
                Wen-Hsiang Tsai   A new data hiding method via revision
                                  history records on collaborative writing
                                  platforms  . . . . . . . . . . . . . . . 20:1--20:??
                   Jin Yuan and   
              Yi-Liang Zhao and   
                Huanbo Luan and   
                  Meng Wang and   
                  Tat-Seng Chua   Memory recall based video search:
                                  Finding videos you have seen before
                                  based on your memory . . . . . . . . . . 21:1--21:??
              Xianglong Liu and   
                  Yadong Mu and   
                    Bo Lang and   
                  Shih-Fu Chang   Mixed image-keyword query adaptive
                                  hashing over multilabel images . . . . . 22:1--22:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 10, Number 3, April, 2014

                      Anonymous   Table of Contents: Online Supplement
                                  Volume 10, Number 1s . . . . . . . . . . 22:1--22:??
                   Ning Liu and   
                 Huajie Cui and   
            S.-H. Gary Chan and   
               Zhipeng Chen and   
                  Yirong Zhuang   Dissecting User Behaviors for a
                                  Simultaneous Live and VoD IPTV System    23:1--23:??
              Rossano Gaeta and   
            Marco Grangetto and   
                  Lorenzo Bovio   DIP: Distributed Identification of
                                  Polluters in P2P Live Streaming  . . . . 24:1--24:??
    Mohammad Asharful Hoque and   
            Matti Siekkinen and   
          Jukka K. Nurminen and   
               Sasu Tarkoma and   
                     Mika Aalto   Saving Energy in Mobile Devices for
                                  On-Demand Multimedia Streaming --- A
                                  Cross-Layer Approach . . . . . . . . . . 25:1--25:??
                  Feng Wang and   
               Wan-Lei Zhao and   
              Chong-Wah Ngo and   
               Bernard Merialdo   A Hamming Embedding Kernel with
                                  Informative Bag-of-Visual Words for
                                  Video Semantic Indexing  . . . . . . . . 26:1--26:??
                  Ying Yang and   
           Ioannis Ivrissimtzis   Mesh Discriminative Features for $3$D
                                  Steganalysis . . . . . . . . . . . . . . 27:1--27:??
           Abdelwahab Hamam and   
     Abdulmotaleb El Saddik and   
                  Jihad Alja'am   A Quality of Experience Model for Haptic
                                  Virtual Environments . . . . . . . . . . 28:1--28:??
                Marco Botta and   
           Davide Cavagnino and   
                Victor Pomponiu   Protecting the Content Integrity of
                                  Digital Imagery with Fidelity
                                  Preservation: An Improved Version  . . . 29:1--29:??
                     Da Luo and   
                  Weiqi Luo and   
                   Rui Yang and   
                     Jiwu Huang   Identifying Compression History of Wave
                                  Audio and Its Applications . . . . . . . 30:1--30:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 10, Number 4, June, 2014

              Tianzhu Zhang and   
                  Changsheng Xu   Cross-Domain Multi-Event Tracking via
                                  CO-PMHT  . . . . . . . . . . . . . . . . 31:1--31:??
              Qinghua Huang and   
               Bisheng Chen and   
              Jingdong Wang and   
                        Tao Mei   Personalized Video Recommendation
                                  through Graph Propagation  . . . . . . . 32:1--32:??
                  Haitao Li and   
                   Xu Cheng and   
                 Jiangchuan Liu   Understanding Video Sharing Propagation
                                  in Social Networks: Measurement and
                                  Analysis . . . . . . . . . . . . . . . . 33:1--33:??
                 Zhiyu Wang and   
                   Peng Cui and   
                 Lexing Xie and   
                  Wenwu Zhu and   
                   Yong Rui and   
                  Shiqiang Yang   Bilateral Correspondence Model for
                                  Words-and-Pictures Association in
                                  Multimedia-Rich Microblogs . . . . . . . 34:1--34:??
               Yanqiang Lei and   
                Guoping Qiu and   
               Ligang Zheng and   
                     Jiwu Huang   Fast Near-Duplicate Image Detection
                                  Using Uniform Randomized Trees . . . . . 35:1--35:??
                Che-Hua Yeh and   
            Brian A. Barsky and   
                  Ming Ouhyoung   Personalized Photograph Ranking and
                                  Selection System Considering Positive
                                  and Negative User Feedback . . . . . . . 36:1--36:??
                   Song Tan and   
              Yu-Gang Jiang and   
                  Chong-Wah Ngo   Placing Videos on a Semantic Hierarchy
                                  for Search Result Navigation . . . . . . 37:1--37:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 11, Number 1, August, 2014

                 Ralf Steinmetz   Editorial Note . . . . . . . . . . . . . 1:1--1:??
               Yong-Jin Liu and   
                 Cui-Xia Ma and   
                 Qiufang Fu and   
                 Xiaolan Fu and   
             Sheng-Feng Qin and   
                     Lexing Xie   A Sketch-Based Approach for Interactive
                                  Organization of Video Clips  . . . . . . 2:1--2:??
               Junshi Huang and   
                     Si Liu and   
              Junliang Xing and   
                    Tao Mei and   
                  Shuicheng Yan   Circle & Search: Attribute-Aware Shoe
                                  Retrieval  . . . . . . . . . . . . . . . 3:1--3:??
              Genliang Guan and   
               Zhiyong Wang and   
                Shaohui Mei and   
                    Max Ott and   
                  Mingyi He and   
               David Dagan Feng   A Top-Down Approach for Video
                                  Summarization  . . . . . . . . . . . . . 4:1--4:??
           Richard W. Pazzi and   
             Azzedine Boukerche   PROPANE: a Progressive Panorama
                                  Streaming Protocol to Support
                                  Interactive $3$D Virtual Environment
                                  Exploration on Graphics-Constrained
                                  Devices  . . . . . . . . . . . . . . . . 5:1--5:??
               Xiangyu Wang and   
                   Yong Rui and   
              Mohan Kankanhalli   Up-Fusion: an Evolving Multimedia Fusion
                                  Method . . . . . . . . . . . . . . . . . 6:1--6:??
                 Xinxi Wang and   
                    Yi Wang and   
                  David Hsu and   
                        Ye Wang   Exploration in Interactive Personalized
                                  Music Recommendation: a Reinforcement
                                  Learning Approach  . . . . . . . . . . . 7:1--7:??
               Harish Katti and   
      Anoop Kolar Rajagopal and   
          Mohan Kankanhalli and   
          Ramakrishnan Kalpathi   Online Estimation of Evolving Human
                                  Visual Interest  . . . . . . . . . . . . 8:1--8:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 11, Number 1s, September, 2014

          Gheorghita Ghinea and   
         Christian Timmerer and   
                  Weisi Lin and   
               Stephen Gulliver   Introduction to Special Issue on
                                  Multiple Sensorial (MulSeMedia)
                                  Multimodal Media: Advances and
                                  Applications . . . . . . . . . . . . . . 9:1--9:??
                  Zhihan Lv and   
              Alaa Halawani and   
            Shengzhong Feng and   
                   Haibo Li and   
        Shafiq Ur Réhman   Multimodal Hand and Foot Gesture
                                  Interaction for Handheld Devices . . . . 10:1--10:??
               Manoj Prasad and   
              Murat Russell and   
               Tracy A. Hammond   Designing Vibrotactile Codes to
                                  Communicate Verb Phrases . . . . . . . . 11:1--11:??
               Niall Murray and   
                  Brian Lee and   
              Yuansong Qiao and   
           Gabriel-Miro Muntean   Multiple-Scent Enhanced Multimedia
                                  Synchronization  . . . . . . . . . . . . 12:1--12:??
               Eleni Kroupi and   
             Ashkan Yazdani and   
            Jean-Marc Vesin and   
               Touradj Ebrahimi   EEG Correlates of Pleasant and
                                  Unpleasant Odor Perception . . . . . . . 13:1--13:??
            Benjamin Rainer and   
             Christian Timmerer   A Generic Utility Model Representing the
                                  Quality of Sensory Experience  . . . . . 14:1--14:??
               Zhenhui Yuan and   
             Shengyang Chen and   
          Gheorghita Ghinea and   
           Gabriel-Miro Muntean   User Quality of Experience of Mulsemedia
                                  Applications . . . . . . . . . . . . . . 15:1--15:??
      Francisco Pedro Luque and   
               Iris Galloso and   
             Claudio Feijoo and   
Carlos Alberto Martín and   
             Guillermo Cisneros   Integration of Multisensorial Stimuli
                                  and Multimodal Interaction in a Hybrid
                                  $3$DTV System  . . . . . . . . . . . . . 16:1--16:??
          Gheorghita Ghinea and   
         Christian Timmerer and   
                  Weisi Lin and   
            Stephen R. Gulliver   Mulsemedia: State of the Art,
                                  Perspectives, and Challenges . . . . . . 17:1--17:??
              Zheng-Jun Zha and   
                  Lei Zhang and   
   Max Mühlhäuser and   
                Alan F. Smeaton   Introduction to the Special Issue Best
                                  Papers of ACM Multimedia 2013  . . . . . 18:1--18:??
                  Quan Fang and   
                 Jitao Sang and   
                  Changsheng Xu   Discovering Geo-Informative Attributes
                                  for Location Recognition and Exploration 19:1--19:??
                  Luoqi Liu and   
              Junliang Xing and   
                     Si Liu and   
                     Hui Xu and   
                    Xi Zhou and   
                  Shuicheng Yan   ``Wow! You Are So Beautiful Today!'' . . 20:1--20:??
              Hanwang Zhang and   
              Zheng-Jun Zha and   
                  Yang Yang and   
              Shuicheng Yan and   
                    Yue Gao and   
                  Tat-Seng Chua   Attribute-Augmented Semantic Hierarchy:
                                  Towards a Unified Framework for
                                  Content-Based Image Retrieval  . . . . . 21:1--21:??
                   Xin Zhao and   
                     Xue Li and   
                Chaoyi Pang and   
              Quan Z. Sheng and   
                   Sen Wang and   
                         Mao Ye   Structured Streaming Skeleton --- A New
                                  Feature for Online Human Gesture
                                  Recognition  . . . . . . . . . . . . . . 22:1--22:??
            Bogdan Carbunar and   
            Rahul Potharaju and   
             Michael Pearce and   
        Venugopal Vasudevan and   
                Michael Needham   Errata for: A Framework for Network
                                  Aware Caching for Video on Demand
                                  Systems  . . . . . . . . . . . . . . . . 23:1--23:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 11, Number 2, December, 2014

                 Ying Zhang and   
               Luming Zhang and   
               Roger Zimmermann   Aesthetics-Guided Summarization from
                                  Multiple User Generated Videos . . . . . 24:1--24:??
             Kiana Calagari and   
     Mohammad Reza Pakravan and   
      Shervin Shirmohammadi and   
                Mohamed Hefeeda   ALP: Adaptive Loss Protection Scheme
                                  with Constant Overhead for Interactive
                                  Video Applications . . . . . . . . . . . 25:1--25:??
                 Dongni Ren and   
                 Yisheng Xu and   
                S.-H. Gary Chan   Beyond 1Mbps Global Overlay Live
                                  Streaming: The Case of Proxy Helpers . . 26:1--26:??
            Shengsheng Qian and   
              Tianzhu Zhang and   
              Changsheng Xu and   
              M. Shamim Hossain   Social Event Classification via Boosted
                                  Multimodal Supervised Latent Dirichlet
                                  Allocation . . . . . . . . . . . . . . . 27:1--27:??
                     Jun Ye and   
                    Kien A. Hua   Octree-Based $3$D Logic and Computation
                                  of Spatial Relationships in Live Video
                                  Query Processing . . . . . . . . . . . . 28:1--28:??
                 Yifang Yin and   
                Zhijie Shen and   
               Luming Zhang and   
               Roger Zimmermann   Spatial-Temporal Tag Mining for
                                  Automatic Geospatial Video Annotation    29:1--29:??
               Chih-Wei Lin and   
              Kuan-Wen Chen and   
              Shen-Chi Chen and   
              Cheng-Wu Chen and   
                   Yi-Ping Hung   Large-Area, Multilayered, and
                                  High-Resolution Visual Monitoring Using
                                  a Dual-Camera System . . . . . . . . . . 30:1--30:??
               Zhengyu Deng and   
                   Ming Yan and   
                 Jitao Sang and   
                  Changsheng Xu   Twitter is Faster: Personalized
                                  Time-Aware Video Recommendation from
                                  Twitter to YouTube . . . . . . . . . . . 31:1--31:??
                 Yongtao Hu and   
                  Jan Kautz and   
                  Yizhou Yu and   
                   Wenping Wang   Speaker-Following Video Subtitles  . . . 32:1--32:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 11, Number 2s, February, 2015

               Kuan-Ta Chen and   
              Songqing Chen and   
                  Wei Tsang Ooi   Introduction to the Special Issue on
                                  MMSys 2014 and NOSSDAV 2014  . . . . . . 41:1--41:??
            Philipp Schaber and   
               Stephan Kopf and   
                Sina Wetzel and   
              Tyler Ballast and   
            Christoph Wesch and   
            Wolfgang Effelsberg   CamMark: Analyzing, Modeling, and
                                  Simulating Artifacts in Camcorder Copies 42:1--42:??
                 Laura Toni and   
       Ramon Aparicio-Pardo and   
               Karine Pires and   
              Gwendal Simon and   
              Alberto Blanc and   
                Pascal Frossard   Optimal Selection of Adaptive Streaming
                                  Representations  . . . . . . . . . . . . 43:1--43:??
                 Liang Chen and   
                Yipeng Zhou and   
                  Dah Ming Chiu   Analysis and Detection of Fake Views in
                                  Online Video Services  . . . . . . . . . 44:1--44:??
               Minseok Song and   
                Yeongju Lee and   
                    Jinhan Park   Scheduling a Video Transcoding Server to
                                  Save Energy  . . . . . . . . . . . . . . 45:1--45:??
    Mohsen Jamali Langroodi and   
              Joseph Peters and   
          Shervin Shirmohammadi   Decoder-Complexity-Aware Encoding of
                                  Motion Compensation for Multiple
                                  Heterogeneous Receivers  . . . . . . . . 46:1--46:??
               Shannon Chen and   
               Zhenhuan Gao and   
            Klara Nahrstedt and   
                 Indranil Gupta   $3$DTI Amphitheater: Towards $3$DTI
                                  Broadcasting . . . . . . . . . . . . . . 47:1--47:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 11, Number 3, January, 2015

                    Ke Chen and   
                 Zhong Zhou and   
                         Wei Wu   Progressive Motion Vector Clustering for
                                  Motion Estimation and Auxiliary Tracking 33:1--33:??
                Liquan Shen and   
                    Ping An and   
             Zhaoyang Zhang and   
                Qianqian Hu and   
                Zhengchuan Chen   A $3$D--HEVC Fast Mode Decision
                                  Algorithm for Real-Time Applications . . 34:1--34:??
              Xiaoshan Yang and   
              Tianzhu Zhang and   
              Changsheng Xu and   
                Ming-Hsuan Yang   Boosted Multifeature Learning for
                                  Cross-Domain Transfer  . . . . . . . . . 35:1--35:??
                     Pei-Yu Lin   Double Verification Secret Sharing
                                  Mechanism Based on Adaptive Pixel Pair
                                  Matching . . . . . . . . . . . . . . . . 36:1--36:??
                Shuang Wang and   
                 Shuqiang Jiang   INSTRE: a New Benchmark for
                                  Instance-Level Object Retrieval and
                                  Recognition  . . . . . . . . . . . . . . 37:1--37:??
              Ankita Lathey and   
               Pradeep K. Atrey   Image Enhancement in Encrypted Domain
                                  over Cloud . . . . . . . . . . . . . . . 38:1--38:??
                 Yifang Yin and   
                Beomjoo Seo and   
               Roger Zimmermann   Content vs. Context: Visual and
                                  Geographic Information Use in Video
                                  Landmark Retrieval . . . . . . . . . . . 39:1--39:??
             Hong-Ying Yang and   
            Xiang-Yang Wang and   
                Pan-Pan Niu and   
                   Ai-Long Wang   Robust Color Image Watermarking Using
                                  Geometric Invariant Quaternion Polar
                                  Harmonic Transform . . . . . . . . . . . 40:1--40:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 11, Number 4, April, 2015

     Dilip Kumar Krishnappa and   
               Michael Zink and   
            Carsten Griwodz and   
            Pål Halvorsen   Cache-Centric Video Recommendation: an
                                  Approach to Improve the Efficiency of
                                  YouTube Caches . . . . . . . . . . . . . 48:1--48:??
                   Yu Zhang and   
              James Z. Wang and   
                         Jia Li   Parallel Massive Clustering of Discrete
                                  Distributions  . . . . . . . . . . . . . 49:1--49:??
                Eilwoo Baik and   
                 Amit Pande and   
              Prasant Mohapatra   Efficient MAC for Real-Time Video
                                  Streaming over Wireless LAN  . . . . . . 50:1--50:??
           Stefanos Antaris and   
           Dimitrios Rafailidis   Similarity Search over the Cloud Based
                                  on Image Descriptors' Dimensions Value
                                  Cardinalities  . . . . . . . . . . . . . 51:1--51:??
                Yin-Tzu Lin and   
                 I-Ting Liu and   
       Jyh-Shing Roger Jang and   
                     Ja-Ling Wu   Audio Musical Dice Game: a
                                  User-Preference-Aware Medley Generating
                                  System . . . . . . . . . . . . . . . . . 52:1--52:??
                Bo-Hao Chen and   
                Shih-Chia Huang   An Advanced Visibility Restoration
                                  Algorithm for Single Hazy Images . . . . 53:1--53:??
               Bing-Kun Bao and   
              Changsheng Xu and   
                Weiqing Min and   
        Mohammod Shamim Hossain   Cross-Platform Emerging Topic Detection
                                  and Elaboration from Multimedia Streams  54:1--54:??
                    Yang Li and   
             Azzedine Boukerche   QuGu: a Quality Guaranteed Video
                                  Dissemination Protocol Over Urban
                                  Vehicular Ad Hoc Networks  . . . . . . . 55:1--55:??
     Vamsidhar Reddy Gaddam and   
                Ragnhild Eg and   
            Ragnar Langseth and   
            Carsten Griwodz and   
            Pål Halvorsen   The Cameraman Operating My Virtual
                                  Camera is Artificial: Can the Machine Be
                                  as Good as a Human?  . . . . . . . . . . 56:1--56:??
           Prabhu Natarajan and   
           Pradeep K. Atrey and   
              Mohan Kankanhalli   Multi-Camera Coordination and Control in
                                  Surveillance Systems: a Survey . . . . . 57:1--57:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 12, Number 1, August, 2015

          Shingchern D. You and   
                      Yi-Han Pu   Using Paired Distances of Signal Peaks
                                  in Stereo Channels as Fingerprints for
                                  Copy Identification  . . . . . . . . . . 1:1--1:??
             Ali El Essaili and   
                 Zibin Wang and   
         Eckehard Steinbach and   
                     Liang Zhou   QoE-Based Cross-Layer Optimization for
                                  Uplink Video Transmission  . . . . . . . 2:1--2:??
                  Li-Jia Li and   
            David A. Shamma and   
              Xiangnan Kong and   
             Sina Jafarpour and   
            Roelof Van Zwol and   
                   Xuanhui Wang   CelebrityNet: a Social Network
                                  Constructed from Large-Scale Online
                                  Celebrity Images . . . . . . . . . . . . 3:1--3:??
                   Bo Zhang and   
               Nicola Conci and   
      Francesco G. B. De Natale   Segmentation of Discriminative Patches
                                  in Human Activity Video  . . . . . . . . 4:1--4:??
                   Hui Wang and   
             Mun Choon Chan and   
                  Wei Tsang Ooi   Wireless Multicast for Zoomable Video
                                  Streaming  . . . . . . . . . . . . . . . 5:1--5:??
              Simone Bianco and   
               Gianluigi Ciocca   User Preferences Modeling and Learning
                                  for Pleasing Photo Collage Generation    6:1--6:??
                      Bo Fu and   
               Dirk Staehle and   
            Gerald Kunzmann and   
         Eckehard Steinbach and   
              Wolfgang Kellerer   QoE-Based SVC Layer Dropping in LTE
                                  Networks Using Content-Aware Layer
                                  Priorities . . . . . . . . . . . . . . . 7:1--7:??
                  Siqi Shen and   
                Shun-Yun Hu and   
            Alexandru Iosup and   
                     Dick Epema   Area of Simulation: Mechanism and
                                  Architecture for Multi-Avatar Virtual
                                  Environments . . . . . . . . . . . . . . 8:1--8:??
                Suk Kyu Lee and   
                Seungho Yoo and   
              Jongtack Jung and   
               Hwangnam Kim and   
                    Jihoon Ryoo   Link-Aware Reconfigurable Point-to-Point
                                  Video Streaming for Mobile Devices . . . 9:1--9:??
                 Ming-Ju Wu and   
              Jyh-Shing R. Jang   Combining Acoustic and Multilevel Visual
                                  Features for Music Genre Classification  10:1--10:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 12, Number 1s, October, 2015

                  James She and   
                 Alvin Chin and   
                   Feng Xia and   
                  Jon Crowcroft   Introduction to: Special Issue on
                                  Smartphone-Based Interactive
                                  Technologies, Systems, and Applications  11:1--11:??
                   Biao Zhu and   
              Hongxin Zhang and   
                   Wei Chen and   
                   Feng Xia and   
               Ross Maciejewski   ShotVis: Smartphone-Based Visualization
                                  of OCR Information from Images . . . . . 12:1--12:??
Seshadri Padmanabha Venkatagiri and   
             Mun Choon Chan and   
                  Wei Tsang Ooi   Automated Link Generation for
                                  Sensor-Enriched Smartphone Images  . . . 13:1--13:??
                  Chung-Hua Chu   Visual Comfort for Stereoscopic $3$D by
                                  Using Motion Sensors on $3$D Mobile
                                  Devices  . . . . . . . . . . . . . . . . 14:1--14:??
                 Kaikai Liu and   
                     Xiaolin Li   Enabling Context-Aware Indoor Augmented
                                  Reality via Smartphone Sensing and
                                  Vision Tracking  . . . . . . . . . . . . 15:1--15:??
                  Junho Ahn and   
           James Williamson and   
              Mike Gartrell and   
                Richard Han and   
                     Qin Lv and   
               Shivakant Mishra   Supporting Healthy Grocery Shopping via
                                  Mobile Augmented Reality . . . . . . . . 16:1--16:??
                  Sixuan Ma and   
                      Zheng Yan   PSNController: an Unwanted Content
                                  Control System in Pervasive Social
                                  Networking Based on Trust Management . . 17:1--17:??
                    Fei Hao and   
               Mingjie Jiao and   
                 Geyong Min and   
               Laurence T. Yang   Launching an Efficient Participatory
                                  Sensing Campaign: a Smart Mobile
                                  Device-Based Approach  . . . . . . . . . 18:1--18:??
         Yogesh Singh Rawat and   
           Mohan S. Kankanhalli   Context-Aware Photography Learning for
                                  Smart Mobile Devices . . . . . . . . . . 19:1--19:??
             Sergio Canazza and   
             Carlo Fantozzi and   
                Niccol`o Pretto   Accessing Tape Music Documents on Mobile
                                  Devices  . . . . . . . . . . . . . . . . 20:1--20:??
                  Xiping Hu and   
                 Junqi Deng and   
                  Jidi Zhao and   
                  Wenyan Hu and   
           Edith C.-H. Ngai and   
                Renfei Wang and   
                Johnny Shen and   
                  Min Liang and   
                  Xitong Li and   
         Victor C. M. Leung and   
                  Yu-Kwong Kwok   SAfeDJ: a Crowd-Cloud Codesign Approach
                                  to Situation-Aware Music Delivery for
                                  Drivers  . . . . . . . . . . . . . . . . 21:1--21:??
           Matthias Baldauf and   
        Peter Fröhlich and   
           Florence Adegeye and   
                  Stefan Suette   Investigating On-Screen Gamepad Designs
                                  for Smartphone-Controlled Video Games    22:1--22:??
            Diana S. Bental and   
         Eliza Papadopoulou and   
         Nicholas K. Taylor and   
         M. Howard Williams and   
         Fraser R. Blackmun and   
           Idris S. Ibrahim and   
                Mei Yii Lim and   
         Ioannis Mimtsoudis and   
            Stuart W. Whyte and   
                  Edel Jennings   Smartening Up the Student Learning
                                  Experience with Ubiquitous Media . . . . 23:1--23:??
                Hayley Hung and   
                George Toderici   Introduction to: Special Issue on
                                  Extended Best Papers from ACM Multimedia
                                  2014 . . . . . . . . . . . . . . . . . . 24:1--24:??
                  Yelin Kim and   
            Emily Mower Provost   Emotion Recognition During Speech Using
                                  Dynamics of Multiple Regions of the Face 25:1--25:??
             Fangxiang Feng and   
               Xiaojie Wang and   
                  Ruifan Li and   
                    Ibrar Ahmad   Correspondence Autoencoders for
                                  Cross-Modal Retrieval  . . . . . . . . . 26:1--26:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 12, Number 2, March, 2016

               Longyu Zhang and   
                Haiwei Dong and   
         Abdulmotaleb El Saddik   From $3$D Sensing to Printing: a Survey  27:1--27:??
         Stefano Petrangeli and   
              Jeroen Famaey and   
               Maxim Claeys and   
        Steven Latré and   
                 Filip De Turck   QoE-Driven Rate Adaptation Heuristic for
                                  Fair Adaptive Video Streaming  . . . . . 28:1--28:??
                Shaoyan Sun and   
               Wengang Zhou and   
                    Qi Tian and   
                    Houqiang Li   Scalable Object Retrieval with Compact
                                  Image Representation from Generic Object
                                  Regions  . . . . . . . . . . . . . . . . 29:1--29:??
            Mansoor Ebrahim and   
                 Wai Chong Chia   Multiview Image Block Compressive
                                  Sensing with Joint Multiphase Decoding
                                  for Visual Sensor Network  . . . . . . . 30:1--30:??
                   Lei Pang and   
                  Chong-Wah Ngo   Opinion Question Answering by Sentiment
                                  Clip Localization  . . . . . . . . . . . 31:1--31:??
   Vasileios Papapanagiotou and   
              Christos Diou and   
          Anastasios Delopoulos   Improving Concept-Based Image Retrieval
                                  with Training Weights Computed from Tags 32:1--32:??
                Xuyong Yang and   
                    Tao Mei and   
               Ying-Qing Xu and   
                   Yong Rui and   
                     Shipeng Li   Automatic Generation of Visual-Textual
                                  Presentation Layout  . . . . . . . . . . 33:1--33:??
                 Xuelong Li and   
                 Mulin Chen and   
                        Qi Wang   Measuring Collectiveness via Refined
                                  Topological Similarity . . . . . . . . . 34:1--34:??
               Gareth Tyson and   
             Yehia Elkhatib and   
            Nishanth Sastry and   
                    Steve Uhlig   Measurements and Analysis of a Major
                                  Adult Video Portal . . . . . . . . . . . 35:1--35:??
                Bart Thomee and   
           Ioannis Arapakis and   
                David A. Shamma   Finding Social Points of Interest from
                                  Georeferenced and Oriented Online
                                  Photographs  . . . . . . . . . . . . . . 36:1--36:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 12, Number 3, June, 2016

              Alberto del Bimbo   From the Past Editor-In-Chief  . . . . . 37:1--37:??
               Luming Zhang and   
                 Xuelong Li and   
                Liqiang Nie and   
                    Yan Yan and   
               Roger Zimmermann   Semantic Photo Retargeting Under Noisy
                                  Image Labels . . . . . . . . . . . . . . 37:1--37:??
                     Liang Zhou   Mobile Device-to-Device Video
                                  Distribution: Theory and Application . . 38:1--38:??
               Hareesh Ravi and   
          A. V. Subramanyam and   
                  Sabu Emmanuel   Forensic Analysis of Linear and
                                  Nonlinear Image Filtering Using
                                  Quantization Noise . . . . . . . . . . . 39:1--39:??
                 Xianjun Hu and   
              Weiming Zhang and   
                      Ke Li and   
                Honggang Hu and   
                     Nenghai Yu   Secure Nonlocal Denoising in Outsourced
                                  Images . . . . . . . . . . . . . . . . . 40:1--40:??
             Kiana Calagari and   
              Tarek Elgamal and   
                Khaled Diab and   
          Krzysztof Templin and   
                Piotr Didyk and   
           Wojciech Matusik and   
                Mohamed Hefeeda   Depth Personalization and Streaming of
                                  Stereoscopic Sports Videos . . . . . . . 41:1--41:??
                   Qiong Wu and   
               Pierre Boulanger   Enhanced Reweighted MRFs for Efficient
                                  Fashion Image Parsing  . . . . . . . . . 42:1--42:??
                     Yao Hu and   
                  Chen Zhao and   
                   Deng Cai and   
                 Xiaofei He and   
                     Xuelong Li   Atom Decomposition with Adaptive Basis
                                  Selection Strategy for Matrix Completion 43:1--43:??
                   Dan Miao and   
                Jingjing Fu and   
                     Yan Lu and   
                 Shipeng Li and   
                 Chang Wen Chen   A High-Fidelity and
                                  Low-Interaction-Delay Screen Sharing
                                  System . . . . . . . . . . . . . . . . . 44:1--44:??
                Stefan Wilk and   
               Stephan Kopf and   
            Wolfgang Effelsberg   Collaborative Annotation of Videos
                                  Relying on Weak Consistency  . . . . . . 45:1--45:??
         Maria Luisa Merani and   
                   Laura Natali   Adaptive Streaming in P2P Live Video
                                  Systems: a Distributed Rate Control
                                  Approach . . . . . . . . . . . . . . . . 46:1--46:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 12, Number 4, August, 2016

               Adele Lu Jia and   
                  Siqi Shen and   
           Dick H. J. Epema and   
                Alexandru Iosup   When Game Becomes Life: The Creators and
                                  Spectators of Online Game Replays and
                                  Live Streaming . . . . . . . . . . . . . 47:1--47:??
              Shuvendu Rana and   
                     Arijit Sur   Depth-Based View-Invariant Blind $3$D
                                  Image Watermarking . . . . . . . . . . . 48:1--48:??
          Bruno M. C. Silva and   
    Joel J. P. C. Rodrigues and   
Neeraj Kumar Mario L. Proença, Jr. and   
                   Guangjie Han   MobiCoop: an Incentive-Based Cooperation
                                  Solution for Mobile Applications . . . . 49:1--49:??
          Shivendra Shivani and   
                Suneeta Agarwal   Progressive Visual Cryptography with
                                  Unexpanded Meaningful Shares . . . . . . 50:1--50:??
       Oluwakemi A. Ademoye and   
               Niall Murray and   
       Gabriel-Miro Muntean and   
              Gheorghita Ghinea   Audio Masking Effect on Inter-Component
                                  Skews in Olfaction-Enhanced Multimedia
                                  Presentations  . . . . . . . . . . . . . 51:1--51:??
            Sheng-Hua Zhong and   
                    Yan Liu and   
                    Kien A. Hua   Field Effect Deep Networks for Image
                                  Recognition with Incomplete Data . . . . 52:1--52:??
                   Ming Yan and   
                 Jitao Sang and   
              Changsheng Xu and   
              M. Shamim Hossain   A Unified Video Recommendation by
                                  Cross-Network User Modeling  . . . . . . 53:1--53:??
               Yijing Jiang and   
                Shanyu Tang and   
               Liping Zhang and   
               Muzhou Xiong and   
                    Yau Jim Yip   Covert Voice over Internet Protocol
                                  Communications with Packet Loss Based on
                                  Fractal Interpolation  . . . . . . . . . 54:1--54:??
              Xiaoshan Yang and   
              Tianzhu Zhang and   
                  Changsheng Xu   Semantic Feature Mining for Video Event
                                  Understanding  . . . . . . . . . . . . . 55:1--55:??
              Tommy Nilsson and   
               Carl Hogsden and   
             Charith Perera and   
               Saeed Aghaee and   
           David J. Scruton and   
               Andreas Lund and   
              Alan F. Blackwell   Applying Seamful Design in
                                  Location-Based Mobile Museum
                                  Applications . . . . . . . . . . . . . . 56:1--56:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 13, Number 1, January, 2017

                      Zheng Yan   Learning from Collective Intelligence:
                                  Feature Learning Using Social Images and
                                  Tags . . . . . . . . . . . . . . . . . . 1:1--1:??
                Ming Cheung and   
                  James She and   
                Alvin Junus and   
                        Lei Cao   Prediction of Virality Timing Using
                                  Cascades in Social Media . . . . . . . . 2:1--2:??
               Chih-Yi Chiu and   
              Yu-Cyuan Liou and   
           Amorntip Prayoonwong   Approximate Asymmetric Search for Binary
                                  Embedding Codes  . . . . . . . . . . . . 3:1--3:??
          Konstantin Miller and   
      Abdel-Karim Al-Tamimi and   
                    Adam Wolisz   QoE-Based Low-Delay Live Streaming Using
                                  Throughput Predictions . . . . . . . . . 4:1--4:??
         Nimesha Ranasinghe and   
               Ellen Yi-Luen Do   Digital Lollipop: Studying Electrical
                                  Stimulation on the Human Tongue to
                                  Simulate Taste Sensations  . . . . . . . 5:1--5:??
               Xiongkuo Min and   
              Guangtao Zhai and   
                      Ke Gu and   
                  Xiaokang Yang   Fixation Prediction through Multimodal
                                  Analysis . . . . . . . . . . . . . . . . 6:1--6:??
                 Wei-Ta Chu and   
                  Chih-Hao Chiu   Predicting Occupation from Images by
                                  Combining Face and Body Context
                                  Information  . . . . . . . . . . . . . . 7:1--7:??
                  Jingxi Xu and   
                Benjamin W. Wah   Consistent Synchronization of Action
                                  Order with Least Noticeable Delays in
                                  Fast-Paced Multiplayer Online Games  . . 8:1--8:??
            Rodrigo Schramm and   
      Helena De Souza Nunes and   
     Cláudio Rosito Jung   Audiovisual Tool for Solf\`ege
                                  Assessment . . . . . . . . . . . . . . . 9:1--9:??
                  Haojun Wu and   
                  Yong Wang and   
                     Jiwu Huang   Identification of Reconstructed Speech   10:1--10:??
                 Sibaji Gaj and   
            Aditya Kanetkar and   
                 Arijit Sur and   
              Prabin Kumar Bora   Drift-Compensated Robust Watermarking
                                  Algorithm for H.265/HEVC Video Stream    11:1--11:??
               Tanima Dutta and   
             Hari Prabhat Gupta   An Efficient Framework for Compressed
                                  Domain Watermarking in $P$ Frames of
                                  High-Efficiency Video Coding
                                  (HEVC)-Encoded Video . . . . . . . . . . 12:1--12:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 13, Number 2, May, 2017

           Giuseppe Lisanti and   
             Svebor Karaman and   
                    Iacopo Masi   Multichannel-Kernel Canonical
                                  Correlation Analysis for Cross-View
                                  Person Reidentification  . . . . . . . . 13:1--13:??
                     Jun Ye and   
                     Hao Hu and   
                 Guo-Jun Qi and   
                    Kien A. Hua   A Temporal Order Modeling Approach to
                                  Human Action Recognition from Multimodal
                                  Sensor Data  . . . . . . . . . . . . . . 14:1--14:??
                 Shuai Wang and   
                  Yang Cong and   
                 Huijie Fan and   
                 Baojie Fan and   
               Lianqing Liu and   
              Yunsheng Yang and   
               Yandong Tang and   
                Huaici Zhao and   
                      Haibin Yu   Multi-Class Latent Concept Pooling for
                                  Computer-Aided Endoscopy Diagnosis . . . 15:1--15:??
            Edip Demirbilek and   
   Jean-Charles Grégoire   Machine Learning-Based Parametric
                                  Audiovisual Quality Prediction Models
                                  for Real-Time Communications . . . . . . 16:1--16:??
             Vineet Gokhale and   
          Jayakrishnan Nair and   
             Subhasis Chaudhuri   Congestion Control for Network-Aware
                                  Telehaptic Communication . . . . . . . . 17:1--17:??
             Ashkan Sobhani and   
         Abdulsalam Yassine and   
          Shervin Shirmohammadi   A Video Bitrate Adaptation and
                                  Prediction Mechanism for HTTP Adaptive
                                  Streaming  . . . . . . . . . . . . . . . 18:1--18:??
             Jason M. Grant and   
               Patrick J. Flynn   Crowd Scene Understanding from Video: a
                                  Survey . . . . . . . . . . . . . . . . . 19:1--19:??
            Fairouz Hussein and   
               Massimo Piccardi   V-JAUNE: a Framework for Joint Action
                                  Recognition and Video Summarization  . . 20:1--20:??
              Burak Cizmeci and   
                    Xiao Xu and   
            Rahul Chaudhari and   
        Christoph Bachhuber and   
                Nicolas Alt and   
             Eckehard Steinbach   A Multiplexing Scheme for Multimodal
                                  Teleoperation  . . . . . . . . . . . . . 21:1--21:??
                    Zhuo Su and   
                   Kun Zeng and   
                  Hanhui Li and   
                    Xiaonan Luo   A Dual-Domain Perceptual Framework for
                                  Generating Visual Inconspicuous
                                  Counterparts . . . . . . . . . . . . . . 22:1--22:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 13, Number 3, August, 2017

             Priyanka Singh and   
      Balasubramanian Raman and   
            Nishant Agarwal and   
               Pradeep K. Atrey   Secure Cloud-Based Image Tampering
                                  Detection and Localization Using POB
                                  Number System  . . . . . . . . . . . . . 23:1--23:??
    Ishwarya Thirunarayanan and   
           Khimya Khetarpal and   
             Sanjeev Koppal and   
            Olivier Le Meur and   
                  John Shea and   
                     Eakta Jain   Creating Segments and Effects on Comics
                                  by Clustering Gaze Data  . . . . . . . . 24:1--24:??
           Michael E. Houle and   
                   Xiguo Ma and   
               Vincent Oria and   
                     Jichao Sun   Query Expansion for Content-Based
                                  Similarity Search Using Local and Global
                                  Features . . . . . . . . . . . . . . . . 25:1--25:??
            Michael Riegler and   
       Konstantin Pogorelov and   
     Sigrun Losada Eskeland and   
       Peter Thelin Schmidt and   
              Zeno Albisser and   
               Dag Johansen and   
            Carsten Griwodz and   
        Pål Halvorsen and   
                Thomas De Lange   From Annotation to Computer-Aided
                                  Diagnosis: Detailed Evaluation of a
                                  Medical Multimedia System  . . . . . . . 26:1--26:??
                   Xun Yang and   
                  Meng Wang and   
               Richang Hong and   
                    Qi Tian and   
                       Yong Rui   Enhancing Person Re-identification in a
                                  Self-Trained Subspace  . . . . . . . . . 27:1--27:??
               Shih-Yao Lin and   
                 Yen-Yu Lin and   
              Chu-Song Chen and   
                   Yi-Ping Hung   Recognizing Human Actions with Outlier
                                  Frames by Observation Filtering and
                                  Completion . . . . . . . . . . . . . . . 28:1--28:??
        Georgios Karafotias and   
            Akiko Teranishi and   
            Georgios Korres and   
          Friederike Eyssel and   
              Scandar Copti and   
                    Mohamad Eid   Intensifying Emotional Reactions via
                                  Tactile Gestures in Immersive Films  . . 29:1--29:??
                Ming Cheung and   
                      James She   An Analytic System for User Gender
                                  Identification through User Shared
                                  Images . . . . . . . . . . . . . . . . . 30:1--30:??
      Herman A. Engelbrecht and   
                John S. Gilmore   Pithos: Distributed Storage for Massive
                                  Multi-User Virtual Environments  . . . . 31:1--31:??
                  Jun Zhang and   
                  Meng Wang and   
                  Liang Lin and   
                   Xun Yang and   
                    Jun Gao and   
                       Yong Rui   Saliency Detection on Light Field: a
                                  Multi-Cue Approach . . . . . . . . . . . 32:1--32:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 13, Number 3s, August, 2017

                  Kaoru Ota and   
               Minh Son Dao and   
          Vasileios Mezaris and   
      Francesco G. B. De Natale   Introduction to Special Issue on Deep
                                  Learning for Mobile Multimedia . . . . . 33:1--33:??
                  Kaoru Ota and   
               Minh Son Dao and   
          Vasileios Mezaris and   
      Francesco G. B. De Natale   Deep Learning for Mobile Multimedia: a
                                  Survey . . . . . . . . . . . . . . . . . 34:1--34:??
          Lorenzo Seidenari and   
            Claudio Baecchi and   
           Tiberio Uricchio and   
           Andrea Ferracani and   
              Marco Bertini and   
              Alberto Del Bimbo   Deep Artwork Detection and Retrieval for
                                  Automatic Context-Aware Audio Guides . . 35:1--35:??
         Parisa Pouladzadeh and   
          Shervin Shirmohammadi   Mobile Multi-Food Recognition Using Deep
                                  Learning . . . . . . . . . . . . . . . . 36:1--36:??
            Sailesh Bharati and   
       Hassan Aboubakr Omar and   
                  Weihua Zhuang   Enhancing Transmission Collision
                                  Detection for Distributed TDMA in
                                  Vehicular Networks . . . . . . . . . . . 37:1--37:??
      Florian Vandecasteele and   
        Karel Vandenbroucke and   
          Dimitri Schuurman and   
               Steven Verstockt   Spott: On-the-Spot e-Commerce for
                                  Television Using Deep Learning-Based
                                  Video Analysis Techniques  . . . . . . . 38:1--38:??
             Qingchen Zhang and   
           Laurence T. Yang and   
                Xingang Liu and   
                Zhikui Chen and   
                        Peng Li   A Tucker Deep Computation Model for
                                  Mobile Multimedia Feature Learning . . . 39:1--39:??
         Christian Timmerer and   
                   Ali C. Begen   Best Papers of the 2016 ACM Multimedia
                                  Systems (MMSys) Conference and Workshop
                                  on Network and Operating System Support
                                  for Digital Audio and Video (NOSSDAV)
                                  2016 . . . . . . . . . . . . . . . . . . 40:1--40:??
           Stefano D'aronco and   
                Sergio Mena and   
                Pascal Frossard   Distributed Rate Allocation in
                                  Switch-Based Multiparty
                                  Videoconferencing System . . . . . . . . 41:1--41:??
            Giuseppe Cofano and   
              Luca De Cicco and   
              Thomas Zinner and   
            Anh Nguyen-Ngoc and   
             Phuoc Tran-Gia and   
                Saverio Mascolo   Design and Performance Evaluation of
                                  Network-assisted Control Strategies for
                                  HTTP Adaptive Streaming  . . . . . . . . 42:1--42:??
           Piotr Wisniewski and   
       Jordi Mongay Batalla and   
              Andrzej Beben and   
              Piotr Krawiec and   
             Andrzej Chydzinski   On Optimizing Adaptive Algorithms Based
                                  on Rebuffering Probability . . . . . . . 43:1--43:??
   Jan Willem Kleinrouweler and   
             Sergio Cabrero and   
                    Pablo Cesar   An SDN Architecture for Privacy-Friendly
                                  Network-Assisted DASH  . . . . . . . . . 44:1--44:??
                  Cong Wang and   
             Divyashri Bhat and   
                   Amr Rizk and   
                   Michael Zink   Design and Analysis of QoE-Aware Quality
                                  Adaptation for DASH: a Spectrum-Based
                                  Approach . . . . . . . . . . . . . . . . 45:1--45:??
                 Cong Zhang and   
             Jiangchuan Liu and   
                   Haiyang Wang   Cloud-Assisted Crowdsourced Livecast . . 46:1--46:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 13, Number 4, October, 2017

                   Minh Son Dao   This is the Table of Contents for the
                                  most recent online-only supplemental
                                  issue TOMM 13(3s). Please find this
                                  supplemental issue in the ACM Digital
                                  Library and enjoy reading them!  . . . . 47:1--47:??
              Hong-Bo Zhang and   
               Bineng Zhong and   
                   Qing Lei and   
                Ji-Xiang Du and   
                Jialin Peng and   
             Duansheng Chen and   
                        Xiao Ke   Sparse Representation-Based
                                  Semi-Supervised Regression for People
                                  Counting . . . . . . . . . . . . . . . . 47:1--47:??
              Shahid Akhtar and   
                 Andre Beck and   
                    Ivica Rimac   Caching Online Video: Analysis and
                                  Proposed Algorithm . . . . . . . . . . . 48:1--48:??
       Duc-Tien Dang-Nguyen and   
                 Luca Piras and   
           Giorgio Giacinto and   
               Giulia Boato and   
      Francesco G. B. De Natale   Multimodal Retrieval with
                                  Diversification and Relevance Feedback
                                  for Tourist Attraction Images  . . . . . 49:1--49:??
     Luciana Fujii Pontello and   
        Pedro H. F. Holanda and   
            Bruno Guilherme and   
João Paulo V. Cardoso and   
          Olga Goussevskaia and   
       Ana Paula Couto Da Silva   Mixtape: Using Real-Time User Feedback
                                  to Navigate Large Media Collections  . . 50:1--50:??
          Abukari M. Yakubu and   
          Namunu C. Maddage and   
               Pradeep K. Atrey   Securing Speech Noise Reduction in
                                  Outsourced Environment . . . . . . . . . 51:1--51:??
          Fabrizio Guerrini and   
               Nicola Adami and   
              Sergio Benini and   
           Alberto Piacenza and   
             Julie Porteous and   
               Marc Cavazza and   
              Riccardo Leonardi   Interactive Film Recombination . . . . . 52:1--52:??
             Mingliang Zhou and   
              Yongfei Zhang and   
                      Bo Li and   
                     Xupeng Lin   Complexity Correlation-Based CTU-Level
                                  Rate Control with Direction Selection
                                  for HEVC . . . . . . . . . . . . . . . . 53:1--53:??
          Yousef O. Sharrab and   
                Nabil J. Sarhan   Modeling and Analysis of Power
                                  Consumption in Live Video Streaming
                                  Systems  . . . . . . . . . . . . . . . . 54:1--54:??
                Pai Chet Ng and   
                  James She and   
              Kang Eun Jeon and   
               Matthias Baldauf   When Smart Devices Interact With
                                  Pervasive Screens: a Survey  . . . . . . 55:1--55:??
           Pasi Fränti and   
      Radu Mariescu-Istodor and   
                Lahari Sengupta   O-Mopsi: Mobile Orienteering Game for
                                  Sightseeing, Exercising, and Education   56:1--56:??
           Farouk Messaoudi and   
             Adlen Ksentini and   
              Gwendal Simon and   
                Philippe Bertin   Performance Analysis of Game Engines on
                                  Mobile and Fixed Devices . . . . . . . . 57:1--57:??
                Ming Cheung and   
                Xiaopeng Li and   
                      James She   An Efficient Computation Framework for
                                  Connection Discovery using Shared Images 58:1--58:??
                Xiaopeng Li and   
                Ming Cheung and   
                      James She   A Distributed Streaming Framework for
                                  Connection Discovery Using Shared Videos 59:1--59:??
       Maaike H. T. De Boer and   
                  Yi-Jie Lu and   
                  Hao Zhang and   
             Klamer Schutte and   
              Chong-Wah Ngo and   
                  Wessel Kraaij   Semantic Reasoning in Zero Example Video
                                  Event Retrieval  . . . . . . . . . . . . 60:1--60:??
               Jianting Guo and   
               Peijia Zheng and   
                     Jiwu Huang   An Efficient Motion Detection and
                                  Tracking Scheme for Encrypted
                                  Surveillance Videos  . . . . . . . . . . 61:1--61:??
          Mohammad Motamedi and   
              Philipp Gysel and   
                  Soheil Ghiasi   PLACID: a Platform for FPGA-Based
                                  Accelerator Creation for DCNNs . . . . . 62:1--62:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 14, Number 1, January, 2018

     Oryina Kingsley Akputu and   
             Kah Phooi Seng and   
                  Yunli Lee and   
                    Li-Minn Ang   Emotion Recognition Using Multiple
                                  Kernel Learning toward E-learning
                                  Applications . . . . . . . . . . . . . . 1:1--1:??
                     Kai Li and   
                 Guo-Jun Qi and   
                    Kien A. Hua   Learning Label Preserving Binary Codes
                                  for Multimedia Retrieval: a General
                                  Approach . . . . . . . . . . . . . . . . 2:1--2:??
           Rodrigo Ceballos and   
           Beatrice Ionascu and   
                Wanjoo Park and   
                    Mohamad Eid   Implicit Emotion Communication: EEG
                                  Classification and Haptic Feedback . . . 3:1--3:??
                   Jiyan Wu and   
                   Bo Cheng and   
                  Yuan Yang and   
                  Ming Wang and   
                  Junliang Chen   Delay-Aware Quality Optimization in
                                  Cloud-Assisted Video Streaming System    4:1--4:??
               Shuhui Jiang and   
                     Yue Wu and   
                         Yun Fu   Deep Bidirectional Cross-Triplet
                                  Embedding for Online Clothing Shopping   5:1--5:??
               Peisong Wang and   
                 Qinghao Hu and   
                Zhiwei Fang and   
              Chaoyang Zhao and   
                     Jian Cheng   DeepSearch: a Fast Image Search
                                  Framework for Mobile Devices . . . . . . 6:1--6:??
                 Sicong Liu and   
   Silvestro Roberto Poccia and   
    K. Selçuk Candan and   
         Maria Luisa Sapino and   
                   Xiaolan Wang   Robust Multi-Variate Temporal Features
                                  of Multi-Variate Time Series . . . . . . 7:1--7:??
                    Dan Guo and   
               Wengang Zhou and   
                Houqiang Li and   
                      Meng Wang   Online Early-Late Fusion Based on
                                  Adaptive HMM for Sign Language
                                  Recognition  . . . . . . . . . . . . . . 8:1--8:??
             Huei-Fang Yang and   
                 Bo-Yao Lin and   
             Kuang-Yu Chang and   
                  Chu-Song Chen   Joint Estimation of Age and Expression
                                  by Combining Scattering and
                                  Convolutional Networks . . . . . . . . . 9:1--9:??
                 Shao Huang and   
              Weiqiang Wang and   
               Shengfeng He and   
               Rynson W. H. Lau   Egocentric Hand Detection Via Dynamic
                                  Region Growing . . . . . . . . . . . . . 10:1--10:??
                 Jiqing Wen and   
                  James She and   
                Xiaopeng Li and   
                        Hui Mao   Visual Background Recommendation for
                                  Dance Performances Using Deep Matrix
                                  Factorization  . . . . . . . . . . . . . 11:1--11:??
               Zhaoqing Pan and   
                Jianjun Lei and   
               Yajuan Zhang and   
                    Fu Lee Wang   Adaptive Fractional-Pixel Motion
                                  Estimation Skipped Algorithm for
                                  Efficient HEVC Motion Estimation . . . . 12:1--12:??
              Zhedong Zheng and   
                Liang Zheng and   
                        Yi Yang   A Discriminatively Learned CNN Embedding
                                  for Person Reidentification  . . . . . . 13:1--13:??
                 Weiwei Sun and   
               Jiantao Zhou and   
                Shuyuan Zhu and   
                  Yuan Yan Tang   Robust Privacy-Preserving Image Sharing
                                  over Online Social Networks (OSNs) . . . 14:1--14:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 14, Number 2, May, 2018

               Stefano Berretti   Improved Audio Steganalytic Feature and
                                  Its Applications in Audio Forensics  . . 43:1--43:??
              Abhinav Gupta and   
                  Divya Singhal   Analytical Global Median Filtering
                                  Forensics Based on Moment Histograms . . 44:1--44:??
                  Min Huang and   
                Song-Zhi Su and   
              Hong-Bo Zhang and   
               Guo-Rong Cai and   
              Dongying Gong and   
                Donglin Cao and   
                     Shao-Zi Li   Multifeature Selection for $3$D Human
                                  Action Recognition . . . . . . . . . . . 45:1--45:??
              Amir Mazaheri and   
                Boqing Gong and   
                   Mubarak Shah   Learning a Multi-Concept Video Retrieval
                                  Model with Multiple Latent Variables . . 46:1--46:??
           Aurora Tulilaulu and   
           Matti Nelimarkka and   
           Joonas Paalasmaa and   
             Daniel Johnson and   
                Dan Ventura and   
               Petri Myllys and   
                 Hannu Toivonen   Data Musicalization  . . . . . . . . . . 47:1--47:??
            Marcella Cornia and   
            Lorenzo Baraldi and   
             Giuseppe Serra and   
                 Rita Cucchiara   Paying More Attention to Saliency: Image
                                  Captioning with Saliency and Context
                                  Attention  . . . . . . . . . . . . . . . 48:1--48:??
                Longyin Wen and   
                Honggang Qi and   
                      Siwei Lyu   Contrast Enhancement Estimation for
                                  Digital Image Forensics  . . . . . . . . 49:1--49:??
              Yu-Gang Jiang and   
                  Minjun Li and   
                    Xi Wang and   
                    Wei Liu and   
                 Xian-Sheng Hua   DeepProduct: Mobile Product Search With
                                  Portable Deep Features . . . . . . . . . 50:1--50:??
               Kashif Ahmad and   
    Mohamed Lamine Mekhalfi and   
               Nicola Conci and   
              Farid Melgani and   
            Francesco De Natale   Ensemble of Deep Models for Event
                                  Recognition  . . . . . . . . . . . . . . 51:1--51:??
                     Wei Hu and   
              Mozhdeh Seifi and   
                  Erik Reinhard   Over- and Under-Exposure Reconstruction
                                  of a Single Plenoptic Capture  . . . . . 52:1--52:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 14, Number 2s, May, 2018

           Lea Skorin-Kapov and   
       Martín Varela and   
       Tobias Hoßfeld and   
                   Kuan-Ta Chen   Guest Editorial: Special Issue on ``QoE
                                  Management for Multimedia Services'' . . 28:1--28:??
           Lea Skorin-Kapov and   
       Martín Varela and   
       Tobias Hoßfeld and   
                   Kuan-Ta Chen   A Survey of Emerging Concepts and
                                  Challenges for QoE Management of
                                  Multimedia Services  . . . . . . . . . . 29:1--29:??
                     Yi Zhu and   
    Sharath Chandra Guntuku and   
                  Weisi Lin and   
          Gheorghita Ghinea and   
                 Judith A. Redi   Measuring Individual Video QoE: a
                                  Survey, and Proposal for Future
                                  Directions Using Social Media  . . . . . 30:1--30:??
         Stefano Petrangeli and   
       Jeroen Van Der Hooft and   
                Tim Wauters and   
                 Filip De Turck   Quality of Experience-Centric Management
                                  of Adaptive Video Streaming Services:
                                  Status and Challenges  . . . . . . . . . 31:1--31:??
             Divyashri Bhat and   
                   Amr Rizk and   
               Michael Zink and   
                 Ralf Steinmetz   SABR: Network-Assisted Content
                                  Distribution for QoE-Driven ABR Video
                                  Streaming  . . . . . . . . . . . . . . . 32:1--32:??
            Valentin Burger and   
              Thomas Zinner and   
              Lam Dinh-Xuan and   
             Florian Wamser and   
                 Phuoc Tran-Gia   A Generic Approach to Video Buffer
                                  Modeling Using Discrete-Time Analysis    33:1--33:??
            Matti Siekkinen and   
Teemu kämäräinen and   
           Leonardo Favario and   
                  Enrico Masala   Can You See What I See?
                                  Quality-of-Experience Measurements of
                                  Mobile Live Video Broadcasting . . . . . 34:1--34:??
   Joachim Bruneau-Queyreix and   
       Jordi Mongay Batalla and   
             Mathias Lacaud and   
                   Daniel Negru   PMS: a Novel Scale-Adaptive and
                                  Quality-Adaptive Hybrid P2P\slash
                                  Multisource Solution for Live Streaming  35:1--35:??
          Alessandro Floris and   
               Arslan Ahmad and   
                   Luigi Atzori   QoE-Aware OTT-ISP Collaboration in
                                  Service Management: Architecture and
                                  Approaches . . . . . . . . . . . . . . . 36:1--36:??
                    Yan Yan and   
                Liqiang Nie and   
                 Rita Cucchiara   Guest Editorial: Special Section on
                                  ``Multimedia Understanding via
                                  Multimodal Analytics'' . . . . . . . . . 37:1--37:??
            Akanksha Tiwari and   
     Christian Von Der Weth and   
           Mohan S. Kankanhalli   Multimodal Multiplatform Social Media
                                  Event Summarization  . . . . . . . . . . 38:1--38:??
                 Anran Wang and   
                Jianfei Cai and   
                   Jiwen Lu and   
                   Tat-Jen Cham   Structure-Aware Multimodal Feature
                                  Fusion for RGB-D Scene Classification
                                  and Beyond . . . . . . . . . . . . . . . 39:1--39:??
                 Cheng Wang and   
                Haojin Yang and   
               Christoph Meinel   Image Captioning with Deep Bidirectional
                                  LSTMs and Multi-Task Learning  . . . . . 40:1--40:??
              Zhenguang Liu and   
                Yingjie Xia and   
                     Qi Liu and   
                 Qinming He and   
                 Chao Zhang and   
               Roger Zimmermann   Toward Personalized Activity Level
                                  Prediction in Community Question
                                  Answering Websites . . . . . . . . . . . 41:1--41:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 14, Number 3, August, 2018

                  Maha Abdallah   Aesthetic Highlight Detection in Movies
                                  Based on Synchronization of Spectators'
                                  Reactions  . . . . . . . . . . . . . . . 68:1--68:??
                 Yalong Bai and   
               Kuiyuan Yang and   
                    Tao Mei and   
                Wei-Ying Ma and   
                    Tiejun Zhao   Automatic Data Augmentation from Massive
                                  Web Images for Deep Visual Recognition   69:1--69:??
                    Min Tan and   
                     Jun Yu and   
                    Zhou Yu and   
                    Fei Gao and   
                   Yong Rui and   
                    Dacheng Tao   User-Click-Data-Based Fine-Grained Image
                                  Recognition via Weakly Supervised Metric
                                  Learning . . . . . . . . . . . . . . . . 70:1--70:??
          Abdelhak Bentaleb and   
               Ali C. Begen and   
               Roger Zimmermann   ORL--SDN: Online Reinforcement Learning
                                  for SDN-Enabled HTTP Adaptive Streaming  71:1--71:??
              Lingchao Kong and   
                        Rui Dai   Efficient Video Encoding for Automatic
                                  Video Analysis in Distributed Wireless
                                  Surveillance Systems . . . . . . . . . . 72:1--72:??
                  Anqi Wang and   
                 Haifeng Hu and   
                     Liang Yang   Image Captioning with Affective Guiding
                                  and Selective Attention  . . . . . . . . 73:1--73:??
              Marjan Sikora and   
               Mladen Russo and   
               Jurica Derek and   
                Ante Jurcevi\'c   Soundscape of an Archaeological Site
                                  Recreated with Audio Augmented Reality   74:1--74:??
         Heiner Kirchhoffer and   
               Detlev Marpe and   
              Heiko Schwarz and   
                 Thomas Wiegand   Properties and Design of
                                  Variable-to-Variable Length Codes  . . . 75:1--75:??
             Johannes Kiess and   
               Stephan Kopf and   
           Benjamin Guthier and   
            Wolfgang Effelsberg   A Survey on Content-Aware Image and
                                  Video Retargeting  . . . . . . . . . . . 76:1--76:??
                   J. Cecil and   
              Avinash Gupta and   
             M. Pirela-Cruz and   
             Parmesh Ramanathan   A Network-Based Virtual Reality
                                  Simulation Training Approach for
                                  Orthopedic Surgery . . . . . . . . . . . 77:1--77:??
               Husheng Dong and   
                    Ping Lu and   
               Chunping Liu and   
                      Yi Ji and   
                 Shengrong Gong   Learning Multiple Kernel Metrics for
                                  Iterative Person Re-Identification . . . 78:1--78:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 14, Number 3s, August, 2018

              Maha Abdallah and   
               Kuan-Ta Chen and   
            Carsten Griwodz and   
                 Cheng-Hsin Hsu   Introduction to the Special Issue on
                                  Delay-Sensitive Video Computing in the
                                  Cloud  . . . . . . . . . . . . . . . . . 53:1--53:??
              Maha Abdallah and   
            Carsten Griwodz and   
               Kuan-Ta Chen and   
              Gwendal Simon and   
              Pin-Chun Wang and   
                 Cheng-Hsin Hsu   Delay-Sensitive Video Computing in the
                                  Cloud: a Survey  . . . . . . . . . . . . 54:1--54:??
                   Yusen Li and   
                Yunhua Deng and   
                Xueyan Tang and   
                Wentong Cai and   
              Xiaoguang Liu and   
                      Gang Wang   Cost-Efficient Server Provisioning for
                                  Cloud Gaming . . . . . . . . . . . . . . 55:1--55:??
                Ivan Slivar and   
            Mirko Suznjevic and   
               Lea Skorin-Kapov   Game Categorization for Deriving
                                  QoE-Driven Video Encoding Configuration
                                  Strategies for Cloud Gaming  . . . . . . 56:1--56:??
                  Mark Claypool   Game Input with Delay-Moving Target
                                  Selection with a Game Controller
                                  Thumbstick . . . . . . . . . . . . . . . 57:1--57:??
                 Xueshi Hou and   
                     Yao Lu and   
                      Sujit Dey   Novel Hybrid-Cast Approach to Reduce
                                  Bandwidth and Latency for Cloud-Based
                                  Virtual Space  . . . . . . . . . . . . . 58:1--58:??
                  Chang Liu and   
              Wei Tsang Ooi and   
                Jinyuan Jia and   
                       Lei Zhao   Cloud Baking: Collaborative Scene
                                  Illumination for Dynamic Web$3$D Scenes  59:1--59:??
                Pablo Cesar and   
             Cheng-Hsin Hsu and   
            Chun-Ying Huang and   
                        Pan Hui   Best Papers of the ACM Multimedia
                                  Systems (MMSys) Conference 2017 and the
                                  ACM Workshop on Network and Operating
                                  System Support for Digital Audio and
                                  Video (NOSSDAV) 2017 . . . . . . . . . . 60:1--60:??
            Ahmed H. Zahran and   
           Jason J. Quinlan and   
         K. K. Ramakrishnan and   
              Cormac J. Sreenan   ASAP: Adaptive Stall-Aware Pacing for
                                  Improved DASH Video Experience in
                                  Cellular Networks  . . . . . . . . . . . 61:1--61:??
                  Chao Zhou and   
                 Zhenhua Li and   
                 Joe Osgood and   
                        Yao Liu   On the Effectiveness of Offset
                                  Projections for $ 360$-Degree Video
                                  Streaming  . . . . . . . . . . . . . . . 62:1--62:??
            Kanchan Bahirat and   
              Chengyuan Lai and   
            Ryan P. Mcmahan and   
       Balakrishnan Prabhakaran   Designing and Evaluating a Mesh
                                  Simplification Algorithm for Virtual
                                  Reality  . . . . . . . . . . . . . . . . 63:1--63:??
                Junjue Wang and   
               Brandon Amos and   
                 Anupam Das and   
         Padmanabhan Pillai and   
               Norman Sadeh and   
         Mahadev Satyanarayanan   Enabling Live Video Analytics with a
                                  Scalable and Privacy-Aware Framework . . 64:1--64:??
Gylfi \Thornór Gudmundsson and   
Björn \Thornór Jónsson and   
            Laurent Amsaleg and   
            Michael J. Franklin   Prototyping a Web-Scale Multimedia
                                  Retrieval Service Using Spark  . . . . . 65:1--65:??
                    Ming Ma and   
                  Lei Zhang and   
             Jiangchuan Liu and   
                   Zhi Wang and   
               Haitian Pang and   
                 Lifeng Sun and   
                  Weihua Li and   
              Guangling Hou and   
                     Kaiyan Chu   Characterizing User Behaviors in Mobile
                                  Personal Livecast: Towards an Edge
                                  Computing-assisted Paradigm  . . . . . . 66:1--66:??
                  Lei Huang and   
                 Bowen Ding and   
                Aining Wang and   
                 Yuedong Xu and   
                Yipeng Zhou and   
                       Xiang Li   User Behavior Analysis and Video
                                  Popularity Prediction on a Large-Scale
                                  VoD System . . . . . . . . . . . . . . . 67:1--67:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 14, Number 4, November, 2018

              Junfeng Zhang and   
                     Haifeng Hu   Joint Head Attribute Classifier and
                                  Domain-Specific Refinement Networks for
                                  Face Alignment . . . . . . . . . . . . . 79:1--79:??
       Lucas Pascotti Valem and   
   Carlos Renan De Oliveira and   
Daniel Carlos Guimarães Pedronette and   
                Jurandy Almeida   Unsupervised Similarity Learning through
                                  Rank Correlation and kNN Sets  . . . . . 80:1--80:??
                 Hui-Yin Wu and   
           Francesca Pal\`u and   
              Roberto Ranon and   
                  Marc Christie   Thinking Like a Director: Film Editing
                                  Patterns for Virtual Cinematographic
                                  Storytelling . . . . . . . . . . . . . . 81:1--81:??
                     Tuo Yu and   
                Haiming Jin and   
               Wai-Tian Tan and   
                Klara Nahrstedt   SKEPRID: Pose and Illumination
                                  Change-Resistant Skeleton-Based Person
                                  Re-Identification  . . . . . . . . . . . 82:1--82:??
                   Hehe Fan and   
                Liang Zheng and   
              Chenggang Yan and   
                        Yi Yang   Unsupervised Person Re-identification:
                                  Clustering and Fine-tuning . . . . . . . 83:1--83:??
                Xiaodan Lin and   
                   Xiangui Kang   Robust Electric Network Frequency
                                  Estimation with Rank Reduction and
                                  Linear Prediction  . . . . . . . . . . . 84:1--84:??
                     Yue Li and   
                 Gaobo Yang and   
                  Yapei Zhu and   
             Xiangling Ding and   
                  Rongrong Gong   Probability Model-Based Early Merge Mode
                                  Decision for Dependent Views Coding in
                                  $3$D-HEVC  . . . . . . . . . . . . . . . 85:1--85:??
      Joel A. F. Dos Santos and   
Débora C. Muchaluat-Saade and   
       Cécile Roisin and   
               Nabil Laya\"\ida   A Hybrid Approach for Spatio-Temporal
                                  Validation of Declarative Multimedia
                                  Documents  . . . . . . . . . . . . . . . 86:1--86:??
                     Jie Wu and   
                 Haifeng Hu and   
                          Yi Wu   Image Captioning via Semantic Guidance
                                  Attention and Consensus Selection
                                  Strategy . . . . . . . . . . . . . . . . 87:1--87:??
          Gjorgji Strezoski and   
                 Marcel Worring   OmniArt: a Large-scale Artistic
                                  Benchmark  . . . . . . . . . . . . . . . 88:1--88:??
             Christian Koch and   
                Moritz Lode and   
                Denny Stohr and   
                   Amr Rizk and   
                 Ralf Steinmetz   Collaborations on YouTube: From
                                  Unsupervised Detection to the Impact on
                                  Video and Channel Popularity . . . . . . 89:1--89:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 15, Number 1, February, 2019

                      Wei Zhang   Efficient QoE-Aware Scheme for Video
                                  Quality Switching Operations in Dynamic
                                  Adaptive Streaming . . . . . . . . . . . 17:1--17:??
           Mariem Ben Yahia and   
         Yannick Le Louedec and   
              Gwendal Simon and   
              Loutfi Nuaymi and   
               Xavier Corbillon   HTTP/2-based Frame Discarding for
                                  Low-Latency Adaptive Video Streaming . . 18:1--18:??
                 Xianguo Li and   
                  Yemei Sun and   
                 Yanli Yang and   
                  Changyun Miao   Symmetrical Residual Connections for
                                  Single Image Super-Resolution  . . . . . 19:1--19:??
                      Yi Yu and   
                 Suhua Tang and   
           Francisco Raposo and   
                       Lei Chen   Deep Cross-Modal Correlation Learning
                                  for Audio and Lyrics in Music Retrieval  20:1--20:??
                    Jia Sun and   
                   Di Huang and   
               Yunhong Wang and   
                    Liming Chen   Expression Robust $3$D Facial
                                  Landmarking via Progressive
                                  Coarse-to-Fine Tuning  . . . . . . . . . 21:1--21:??
                 Yuxin Peng and   
                      Jinwei Qi   CM-GANs: Cross-modal Generative
                                  Adversarial Networks for Common
                                  Representation Learning  . . . . . . . . 22:1--22:??
                Pietro Pala and   
               Stefano Berretti   Reconstructing $3$D Face Models by
                                  Incremental Aggregation and Refinement
                                  of Depth Frames  . . . . . . . . . . . . 23:1--23:??
                     Han Hu and   
                 Yichao Jin and   
               Yonggang Wen and   
                Cedric Westphal   Orchestrating Caching, Transcoding and
                                  Request Routing for Adaptive Video
                                  Streaming Over ICN . . . . . . . . . . . 24:1--24:??
                    Bo Yuan and   
                  Xinbo Gao and   
               Zhenxing Niu and   
                        Qi Tian   Discovering Latent Topics by Gaussian
                                  Latent Dirichlet Allocation and Spectral
                                  Clustering . . . . . . . . . . . . . . . 25:1--25:??
                    Chen He and   
                     Haifeng Hu   Image Captioning With Visual-Semantic
                                  Double Attention . . . . . . . . . . . . 26:1--26:??
                  Ruoyu Liu and   
                   Yao Zhao and   
                 Shikui Wei and   
                Liang Zheng and   
                        Yi Yang   Modality-Invariant Image-Text Embedding
                                  for Image-Sentence Matching  . . . . . . 27:1--27:??
                  Ruijun Ma and   
                 Haifeng Hu and   
               Weixuan Wang and   
                     Jia Xu and   
                   Zhengming Li   Photorealistic Face Completion with
                                  Semantic Parsing and Face
                                  Identity-Preserving Features . . . . . . 28:1--28:??
                Jakub Lokoc and   
     Gregor Kovalcík and   
          Bernd Münzer and   
      Klaus Schöffmann and   
              Werner Bailer and   
               Ralph Gasser and   
         Stefanos Vrochidis and   
          Phuong Anh Nguyen and   
     Sitapa Rujikietgumjorn and   
                Kai Uwe Barthel   Interactive Search or Sequential
                                  Browsing? A Detailed Analysis of the
                                  Video Browser Showdown 2018  . . . . . . 29:1--29:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 15, Number 1s, February, 2019

                  Wei Zhang and   
                   Ting Yao and   
                  Shiai Zhu and   
         Abdulmotaleb El Saddik   Editorial to Special Issue on Deep
                                  Learning for Intelligent Multimedia
                                  Analytics  . . . . . . . . . . . . . . . 1:1--1:??
                  Wei Zhang and   
                   Ting Yao and   
                  Shiai Zhu and   
         Abdulmotaleb El Saddik   Deep Learning-Based Multimedia
                                  Analytics: a Review  . . . . . . . . . . 2:1--2:??
                Hongtao Xie and   
             Shancheng Fang and   
              Zheng-Jun Zha and   
                Yating Yang and   
                     Yan Li and   
                 Yongdong Zhang   Convolutional Attention Networks for
                                  Scene Text Recognition . . . . . . . . . 3:1--3:??
               Zhineng Chen and   
                Shanshan Ai and   
                     Caiyan Jia   Structure-Aware Deep Learning for
                                  Product Image Classification . . . . . . 4:1--4:??
             Shuqiang Jiang and   
               Gongwei Chen and   
               Xinhang Song and   
                      Linhu Liu   Deep Patch Representations with Shared
                                  Codebook for Scene Classification  . . . 5:1--5:??
               Rui-Wei Zhao and   
                   Qi Zhang and   
                  Zuxuan Wu and   
                 Jianguo Li and   
                  Yu-Gang Jiang   Visual Content Recognition by Exploiting
                                  Semantic Feature Map with Attention and
                                  Multi-task Learning  . . . . . . . . . . 6:1--6:??
               Xueliang Liu and   
                  Meng Wang and   
              Zheng-Jun Zha and   
                   Richang Hong   Cross-Modality Feature Learning via
                                  Convolutional Autoencoder  . . . . . . . 7:1--7:??
                 Jiawei Liu and   
              Zheng-Jun Zha and   
                Xuejin Chen and   
                 Zilei Wang and   
                 Yongdong Zhang   Dense $3$D-Convolutional Neural Network
                                  for Person Re-Identification in Videos   8:1--8:??
                 Liang Zhao and   
                Zhikui Chen and   
           Laurence T. Yang and   
              M. Jamal Deen and   
                   Z. Jane Wang   Deep Semantic Mapping for Heterogeneous
                                  Multimedia Transfer Learning Using
                                  Co-Occurrence Data . . . . . . . . . . . 9:1--9:??
          M. Shamim Hossain and   
             Syed Umar Amin and   
         Mansour Alsulaiman and   
                Ghulam Muhammad   Applying Deep Learning for Epilepsy
                                  Seizure Detection and Brain Mapping
                                  Visualization  . . . . . . . . . . . . . 10:1--10:??
      Xavier Alameda-Pineda and   
                Miriam Redi and   
         Mohammad Soleymani and   
                  Nicu Sebe and   
              Shih-Fu Chang and   
                 Samuel Gosling   Special Section on Multimodal
                                  Understanding of Social, Affective, and
                                  Subjective Attributes  . . . . . . . . . 11:1--11:??
              Chuan-Shen Hu and   
             Yi-Tsung Hsieh and   
              Hsiao-Wei Lin and   
                   Mei-Chen Yeh   Virtual Portraitist: an Intelligent Tool
                                  for Taking Well-Posed Selfies  . . . . . 12:1--12:??
                Shogo Okada and   
         Laurent Son Nguyen and   
                   Oya Aran and   
            Daniel Gatica-Perez   Modeling Dyadic and Group Impressions
                                  with Intermodal and Interperson Features 13:1--13:??
               Sicheng Zhao and   
          Amir Gholaminejad and   
              Guiguang Ding and   
                    Yue Gao and   
                Jungong Han and   
                   Kurt Keutzer   Personalized Emotion Recognition by
                                  Personality-Aware High-Order Learning of
                                  Physiological Signals  . . . . . . . . . 14:1--14:??
               Rim Trabelsi and   
     Jagannadan Varadarajan and   
                   Le Zhang and   
                Issam Jabri and   
                   Yong Pei and   
                Fethi Smach and   
           Ammar Bouallegue and   
                  Pierre Moulin   Understanding the Dynamics of Social
                                  Interactions: a Multi-Modal Multi-View
                                  Approach . . . . . . . . . . . . . . . . 15:1--15:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 15, Number 2, June, 2019

                   Tian Gan and   
                  Junnan Li and   
              Yongkang Wong and   
           Mohan S. Kankanhalli   A Multi-sensor Framework for Personal
                                  Presentation Analytics . . . . . . . . . 30:1--30:??
               Pengjie Tang and   
                 Hanli Wang and   
                       Qinyu Li   Rich Visual and Language Representation
                                  with Complementary Semantics for Video
                                  Captioning . . . . . . . . . . . . . . . 31:1--31:??
                  Chen Shen and   
              Zhongming Jin and   
                Wenqing Chu and   
              Rongxin Jiang and   
                 Yaowu Chen and   
                 Guo-Jun Qi and   
                 Xian-Sheng Hua   Multi-level Similarity Perception
                                  Network for Person Re-identification . . 32:1--32:??
                    Yu Miao and   
                Haiwei Dong and   
      Jihad Mohamad Al Jaam and   
         Abdulmotaleb El Saddik   A Deep Learning System for Recognizing
                                  Facial Expression in Real-Time . . . . . 33:1--33:??
         Gebremariam Mesfin and   
              Nadia Hussain and   
           Alexandra Covaci and   
              Gheorghita Ghinea   Using Eye Tracking and Heart-Rate
                                  Activity to Examine Crossmodal
                                  Correspondences QoE in Mulsemedia  . . . 34:1--34:??
                Ming Cheung and   
                  James She and   
                 Weiwei Sun and   
                   Jiantao Zhou   Detecting Online Counterfeit-goods
                                  Seller using Connection Discovery  . . . 35:1--35:??
       Hema Kumar Yarnagula and   
           Parikshit Juluri and   
          Sheyda Kiani Mehr and   
      Venkatesh Tamarapalli and   
                     Deep Medhi   QoE for Mobile Clients with
                                  Segment-aware Rate Adaptation Algorithm
                                  (SARA) for DASH Video Streaming  . . . . 36:1--36:??
           Pradeep K. Atrey and   
               Bakul Trehan and   
                Mukesh K. Saini   Watch Me from Distance (WMD): a
                                  Privacy-Preserving Long-Distance Video
                                  Surveillance System  . . . . . . . . . . 37:1--37:??
               Chih-Fan Hsu and   
              Yu-Shuen Wang and   
             Chin-Laung Lei and   
                   Kuan-Ta Chen   Look at Me! Correcting Eye Gaze in Live
                                  Video Communication  . . . . . . . . . . 38:1--38:??
               Kashif Ahmad and   
                   Nicola Conci   How Deep Features Have Improved Event
                                  Recognition in Multimedia: a Survey  . . 39:1--39:??
                Yadang Chen and   
               Chuanyan Hao and   
                Alex X. Liu and   
                       Enhua Wu   Appearance-consistent Video Object
                                  Segmentation Based on a Multinomial
                                  Event Model  . . . . . . . . . . . . . . 40:1--40:??
          Pierdicca Roberto and   
          Frontoni Emanuele and   
           Zingaretti Primo and   
            Mancini Adriano and   
           Loncarski Jelena and   
                Paolanti Marina   Design, Large-Scale Usage Testing, and
                                  Important Metrics for Augmented Reality
                                  Gaming Applications  . . . . . . . . . . 41:1--41:??
        Aliaksandr Siarohin and   
                 Gloria Zen and   
           Cveta Majtanovic and   
      Xavier Alameda-Pineda and   
                Elisa Ricci and   
                      Nicu Sebe   Increasing Image Memorability with
                                  Neural Style Transfer  . . . . . . . . . 42:1--42:??
              Thanh-Toan Do and   
                 Tuan Hoang and   
           Dang-Khoa Le Tan and   
                     Huu Le and   
              Tam V. Nguyen and   
                Ngai-Man Cheung   From Selective Deep Convolutional
                                  Features to Compact Binary
                                  Representations for Image Retrieval  . . 43:1--43:??
                Liquan Shen and   
                    Ping An and   
                    Guorui Feng   Low-Complexity Scalable Extension of the
                                  High-Efficiency Video Coding (SHVC)
                                  Encoding System  . . . . . . . . . . . . 44:1--44:??
                     Jun Hu and   
            Shengsheng Qian and   
                  Quan Fang and   
               Xueliang Liu and   
                  Changsheng Xu   A$^2$ CMHNE: Attention-Aware
                                  Collaborative Multimodal Heterogeneous
                                  Network Embedding  . . . . . . . . . . . 45:1--45:??
            Khalid M. Hosny and   
             Mohamed M. Darwish   Resilient Color Image Watermarking Using
                                  Accurate Quaternion Radial Substituted
                                  Chebyshev Moments  . . . . . . . . . . . 46:1--46:??
                Wenxuan Mou and   
               Hatice Gunes and   
                 Ioannis Patras   Alone versus In-a-group: a Multi-modal
                                  Framework for Automatic Affect
                                  Recognition  . . . . . . . . . . . . . . 47:1--47:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 15, Number 3, September, 2019

                   Richang Hong   Advanced Stereo Seam Carving by
                                  Considering Occlusions on Both Sides . . 69:1--69:??
                  Yun Zhang and   
                      Na Li and   
                  Sam Kwong and   
               Gangyi Jiang and   
                 Huanqiang Zeng   Statistical Early Termination and Early
                                  Skip Models for Fast Mode Decision in
                                  HEVC INTRA Coding  . . . . . . . . . . . 70:1--70:??
              Abhinav Gupta and   
                  Divya Singhal   A Simplistic Global Median Filtering
                                  Forensics Based on Frequency Domain
                                  Analysis of Image Residuals  . . . . . . 71:1--71:??
                     Kan Wu and   
                 Guanbin Li and   
                 Haofeng Li and   
              Jianjun Zhang and   
                      Yizhou Yu   Harvesting Visual Objects from Internet
                                  Images via Deep-Learning-Based
                                  Objectness Assessment  . . . . . . . . . 72:1--72:??
                  Yuan Yuan and   
                   Jie Fang and   
               Xiaoqiang Lu and   
                  Yachuang Feng   Spatial Structure Preserving Feature
                                  Pyramid Network for Semantic Image
                                  Segmentation . . . . . . . . . . . . . . 73:1--73:??
              Junxuan Zhang and   
                 Haifeng Hu and   
                     Xinlong Lu   Moving Foreground-Aware Visual Attention
                                  and Key Volume Mining for Human Action
                                  Recognition  . . . . . . . . . . . . . . 74:1--74:??
                  Amit More and   
             Subhasis Chaudhuri   A Pseudo-likelihood Approach for
                                  Geo-localization of Events from
                                  Crowd-sourced Sensor-Metadata  . . . . . 75:1--75:??
                Mohsin Shah and   
              Weiming Zhang and   
                Honggang Hu and   
                     Nenghai Yu   Paillier Cryptosystem based Mean Value
                                  Computation for Encrypted Domain Image
                                  Processing Operations  . . . . . . . . . 76:1--76:??
               Guanghui Yue and   
               Chunping Hou and   
                   Tianwei Zhou   Subtitle Region Selection of S$3$D
                                  Images in Consideration of Visual
                                  Discomfort and Viewing Habit . . . . . . 77:1--77:??
                   Yehao Li and   
                Yingwei Pan and   
                   Ting Yao and   
              Hongyang Chao and   
                   Yong Rui and   
                        Tao Mei   Learning Click-Based Deep
                                  Structure-Preserving Embeddings with
                                  Visual Attention . . . . . . . . . . . . 78:1--78:??
                Tengfei Cao and   
               Changqiao Xu and   
                    Mu Wang and   
             Zhongbai Jiang and   
               Xingyan Chen and   
                Lujie Zhong and   
           Luigi Alfredo Grieco   Stochastic Optimization for Green
                                  Multimedia Services in Dense $5$G
                                  Networks . . . . . . . . . . . . . . . . 79:1--79:??
                     Jie Wu and   
                 Haifeng Hu and   
                     Liang Yang   Pseudo-$3$D Attention Transfer Network
                                  with Content-aware Strategy for Image
                                  Captioning . . . . . . . . . . . . . . . 80:1--80:??
                   Min Wang and   
               Wengang Zhou and   
                    Qi Tian and   
                    Houqiang Li   Deep Scalable Supervised Quantization by
                                  Self-Organizing Map  . . . . . . . . . . 81:1--81:??
         Ihsan Mert Ozcelik and   
                      Cem Ersoy   Chunk Duration-Aware SDN-Assisted DASH   82:1--82:??
              Naifan Zhuang and   
                 Guo-Jun Qi and   
               The Duc Kieu and   
                    Kien A. Hua   Rethinking the Combined and Individual
                                  Orders of Derivative of States for
                                  Differential Recurrent Neural Networks:
                                  Deep Differential Recurrent Neural
                                  Networks . . . . . . . . . . . . . . . . 83:1--83:??
            Zhangcheng Wang and   
                      Ya Li and   
               Richang Hong and   
                    Xinmei Tian   Eigenvector-Based Distance Metric
                                  Learning for Image Classification and
                                  Retrieval  . . . . . . . . . . . . . . . 84:1--84:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 15, Number 3s, January, 2020

                Pietro Pala and   
                Liming Chen and   
                   Di Huang and   
               Xiaoming Liu and   
             Stefanos Zafeiriou   Introduction to the Special Issue on
                                  Face Analysis Applications . . . . . . . 1--2
              Zhen-Hua Feng and   
              Josef Kittler and   
             Bill Christmas and   
                    Xiao-Jun Wu   A Unified Tensor-based Active Appearance
                                  Model  . . . . . . . . . . . . . . . . . 1--22
                 Gil Shamai and   
              Ron Slossberg and   
                     Ron Kimmel   Synthesizing Facial Photometries and
                                  Corresponding Geometries Using
                                  Generative Adversarial Networks  . . . . 1--24
               Xueping Wang and   
               Yunhong Wang and   
                      Weixin Li   U-Net Conditional GANs for
                                  Photo-Realistic and Identity-Preserving
                                  Facial Expression Synthesis  . . . . . . 1--23
                 Zhiwei Liu and   
                Xiangyu Zhu and   
                  Ming Tang and   
                   Zhen Lei and   
                   Jinqiao Wang   Efficient Face Alignment with Fast
                                  Normalization and Contour Fitting Loss   1--16
                 Huiyu Duan and   
               Xiongkuo Min and   
                    Yi Fang and   
                    Lei Fan and   
              Xiaokang Yang and   
                  Guangtao Zhai   Visual Attention Analysis and Prediction
                                  on Human Faces for Children with Autism
                                  Spectrum Disorder  . . . . . . . . . . . 1--23
              Mingxing Duan and   
                   Kenli Li and   
               Xiangke Liao and   
                   Keqin Li and   
                        Qi Tian   Features-Enhanced Multi-Attribute
                                  Estimation with Convolutional Tensor
                                  Correlation Fusion Network . . . . . . . 1--23
               Sicheng Zhao and   
               Dhiraj Joshi and   
         Mohammad Soleymani and   
                       Qiang Ji   Introduction to the Special Issue on
                                  Affective Computing for Large-scale
                                  Heterogeneous Multimedia Data  . . . . . 1--2
               Sicheng Zhao and   
              Shangfei Wang and   
         Mohammad Soleymani and   
               Dhiraj Joshi and   
                       Qiang Ji   Affective Computing for Large-scale
                                  Heterogeneous Multimedia Data: a Survey  1--32
              Xiaopeng Hong and   
                   Wei Peng and   
           Mehrtash Harandi and   
                Ziheng Zhou and   
     Matti Pietikäinen and   
                   Guoying Zhao   Characterizing Subtle Facial Movements
                                  via Riemannian Manifold  . . . . . . . . 1--24
                 Junjie Zhu and   
                 Yuxuan Wei and   
                 Yifan Feng and   
                 Xibin Zhao and   
                        Yue Gao   Physiological Signals-based Emotion
                                  Recognition via High-order Correlation
                                  Learning . . . . . . . . . . . . . . . . 1--18
                 Dongyu She and   
                   Ming Sun and   
                    Jufeng Yang   Learning Discriminative Sentiment
                                  Representation from Strongly- and Weakly
                                  Supervised CNNs  . . . . . . . . . . . . 1--19
                   Liang Li and   
                  Xinge Zhu and   
                 Yiming Hao and   
                Shuhui Wang and   
                 Xingyu Gao and   
                 Qingming Huang   A Hierarchical CNN-RNN Approach for
                                  Visual Emotion Classification  . . . . . 1--17
                 Liang Yang and   
                Yuexue Wang and   
                  Junhua Gu and   
               Xiaochun Cao and   
                  Xiao Wang and   
                     Di Jin and   
              Guiguang Ding and   
                Jungong Han and   
                 Weixiong Zhang   Autonomous Semantic Community Detection
                                  via Adaptively Weighted Low-rank
                                  Approximation  . . . . . . . . . . . . . 1--22
                  Yuxin Hou and   
                Hongxun Yao and   
              Xiaoshuai Sun and   
                      Haoran Li   Soul Dancer: Emotion-Based Human Action
                                  Generation . . . . . . . . . . . . . . . 1--19
               Shenghong Hu and   
                     Min Xu and   
               Haimin Zhang and   
               Chunxia Xiao and   
                       Chao Gui   Affective Content-aware Adaptation
                                  Scheme on QoE Optimization of Adaptive
                                  Streaming over HTTP  . . . . . . . . . . 1--18

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 15, Number 4, January, 2020

                 Weizhi Nie and   
                Weijie Wang and   
                   Anan Liu and   
                  Yuting Su and   
                        Jie Nie   HGAN: Holistic Generative Adversarial
                                  Networks for Two-dimensional Image-based
                                  Three-dimensional Object Retrieval . . . 1--24
                  Mading Li and   
                Jiaying Liu and   
                Xiaoyan Sun and   
                   Zhiwei Xiong   Image/Video Restoration via Multiplanar
                                  Autoregressive Model and Low-Rank
                                  Optimization . . . . . . . . . . . . . . 1--23
            Sheng-Hua Zhong and   
              Yuantian Wang and   
                Tongwei Ren and   
              Mingjie Zheng and   
                    Yan Liu and   
                    Gangshan Wu   Steganographer Detection via Multi-Scale
                                  Embedding Probability Estimation . . . . 1--23
    Marcos Alves de Almeida and   
    Carolina Coimbra Vieira and   
Pedro Olmo Stancioli Vaz De Melo and   
Renato Martins Assunção   Random Playlists Smoothly Commuting
                                  Between Styles . . . . . . . . . . . . . 1--20
                  Zhaoda Ye and   
                     Yuxin Peng   Sequential Cross-Modal Hashing Learning
                                  via Multi-scale Correlation Mining . . . 1--20
               Shiguang Liu and   
                   Ziqing Huang   Efficient Image Hashing with Geometric
                                  Invariant Vector Distance for Copy
                                  Detection  . . . . . . . . . . . . . . . 1--22
               Zhandong Liu and   
               Wengang Zhou and   
                    Houqiang Li   AB-LSTM: Attention-based Bidirectional
                                  LSTM Model for Scene Text Detection  . . 1--23
           Deepayan Bhowmik and   
            Charith Abhayaratne   Embedding Distortion Analysis in
                                  Wavelet-domain Watermarking  . . . . . . 1--24
                  Ling Shen and   
               Richang Hong and   
               Haoran Zhang and   
                Xinmei Tian and   
                      Meng Wang   Video Retrieval with
                                  Similarity-Preserving Deep Temporal
                                  Hashing  . . . . . . . . . . . . . . . . 1--16
       Jeroen Van der Hooft and   
          Maria Torres Vega and   
         Stefano Petrangeli and   
                Tim Wauters and   
                 Filip De Turck   Tile-based Adaptive Streaming for
                                  Virtual Reality Video  . . . . . . . . . 1--24
Roberto Iraja Tavares Da Costa Filho and   
  Marcelo Caggiani Luizelli and   
         Stefano Petrangeli and   
          Maria Torres Vega and   
       Jeroen Van der Hooft and   
                Tim Wauters and   
             Filip De Turck and   
       Luciano Paschoal Gaspary   Dissecting the Performance of VR Video
                                  Streaming through the VR-EXP
                                  Experimentation Platform . . . . . . . . 1--23
              Yunpeng Zheng and   
                 Xuelong Li and   
                   Xiaoqiang Lu   Unsupervised Learning of Human Action
                                  Categories in Still Images with Deep
                                  Representations  . . . . . . . . . . . . 1--20
                  Meng Xing and   
               Zhiyong Feng and   
                    Yong Su and   
                  Jianhai Zhang   An Image Cues Coding Approach for $3$D
                                  Human Pose Estimation  . . . . . . . . . 1--20
                Jinhuan Liu and   
               Xuemeng Song and   
                Liqiang Nie and   
                   Tian Gan and   
                         Jun Ma   An End-to-End Attention-Based Neural
                                  Model for Complementary Clothing
                                  Matching . . . . . . . . . . . . . . . . 1--16
               Jonathan Kua and   
         Grenville Armitage and   
              Philip Branch and   
                      Jason But   Adaptive Chunklets and AQM for
                                  Higher-Performance Content Streaming . . 1--24

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 16, Number 1, April, 2020

                   Bin Chen and   
               Lingyan Ruan and   
                   Miu-Ling Lam   LFGAN: $4$D Light Field Synthesis from a
                                  Single RGB Image . . . . . . . . . . . . 2:1--2:20
                Yuhang Ding and   
                   Hehe Fan and   
               Mingliang Xu and   
                        Yi Yang   Adaptive Exploration for Unsupervised
                                  Person Re-identification . . . . . . . . 3:1--3:19
          Abdelhak Bentaleb and   
        Praveen Kumar Yadav and   
              Wei Tsang Ooi and   
               Roger Zimmermann   DQ-DASH: a Queuing Theory Approach to
                                  Distributed Adaptive Video Streaming . . 4:1--4:24
                  Xin Huang and   
                 Yuxin Peng and   
                      Zhang Wen   RCE-HIL: Recognizing Cross-media
                                  Entailment with Heterogeneous
                                  Interactive Learning . . . . . . . . . . 5:1--5:21
                Miaopeng Li and   
                Zimeng Zhou and   
                     Xinguo Liu   Cross Refinement Techniques for
                                  Markerless Human Motion Capture  . . . . 6:1--6:18
          Gazi Karam Illahi and   
          Thomas Van Gemert and   
            Matti Siekkinen and   
              Enrico Masala and   
           Antti Oulasvirta and   
Antti Ylä-Jääski   Cloud Gaming with Foveated Video
                                  Encoding . . . . . . . . . . . . . . . . 7:1--7:24
              Duc V. Nguyen and   
           Huyen T. T. Tran and   
              Truong Cong Thang   An Evaluation of Tile Selection Methods
                                  for Viewport-Adaptive Streaming of
                                  360-Degree Video . . . . . . . . . . . . 8:1--8:24
               Zhenguo Yang and   
                 Zehang Lin and   
                Peipei Kang and   
                Jianming Lv and   
                    Qing Li and   
                     Wenyin Liu   Learning Shared Semantic Space with
                                  Correlation Alignment for Cross-Modal
                                  Event Retrieval  . . . . . . . . . . . . 9:1--9:22
              Junfeng Zhang and   
                 Haifeng Hu and   
                    Guobin Shen   Joint Stacked Hourglass Network and
                                  Salient Region Attention Refinement for
                                  Robust Face Alignment  . . . . . . . . . 10:1--10:18
                   Shuji Tasaka   Causal Structures of Multidimensional
                                  QoE in Haptic-Audiovisual
                                  Communications: Bayesian Modeling  . . . 11:1--11:23
        Narinder Singh Punn and   
                 Sonali Agarwal   Inception U-Net Architecture for
                                  Semantic Segmentation to Identify Nuclei
                                  in Microscopy Cell Images  . . . . . . . 12:1--12:15
      Chandramani Chaudhary and   
               Poonam Goyal and   
              Navneet Goyal and   
            Yi-Ping Phoebe Chen   Image Retrieval for Complex Queries
                                  Using Knowledge Embedding  . . . . . . . 13:1--13:23
               Guoliang Luo and   
               Zhigang Deng and   
                   Xin Zhao and   
               Xiaogang Jin and   
                   Wei Zeng and   
               Wenqiang Xie and   
                     Hyewon Seo   Spatio-temporal Segmentation Based
                                  Adaptive Compression of Dynamic Mesh
                                  Sequences  . . . . . . . . . . . . . . . 14:1--14:24
               Zhaoqing Pan and   
                 Xiaokai Yi and   
                  Yun Zhang and   
                   Hui Yuan and   
                Fu Lee Wang and   
                      Sam Kwong   Frame-level Bit Allocation Optimization
                                  Based on Video Content Characteristics
                                  for HEVC . . . . . . . . . . . . . . . . 15:1--15:20
            Jean-Paul Ainam and   
                     Ke Qin and   
                Guisong Liu and   
              Guangchun Luo and   
              Brighter Agyemang   Enforcing Affinity Feature Learning
                                  through Self-attention for Person
                                  Re-identification  . . . . . . . . . . . 16:1--16:22
                 Mengyan Li and   
               Zhaoyu Zhang and   
                Guochen Xie and   
                         Jun Yu   A Deep Learning Approach for Face
                                  Hallucination Guided by Facial Boundary
                                  Responses  . . . . . . . . . . . . . . . 17:1--17:23
                    Zan Gao and   
                 Yinming Li and   
                    Shaohua Wan   Exploring Deep Learning for View-Based
                                  $3$D Model Retrieval . . . . . . . . . . 18:1--18:21

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 16, Number 1s, April, 2020

            Shengping Zhang and   
                 Huiyu Zhou and   
                    Dong Xu and   
             M. Emre Celebi and   
               Thierry Bouwmans   Introduction to the Special Issue on
                                  Multimodal Machine Learning for Human
                                  Behavior Analysis  . . . . . . . . . . . 19:1--19:2
              Changyong Guo and   
              Zhaoxin Zhang and   
                Jinjiang Li and   
              Xuesong Jiang and   
                  Jun Zhang and   
                      Lei Zhang   Robust Visual Tracking Using Kernel
                                  Sparse Coding on Multiple Covariance
                                  Descriptors  . . . . . . . . . . . . . . 20:1--20:22
              Zhaoxin Zhang and   
              Changyong Guo and   
                Fanzhi Meng and   
                Taizhong Xu and   
                   Junkai Huang   CovLets: a Second-Order Descriptor for
                                  Modeling Multiple Features . . . . . . . 21:1--21:14
              Quanling Meng and   
                  Heyan Zhu and   
              Weigang Zhang and   
               Xuefeng Piao and   
                    Aijie Zhang   Action Recognition Using Form and Motion
                                  Modalities . . . . . . . . . . . . . . . 22:1--22:16
        Pourya Shamsolmoali and   
         Masoumeh Zareapoor and   
                 Huiyu Zhou and   
                       Jie Yang   AMIL: Adversarial Multi-instance
                                  Learning for Human Pose Estimation . . . 23:1--23:23
             Yueting Zhuang and   
                  Dejing Xu and   
                    Xin Yan and   
              Wenzhuo Cheng and   
                  Zhou Zhao and   
                Shiliang Pu and   
                       Jun Xiao   Multichannel Attention Refinement for
                                  Video Question Answering . . . . . . . . 24:1--24:23
           Aleksei Grigorev and   
                Shaohui Liu and   
               Zhihong Tian and   
              Jianxin Xiong and   
               Seungmin Rho and   
                     Jiang Feng   Delving Deeper in Drone-Based Person
                                  Re-Id by Employing Deep Decision Forest
                                  and Attributes Fusion  . . . . . . . . . 25:1--25:15
                  Zhaoju Li and   
               Zongwei Zhou and   
                  Nan Jiang and   
                Zhenjun Han and   
              Junliang Xing and   
                   Jianbin Jiao   Spatial Preserved Graph Convolution
                                  Networks for Person Re-identification    26:1--26:14
                   Hui Chen and   
              Guiguang Ding and   
                  Zijia Lin and   
               Sicheng Zhao and   
                Xiaopeng Gu and   
                 Wenyuan Xu and   
                    Jungong Han   ACMNet: Adaptive Confidence Matching
                                  Network for Human Behavior Analysis via
                                  Cross-modal Retrieval  . . . . . . . . . 27:1--27:21
                Anran Zhang and   
             Xiaolong Jiang and   
             Baochang Zhang and   
                    Xianbin Cao   Multi-scale Supervised Attentive
                                  Encoder--Decoder Network for Crowd
                                  Counting . . . . . . . . . . . . . . . . 28:1--28:20
                 M. Tanveer and   
                  P. Khanna and   
                  M. Prasad and   
                      C. T. Lin   Introduction to the Special Issue on
                                  Computational Intelligence for
                                  Biomedical Data and Imaging  . . . . . . 29:1--29:4
                 M. Tanveer and   
              B. Richhariya and   
                 R. U. Khan and   
               A. H. Rashid and   
                  P. Khanna and   
                  M. Prasad and   
                      C. T. Lin   Machine Learning Techniques for the
                                  Diagnosis of Alzheimer's Disease: a
                                  Review . . . . . . . . . . . . . . . . . 30:1--30:35
               Shweta Yadav and   
             Pralay Ramteke and   
                 Asif Ekbal and   
              Sriparna Saha and   
          Pushpak Bhattacharyya   Exploring Disorder-Aware Attention for
                                  Clinical Event Extraction  . . . . . . . 31:1--31:21
           Suvidha Tripathi and   
             Satish Kumar Singh   Cell Nuclei Classification in
                                  Histopathological Images using Hybrid O
                                  L ConvNet  . . . . . . . . . . . . . . . 32:1--32:22
                Nengjun Zhu and   
                   Jian Cao and   
                Kunwei Shen and   
              Xiaosong Chen and   
                       Siji Zhu   A Decision Support System with
                                  Intelligent Recommendation for
                                  Multi-disciplinary Medical Treatment . . 33:1--33:23
              Qingyong Wang and   
                   Yun Zhou and   
               Weiping Ding and   
               Zhiguo Zhang and   
              Khan Muhammad and   
                     Zehong Cao   Random Forest with Self-Paced Bootstrap
                                  Learning in Lung Cancer Prognosis  . . . 34:1--34:12
               Naveen Saini and   
              Sriparna Saha and   
      Pushpak Bhattacharyya and   
                Himanshu Tuteja   Textual Entailment-Based Figure
                                  Summarization for Biomedical Articles    35:1--35:24
                  Chao Tong and   
                Baoyu Liang and   
               Mengze Zhang and   
              Rongshan Chen and   
        Arun Kumar Sangaiah and   
               Zhigao Zheng and   
                    Tao Wan and   
               Chenyang Yue and   
                     Xinyi Yang   Pulmonary Nodule Detection Based on
                                  ISODATA-Improved Faster RCNN and
                                  $3$D-CNN with Focal Loss . . . . . . . . 36:1--36:9
            Utkarsh Agrawal and   
                Jatin Arora and   
                Rahul Singh and   
               Deepak Gupta and   
              Ashish Khanna and   
               Aditya Khamparia   Hybrid Wolf--Bat Algorithm for
                                  Optimization of Connection Weights in
                                  Multi-layer Perceptron . . . . . . . . . 37:1--37:20
         Ranjeet Kumar Rout and   
           Sk. Sarif Hassan and   
          Sanchit Sindhwani and   
          Hari Mohan Pandey and   
                    Saiyed Umer   Intelligent Classification and Analysis
                                  of Essential Genes Using Quantitative
                                  Methods  . . . . . . . . . . . . . . . . 38:1--38:21
               Hongyi Zhang and   
                Haoke Zhang and   
          Sandeep Pirbhulal and   
                 Wanqing Wu and   
  Victor Hugo C. De Albuquerque   Active Balancing Mechanism for
                                  Imbalanced Medical Data in Deep
                                  Learning-Based Classification Models . . 39:1--39:15

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 16, Number 2, June, 2020

         Shanthi Vellingiri and   
            Ryan P. McMahan and   
       Balakrishnan Prabhakaran   SCeVE: a Component-based Framework to
                                  Author Mixed Reality Tours . . . . . . . 40:1--40:23
                Jiaying Liu and   
                 Sijie Song and   
                Chunhui Liu and   
                 Yanghao Li and   
                       Yueyu Hu   A Benchmark Dataset and Comparison Study
                                  for Multi-modal Human Action Analytics   41:1--41:24
              Mingxing Duan and   
                   Kenli Li and   
               Aijia Ouyang and   
            Khin Nandar Win and   
                   Keqin Li and   
                        Qi Tian   EGroupNet: a Feature-enhanced Network
                                  for Age Estimation with Novel Age Group
                                  Schemes  . . . . . . . . . . . . . . . . 42:1--42:23
Abraham Báez-Suárez and   
                 Nolan Shah and   
 Juan Arturo Nolazco-Flores and   
        Shou-Hsuan S. Huang and   
          Omprakash Gnawali and   
                    Weidong Shi   SAMAF: Sequence-to-sequence Autoencoder
                                  Model for Audio Fingerprinting . . . . . 43:1--43:23
              Pascal Mettes and   
           Dennis C. Koelma and   
               Cees G. M. Snoek   Shuffled ImageNet Banks for Video Event
                                  Detection and Search . . . . . . . . . . 44:1--44:21
        Farzan Majeed Noori and   
            Michael Riegler and   
               Md Zia Uddin and   
                   Jim Torresen   Human Activity Recognition from Multiple
                                  Sensors Data Using Multi-fusion
                                  Representations and CNNs . . . . . . . . 45:1--45:19
               Silvia Rossi and   
              Cagri Ozcinar and   
              Aljosa Smolic and   
                     Laura Toni   Do Users Behave Similarly in VR?
                                  Investigation of the User Influence on
                                  the System Design  . . . . . . . . . . . 46:1--46:26
                  Xiao Wang and   
                     Wu Liu and   
                   Jun Chen and   
                Xiaobo Wang and   
              Chenggang Yan and   
                        Tao Mei   Listen, Look, and Find the One: Robust
                                  Person Search with Multimodality Index   47:1--47:20
                Xiaofan Luo and   
               Fukoeng Wong and   
                     Haifeng Hu   FIN: Feature Integrated Network for
                                  Object Detection . . . . . . . . . . . . 48:1--48:18
           Kutalmis Akpinar and   
                    Kien A. Hua   PPNet: Privacy Protected CDN--ISP
                                  Collaboration for QoS-aware Multi-CDN
                                  Adaptive Video Streaming . . . . . . . . 49:1--49:23
       Vishesh Kumar Tanwar and   
      Balasubramanian Raman and   
       Amitesh Singh Rajput and   
                  Rama Bhargava   CryptoLesion: a Privacy-preserving Model
                                  for Lesion Segmentation Using Whale
                                  Optimization over Cloud  . . . . . . . . 50:1--50:23
              Zhedong Zheng and   
                Liang Zheng and   
            Michael Garrett and   
                    Yi Yang and   
               Mingliang Xu and   
                   Yi-Dong Shen   Dual-path Convolutional Image-Text
                                  Embeddings with Instance Loss  . . . . . 51:1--51:23
              Xiaowen Huang and   
            Shengsheng Qian and   
                  Quan Fang and   
                 Jitao Sang and   
                  Changsheng Xu   Meta-path Augmented Sequential
                                  Recommendation with Contextual
                                  Co-attention Network . . . . . . . . . . 52:1--52:24
               Lingxiang Wu and   
                     Min Xu and   
            Shengsheng Qian and   
                    Jianwei Cui   Image to Modern Chinese Poetry Creation
                                  via a Constrained Topic-aware Model  . . 53:1--53:21
                 Zhili Zhou and   
          Q. M. Jonathan Wu and   
                 Yimin Yang and   
                   Xingming Sun   Region-Level Visual Consistency
                                  Verification for Large-Scale
                                  Partial-Duplicate Image Search . . . . . 54:1--54:25
                   Jiale He and   
                 Gaobo Yang and   
                    Xin Liu and   
                 Xiangling Ding   Spatio-temporal Saliency-based Motion
                                  Vector Refinement for Frame Rate
                                  Up-conversion  . . . . . . . . . . . . . 55:1--55:18
            Francesco Gelli and   
           Tiberio Uricchio and   
                Xiangnan He and   
          Alberto Del Bimbo and   
                  Tat-Seng Chua   Learning Visual Elements of Images for
                                  Discovery of Brand Posts . . . . . . . . 56:1--56:21
               Xian-Hua Han and   
             Yinqiang Zheng and   
                 Jiande Sun and   
                   Yen-Wei Chen   Hyperspectral Reconstruction with
                                  Redundant Camera Spectral Sensitivity
                                  Functions  . . . . . . . . . . . . . . . 57:1--57:15

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 16, Number 2s, July, 2020

                Honghao Gao and   
                   Yudong Zhang   Introduction to the Special Issue on
                                  Smart Communications and Networking for
                                  Future Video Surveillance  . . . . . . . 58:1--58:2
              Yizhang Jiang and   
                Xiaoqing Gu and   
               Dingcheng Ji and   
             Pengjiang Qian and   
                   Jing Xue and   
             Yuanpeng Zhang and   
                  Jiaqi Zhu and   
                Kaijian Xia and   
                   Shitong Wang   Smart Diagnosis: a Multiple-Source
                                  Transfer TSK Fuzzy System for EEG
                                  Seizure Identification . . . . . . . . . 59:1--59:21
              Shui-Hua Wang and   
                  Yu-Dong Zhang   DenseNet-201-Based Deep Neural Network
                                  with Composite Learning Factor and
                                  Precomputation for Multiple Sclerosis
                                  Classification . . . . . . . . . . . . . 60:1--60:19
                Kaijian Xia and   
              Hongsheng Yin and   
                   Yong Jin and   
                    Shi Qiu and   
                    Hongru Zhao   Cross-Domain Brain CT Image Smart
                                  Segmentation via Shared Hidden Space
                                  Transfer FCM Clustering  . . . . . . . . 61:1--61:21
                Yonggang Li and   
               Chunping Liu and   
                      Yi Ji and   
             Shengrong Gong and   
                      Haibao Xu   Spatio-Temporal Deep Residual Network
                                  with Hierarchical Attentions for Video
                                  Event Recognition  . . . . . . . . . . . 62:1--62:21
                     Wen Si and   
                   Cong Liu and   
                Zhongqin Bi and   
                   Meijing Shan   Modeling Long-Term Dependencies from
                                  Videos Using Deep Multiplicative Neural
                                  Networks . . . . . . . . . . . . . . . . 63:1--63:19
                  Suguo Zhu and   
              Xiaoxian Yang and   
                     Jun Yu and   
              Zhenying Fang and   
                  Meng Wang and   
                 Qingming Huang   Proposal Complementary Action Detection  64:1--64:12
               Chenxi Huang and   
                  Yisha Lan and   
               Guokai Zhang and   
                  Gaowei Xu and   
                Landu Jiang and   
               Nianyin Zeng and   
                Jenhong Tan and   
                E. Y. K. Ng and   
            Yongqiang Cheng and   
                Ningzhi Han and   
                Rongrong Ji and   
                  Yonghong Peng   A New Transfer Function for Volume
                                  Visualization of Aortic Stent and Its
                                  Application to Virtual Endoscopy . . . . 65:1--65:14
               Michael Zink and   
                 Laura Toni and   
                   Ali C. Begen   Introduction to the Best Papers from the
                                  ACM Multimedia Systems (MMSys) 2019 and
                                  Co-Located Workshops . . . . . . . . . . 66:1--66:2
             Rui-Xiao Zhang and   
                    Ming Ma and   
              Tianchi Huang and   
               Haitian Pang and   
                    Xin Yao and   
                Chenglei Wu and   
                     Lifeng Sun   A Practical Learning-based Approach for
                                  Viewer Scheduling in the Crowdsourced
                                  Live Streaming . . . . . . . . . . . . . 67:1--67:22
             Sa'di Altamimi and   
          Shervin Shirmohammadi   QoE-Fair DASH Video Streaming Using
                                  Server-side Reinforcement Learning . . . 68:1--68:21
          Abdelhak Bentaleb and   
         Christian Timmerer and   
               Ali C. Begen and   
               Roger Zimmermann   Performance Analysis of ACTE: a
                                  Bandwidth Prediction Method for
                                  Low-latency Chunked Streaming  . . . . . 69:1--69:24
                Stefan Pham and   
             Patrick Heeren and   
             Calvin Schmidt and   
             Daniel Silhavy and   
              Stefan Arbanowski   Evaluation of Shared Resource Allocation
                                  Using SAND for ABR Streaming . . . . . . 70:1--70:18
            Craig Gutterman and   
              Katherine Guo and   
              Sarthak Arora and   
             Trey Gilliland and   
              Xiaoyang Wang and   
                     Les Wu and   
         Ethan Katz-Bassett and   
                    Gil Zussman   Requet: Real-Time QoE Metric Detection
                                  for Encrypted YouTube Traffic  . . . . . 71:1--71:28
                  Xinjue Hu and   
              Jingming Shan and   
                     Yu Liu and   
                  Lin Zhang and   
          Shervin Shirmohammadi   An Adaptive Two-Layer Light Field
                                  Compression Scheme Using GNN-Based
                                  Reconstruction . . . . . . . . . . . . . 72:1--72:23
              Mark Claypool and   
              Andy Cockburn and   
                    Carl Gutwin   The Impact of Motion and Delay on
                                  Selecting Game Targets with a Mouse  . . 73:1--73:24

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 16, Number 3, September, 2020

                      Anonymous   Table of Contents: Online Supplement
                                  Volume 16, Number 1s . . . . . . . . . . 74:1--74:5
                 Liang Yang and   
                 Haifeng Hu and   
              Songlong Xing and   
                     Xinlong Lu   Constrained LSTM and Residual Attention
                                  for Image Captioning . . . . . . . . . . 75:1--75:18
               Donghuo Zeng and   
                      Yi Yu and   
                    Keizo Oyama   Deep Triplet Neural Networks with
                                  Cluster-CCA for Audio-Visual Cross-Modal
                                  Retrieval  . . . . . . . . . . . . . . . 76:1--76:23
                 Yu-Ting Su and   
                 Wen-Hui Li and   
                Wei-Zhi Nie and   
                      An-An Liu   Multi-View Graph Matching for $3$D Model
                                  Retrieval  . . . . . . . . . . . . . . . 77:1--77:20
                   Hehe Fan and   
                Linchao Zhu and   
                    Yi Yang and   
                         Fei Wu   Recurrent Attention Network with
                                  Reinforced Generator for Visual Dialog   78:1--78:16
               Feiran Huang and   
                 Kaimin Wei and   
                  Jian Weng and   
                     Zhoujun Li   Attention-Based Modality-Gated Networks
                                  for Image-Text Sentiment Analysis  . . . 79:1--79:19
              Shangfei Wang and   
                Longfei Hao and   
                       Qiang Ji   Posed and Spontaneous Expression
                                  Distinction Using Latent Regression
                                  Bayesian Networks  . . . . . . . . . . . 80:1--80:18
                 Fangyu Liu and   
         Rémi Lebret and   
                Didier Orel and   
            Philippe Sordet and   
                    Karl Aberer   Upgrading the Newsroom: an Automated
                                  Image Selection System for News Articles 81:1--81:28
                 Chenlei Lv and   
                 Zhongke Wu and   
                Xingce Wang and   
                  Mingquan Zhou   $3$D Facial Similarity Measurement and
                                  Its Application in Facial Organization   82:1--82:20
                   Jin Yuan and   
                  Lei Zhang and   
                Songrui Guo and   
                    Yi Xiao and   
                     Zhiyong Li   Image Captioning with a Joint Attention
                                  Mechanism by Visual Concept Samples  . . 83:1--83:22
                   Xun Wang and   
                   Yan Tian and   
                 Xuran Zhao and   
                   Tao Yang and   
           Judith Gelernter and   
                Jialei Wang and   
               Guohua Cheng and   
                         Wei Hu   Improving Multiperson Pose Estimation by
                                  Mask-aware Deep Reinforcement Learning   84:1--84:18
              Shenming Feng and   
                     Haifeng Hu   Learning Joint Structure for Human Pose
                                  Estimation . . . . . . . . . . . . . . . 85:1--85:17
                   Feng Lin and   
                     Bin Li and   
               Wengang Zhou and   
                Houqiang Li and   
                         Yan Lu   Single-stage Instance Segmentation . . . 86:1--86:19
             Shuqiang Jiang and   
                Weiqing Min and   
              Yongqiang Lyu and   
                      Linhu Liu   Few-shot Food Recognition via Multi-view
                                  Representation Learning  . . . . . . . . 87:1--87:20
               Trang-Thi Ho and   
       John Jethro Virtusio and   
              Yung-Yao Chen and   
              Chih-Ming Hsu and   
                   Kai-Lung Hua   Sketch-guided Deep Portrait Generation   88:1--88:18
           Gargi Srivastava and   
              Rajeev Srivastava   Design, Analysis, and Implementation of
                                  Efficient Framework for Image Annotation 89:1--89:24
             Dongyang Zhang and   
                   Jie Shao and   
                  Heng Tao Shen   Kernel Attention Network for Single
                                  Image Super-Resolution . . . . . . . . . 90:1--90:15
                  Yutao Liu and   
                      Ke Gu and   
                     Xiu Li and   
                 Yongbing Zhang   Blind Image Quality Assessment by
                                  Natural Scene Statistics and Perceptual
                                  Characteristics  . . . . . . . . . . . . 91:1--91:91
              Jobin Francis and   
                 Baburaj M. and   
              Sudhish N. George   A Unified Tensor Framework for
                                  Clustering and Simultaneous
                                  Reconstruction of Incomplete Imaging
                                  Data . . . . . . . . . . . . . . . . . . 92:1--92:24

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 16, Number 3s, January, 2021

               Suraj Sharma and   
                Xuyun Zhang and   
            Hesham El-Sayed and   
                    Zhiyuan Tan   Introduction to the Special Issue on
                                  Privacy and Security in Evolving
                                  Internet of Multimedia Things  . . . . . 93:1--93:3
                Xiaolong Xu and   
                 Qihe Huang and   
                Yiwen Zhang and   
                Shancang Li and   
                Lianyong Qi and   
                    Wanchun Dou   An LSH-based Offloading Method for IoMT
                                  Services in Integrated Cloud-Edge
                                  Environment  . . . . . . . . . . . . . . 94:1--94:19
          Nicholaus J. Gati and   
           Laurence T. Yang and   
                   Jun Feng and   
                   Yijun Mo and   
                  Mamoun Alazab   Differentially Private Tensor Train Deep
                                  Computation for Internet of Multimedia
                                  Things . . . . . . . . . . . . . . . . . 95:1--95:20
               Haoran Liang and   
                     Jun Wu and   
                   Xi Zheng and   
              Mengshi Zhang and   
                 Jianhua Li and   
                Alireza Jolfaei   Fog-based Secure Service Discovery for
                                  Internet of Multimedia Things: a
                                  Cross-blockchain Approach  . . . . . . . 96:1--96:23
                  Zhihan Lv and   
                 Liang Qiao and   
                   Houbing Song   Analysis of the Security of Internet of
                                  Multimedia Things  . . . . . . . . . . . 97:1--97:16
         Kshira Sagar Sahoo and   
                  Deepak Puthal   SDN-Assisted DDoS Defense Framework for
                                  the Internet of Multimedia Things  . . . 98:1--98:18
            Suyel Namasudra and   
          Rupak Chakraborty and   
          Abhishek Majumder and   
         Nageswara Rao Moparthi   Securing Multimedia by Using DNA-Based
                                  Encryption in the Cloud Computing
                                  Environment  . . . . . . . . . . . . . . 99:1--99:19
                Liming Fang and   
              Changchun Yin and   
                 Juncen Zhu and   
                Chunpeng Ge and   
                 M. Tanveer and   
            Alireza Jolfaei and   
                     Zehong Cao   Privacy Protection for Medical Data
                                  Sharing in Smart Healthcare  . . . . . . 100:1--100:18
                    A. K. Singh   Data Hiding: Current Trends, Innovation
                                  and Potential Challenges . . . . . . . . 101:1--101:16
                  Hezhen Hu and   
               Wengang Zhou and   
                  Xingze Li and   
                   Ning Yan and   
                    Houqiang Li   MV2Flow: Learning Motion Representation
                                  for Fast Compressed Video Action
                                  Recognition  . . . . . . . . . . . . . . 102:1--102:19
                Chaoran Cui and   
               Peiguang Lin and   
                Xiushan Nie and   
                 Muwei Jian and   
                     Yilong Yin   Social-sensed Image Aesthetics
                                  Assessment . . . . . . . . . . . . . . . 103:1--103:19

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 16, Number 4, January, 2021

                   Suraj Sharma   Table of Contents: Online Supplement
                                  Volume 16, Number 3s . . . . . . . . . . 117e-1:117e-2
                 Huiru Shao and   
                    Jing Li and   
                  Jia Zhang and   
                     Hui Yu and   
                     Jiande Sun   Eye-based Recognition for User
                                  Identification on Mobile Devices . . . . 117:1--117:19
                 Zuquan Liu and   
                  Guopu Zhu and   
              Yuan-Gen Wang and   
              Jianquan Yang and   
                      Sam Kwong   A Novel $ (t, s, k, n)$-Threshold Visual
                                  Secret Sharing Scheme Based on Access
                                  Structure Partition  . . . . . . . . . . 118:1--118:21
         Federico Becattini and   
           Tiberio Uricchio and   
          Lorenzo Seidenari and   
            Lamberto Ballan and   
              Alberto Del Bimbo   Am I Done? Predicting Action Progress in
                                  Videos . . . . . . . . . . . . . . . . . 119:1--119:24
               Weijian Ruan and   
                 Chao Liang and   
                      Yi Yu and   
                 Zheng Wang and   
                     Wu Liu and   
                   Jun Chen and   
                       Jiayi Ma   Correlation Discrepancy Insight Network
                                  for Video Re-identification  . . . . . . 120:1--120:21
                   Xin Yang and   
                    Yu Qiao and   
               Shaozhe Chen and   
               Shengfeng He and   
                 Baocai Yin and   
                Qiang Zhang and   
               Xiaopeng Wei and   
               Rynson W. H. Lau   Smart Scribbles for Image Matting  . . . 121:1--121:21
              Chenggang Yan and   
                Zhisheng Li and   
             Yongbing Zhang and   
                  Yutao Liu and   
               Xiangyang Ji and   
                 Yongdong Zhang   Depth Image Denoising Using Nuclear Norm
                                  and Learning Graph Model . . . . . . . . 122:1--122:17
                    Lin Zhu and   
              Xiurong Jiang and   
                 Jianing Li and   
               Yuanhong Hao and   
                  Yonghong Tian   Motion-Aware Structured Matrix
                                  Factorization for Foreground Detection
                                  in Complex Scenes  . . . . . . . . . . . 123:1--123:23
                   Yang Wei and   
                Zhuzhu Wang and   
                   Bin Xiao and   
                 Ximeng Liu and   
                  Zheng Yan and   
                    Jianfeng Ma   Controlling Neural Learning Network with
                                  Multiple Scales for Image Splicing
                                  Forgery Detection  . . . . . . . . . . . 124:1--124:22
                   Kun Zeng and   
              Jiangchuan Hu and   
                Yongyi Gong and   
      Kanoksak Wattanachote and   
                 Runpeng Yu and   
                    Xiaonan Luo   Vertical Retargeting for Stereoscopic
                                  Images via Stereo Seam Carving . . . . . 125:1--125:22
                   Tao Tian and   
                 Hanli Wang and   
                  Sam Kwong and   
                  C.-C. Jay Kuo   Perceptual Image Compression with
                                  Block-Level Just Noticeable Difference
                                  Prediction . . . . . . . . . . . . . . . 126:1--126:15
                     Xin He and   
                  Qiong Liu and   
                       You Yang   Make Full Use of Priors: Cross-View
                                  Optimized Filter for Multi-View Depth
                                  Enhancement  . . . . . . . . . . . . . . 127:1--127:19
               Xiaoxiao Liu and   
                    Qingyang Xu   Adaptive Attention-based High-level
                                  Semantic Introduction for Image Caption  128:1--128:22
      Muhammad Abu Ul Fazal and   
               Sam Ferguson and   
                Andrew Johnston   Evaluation of Information Comprehension
                                  in Concurrent Speech-based Designs . . . 129:1--129:19
                Yucheng Zhu and   
              Guangtao Zhai and   
               Xiongkuo Min and   
                   Jiantao Zhou   Learning a Deep Agent to Predict Head
                                  Movement in 360-Degree Images  . . . . . 130:1--130:23
                 Weizhi Nie and   
                   Qi Liang and   
                 Yixin Wang and   
                   Xing Wei and   
                      Yuting Su   MMFN: Multimodal Information Fusion
                                  Networks for $3$D Model Classification
                                  and Retrieval  . . . . . . . . . . . . . 131:1--131:22
             Zhongying Zhao and   
               Yonghao Yang and   
                    Chao Li and   
                    Liqiang Nie   GuessUNeed: Recommending Courses via
                                  Neural Attention Network and Course
                                  Prerequisite Relation Embeddings . . . . 132:1--132:17
                   Yi Huang and   
              Xiaoshan Yang and   
                  Junyu Gao and   
                 Jitao Sang and   
                  Changsheng Xu   Knowledge-driven Egocentric Multimodal
                                  Activity Recognition . . . . . . . . . . 133:1--133:133
                   Yaoyu Li and   
                 Hantao Yao and   
              Tianzhu Zhang and   
                  Changsheng Xu   Part-based Structured Representation
                                  Learning for Person Re-identification    134:1--134:22

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 17, Number 1, April, 2021

                    Xin Jin and   
                Jianfeng Xu and   
            Kazuyuki Tasaka and   
                     Zhibo Chen   Multi-task Learning-based All-in-one
                                  Collaboration Framework for Degraded
                                  Image Super-resolution . . . . . . . . . 21:1--21:21
           Huyen T. T. Tran and   
              Nam Pham Ngoc and   
       Tobias Hoßfeld and   
            Michael Seufert and   
              Truong Cong Thang   Cumulative Quality Modeling for HTTP
                                  Adaptive Streaming . . . . . . . . . . . 22:1--22:24
                    Tong Xu and   
                Peilun Zhou and   
                 Linkang Hu and   
                Xiangnan He and   
                     Yao Hu and   
                    Enhong Chen   Socializing the Videos: a Multimodal
                                  Approach for Social Relation Recognition 23:1--23:23
                  Xuehu Yan and   
                 Lintao Liu and   
                Longlong Li and   
                     Yuliang Lu   Robust Secret Image Sharing Resistant to
                                  Noise in Shares  . . . . . . . . . . . . 24:1--24:22
               Mingliang Xu and   
                Qingfeng Li and   
                Jianwei Niu and   
                     Hao Su and   
                 Xiting Liu and   
                  Weiwei Xu and   
                     Pei Lv and   
                  Bing Zhou and   
                        Yi Yang   ART-UP: a Novel Method for Generating
                                  Scanning-Robust Aesthetic QR Codes . . . 25:1--25:23
                Peihao Yang and   
                Linghe Kong and   
                Meikang Qiu and   
                    Xue Liu and   
                    Guihai Chen   Compressed Imaging Reconstruction with
                                  Sparse Random Projection . . . . . . . . 26:1--26:25
                     Lei Qi and   
                   Lei Wang and   
                   Jing Huo and   
               Yinghuan Shi and   
                       Yang Gao   GreyReID: a Novel Two-stream Deep
                                  Framework with RGB-grey Information for
                                  Person Re-identification . . . . . . . . 27:1--27:22
          Said Chehabeddine and   
      Muhammad Hassan Jamil and   
                Wanjoo Park and   
             Dianne L. Sefo and   
            Peter M. Loomer and   
                    Mohamad Eid   Bi-manual Haptic-based Periodontal
                                  Simulation with Finger Support and
                                  Vibrotactile Feedback  . . . . . . . . . 28:1--28:17
                 Jianshu Li and   
                  Jian Zhao and   
               Congyan Lang and   
                  Yidong Li and   
                Yunchao Wei and   
                Guodong Guo and   
                Terence Sim and   
              Shuicheng Yan and   
                    Jiashi Feng   Multi-human Parsing with a Graph-based
                                  Generative Adversarial Model . . . . . . 29:1--29:21
                Yusuf Cinar and   
                Peter Pocta and   
           Desmond Chambers and   
                    Hugh Melvin   Improved Jitter Buffer Management for
                                  WebRTC . . . . . . . . . . . . . . . . . 30:1--30:20
           Lukasz Czekierda and   
      Krzysztof Zieli\'nski and   
          S\lawomir Zieli\'nski   Automated Orchestration of Online
                                  Educational Collaboration in Cloud-based
                                  Environments . . . . . . . . . . . . . . 31:1--31:26
                    My Kieu and   
         Andrew D. Bagdanov and   
                  Marco Bertini   Bottom-up and Layerwise Domain
                                  Adaptation for Pedestrian Detection in
                                  Thermal Images . . . . . . . . . . . . . 32:1--32:19
                Wenjie Wang and   
               Ling-Yu Duan and   
                  Hao Jiang and   
              Peiguang Jing and   
               Xuemeng Song and   
                    Liqiang Nie   Market$2$Dish: Health-aware Food
                                  Recommendation . . . . . . . . . . . . . 33:1--33:19
                 Yiding Liu and   
                  Siyu Yang and   
                     Bin Li and   
               Wengang Zhou and   
                 Jizheng Xu and   
                Houqiang Li and   
                         Yan Lu   Affinity Derivation for Accurate
                                  Instance Segmentation  . . . . . . . . . 34:1--34:20
                      Yi Yu and   
        Abhishek Srivastava and   
                  Simon Canales   Conditional LSTM-GAN for Melody
                                  Generation from Lyrics . . . . . . . . . 35:1--35:20
                   Xin Yang and   
               Xuemeng Song and   
                  Fuli Feng and   
                 Haokun Wen and   
               Ling-Yu Duan and   
                    Liqiang Nie   Attribute-wise Explainable Fashion
                                  Compatibility Modeling . . . . . . . . . 36:1--36:21
                  Zhixin Li and   
                    Lan Lin and   
              Canlong Zhang and   
                 Huifang Ma and   
              Weizhong Zhao and   
                    Zhiping Shi   A Semi-supervised Learning Approach
                                  Based on Adaptive Weighted Fusion for
                                  Automatic Image Annotation . . . . . . . 37:1--37:23
                 Yanwei Liu and   
                 Jinxia Liu and   
          Antonios Argyriou and   
                   Siwei Ma and   
                Liming Wang and   
                        Zhen Xu   $ 360$-Degree VR Video Watermarking
                                  Based on Spherical Wavelet Transform . . 38:1--38:23

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 17, Number 1s, March, 2021

                  Yang Wang and   
                  Meng Fang and   
           Joey Tianyi Zhou and   
                Tingting Mu and   
                    Dacheng Tao   Introduction to Big Multimodal
                                  Multimedia Data with Deep Analytics  . . 1:1--1:3
                    Xing Xu and   
                Jialin Tian and   
                  Kaiyi Lin and   
                  Huimin Lu and   
                   Jie Shao and   
                  Heng Tao Shen   Zero-shot Cross-modal Retrieval by
                                  Assembling AutoEncoder and Generative
                                  Adversarial Network  . . . . . . . . . . 3:1--3:17
                  Sichao Fu and   
                Weifeng Liu and   
                 Weili Guan and   
                Yicong Zhou and   
                 Dapeng Tao and   
                  Changsheng Xu   Dynamic Graph Learning Convolutional
                                  Networks for Semi-supervised
                                  Classification . . . . . . . . . . . . . 4:1--4:13
                 Zhao Zhang and   
                Jiahuan Ren and   
               Haijun Zhang and   
                Zheng Zhang and   
               Guangcan Liu and   
                  Shuicheng Yan   DLRF-Net: a Progressive Deep Latent
                                  Low-Rank Fusion Network for Hierarchical
                                  Subspace Discovery . . . . . . . . . . . 5:1--5:24
                   Yi Zhang and   
                Miaomiao Li and   
                 Siwei Wang and   
                   Sisi Dai and   
                    Lei Luo and   
                     En Zhu and   
                 Huiying Xu and   
               Xinzhong Zhu and   
                Chaoyun Yao and   
                    Haoran Zhou   Gaussian Mixture Model Clustering with
                                  Incomplete Data  . . . . . . . . . . . . 6:1--6:14
                 Jing Zhang and   
                  Jiaqi Guo and   
                   Yonggong Ren   Robust Ordinal Regression: User Credit
                                  Grading with Triplet Loss-Based Sampling 7:1--7:20
                     Xin Xu and   
                Shiqin Wang and   
                 Zheng Wang and   
             Xiaolong Zhang and   
                      Ruimin Hu   Exploring Image Enhancement for Salient
                                  Object Detection in Low Light Images . . 8:1--8:19
                 Yanchun Li and   
              Jianglian Cao and   
                  Zhetao Li and   
                Sangyoon Oh and   
               Nobuyoshi Komuro   Lightweight Single Image
                                  Super-resolution with Dense Connection
                                  Distillation Network . . . . . . . . . . 9:1--9:17
                      Yang Wang   Survey on Deep Multi-modal Data
                                  Analytics: Collaboration, Rivalry, and
                                  Fusion . . . . . . . . . . . . . . . . . 10:1--10:25
                  Yang Wang and   
                  Meng Fang and   
           Joey Tianyi Zhou and   
                Tingting Mu and   
                    Dacheng Tao   Introduction to the Special Issue on
                                  Fine-grained Visual Computing  . . . . . 11:1--11:3
                   Yutao Hu and   
                  Xuhui Liu and   
             Baochang Zhang and   
                Jungong Han and   
                    Xianbin Cao   Alignment Enhancement Network for
                                  Fine-grained Visual Categorization . . . 12:1--12:20
                 Weili Guan and   
             Zhaozheng Chen and   
                  Fuli Feng and   
                Weifeng Liu and   
                    Liqiang Nie   Urban Perception: Sensing Cities via a
                                  Deep Interactive Multi-task Learning
                                  Framework  . . . . . . . . . . . . . . . 13:1--13:20
                  Huimin Lu and   
                   Rui Yang and   
              Zhenrong Deng and   
              Yonglin Zhang and   
               Guangwei Gao and   
                      Rushi Lan   Chinese Image Captioning via Fuzzy
                                  Attention-based DenseNet-BiLSTM  . . . . 14:1--14:18
              Junsheng Xiao and   
                   Huahu Xu and   
                Honghao Gao and   
                Minjie Bian and   
                        Yang Li   A Weakly Supervised Semantic
                                  Segmentation Network by Aggregating Seed
                                  Cues: The Multi-Object Proposal
                                  Generation Perspective . . . . . . . . . 15:1--15:19
                 Chao Zhang and   
                 Xiaopei Wu and   
                Jianchao Lu and   
                   Xi Zheng and   
            Alireza Jolfaei and   
              Quan Z. Sheng and   
                     Dongjin Yu   RICA-MD: a Refined ICA Algorithm for
                                  Motion Detection . . . . . . . . . . . . 17:1--17:17
            MD Abdur Rahman and   
          M. Shamim Hossain and   
           Nabil A. Alrajeh and   
                    B. B. Gupta   A Multimodal, Multimedia Point-of-Care
                                  Deep Learning Framework for COVID-19
                                  Diagnosis  . . . . . . . . . . . . . . . 18:1--18:24
                  Yidong Li and   
                 Wenhua Liu and   
                     Yi Jin and   
                Yuanzhouhan Cao   SPGAN: Face Forgery Using Spoofing
                                  Generative Adversarial Networks  . . . . 19:1--19:20
                Lianyong Qi and   
               Houbing Song and   
                Xuyun Zhang and   
          Gautam Srivastava and   
                Xiaolong Xu and   
                        Shui Yu   Compatibility-Aware Web API
                                  Recommendation for Mashup Creation via
                                  Textual Description Mining . . . . . . . 20:1--20:19

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 17, Number 2, June, 2021

         Prabhakar Krishnan and   
            Kurunandan Jain and   
         Pramod George Jose and   
      Krishnashree Achuthan and   
                 Rajkumar Buyya   SDN Enabled QoE and Security Framework
                                  for Multimedia Applications in 5G
                                  Networks . . . . . . . . . . . . . . . . 39:1--39:29
           S. Sambath Kumar and   
                    M. Nandhini   Entropy Slicing Extraction and Transfer
                                  Learning Classification for Early
                                  Diagnosis of Alzheimer Diseases with
                                  sMRI . . . . . . . . . . . . . . . . . . 40:1--40:22
                Xiaolong Xu and   
                 Zijie Fang and   
                Lianyong Qi and   
                Xuyun Zhang and   
                   Qiang He and   
                  Xiaokang Zhou   TripRes: Traffic Flow Prediction Driven
                                  Resource Reservation for Multimedia IoV
                                  with Edge Computing  . . . . . . . . . . 41:1--41:21
                  Wei Liang and   
                  Jing Long and   
              Kuan-Ching Li and   
                  Jianbo Xu and   
                  Nanjun Ma and   
                        Xia Lei   A Fast Defogging Image Recognition
                                  Algorithm Based on Bilateral Hybrid
                                  Filtering  . . . . . . . . . . . . . . . 42:1--42:16
                  Chao Tong and   
               Mengze Zhang and   
                  Chao Lang and   
                   Zhigao Zheng   An Image Privacy Protection Algorithm
                                  Based on Adversarial Perturbation
                                  Generative Networks  . . . . . . . . . . 43:1--43:14
                  Yunfei Fu and   
               Hongchuan Yu and   
               Chih-Kuo Yeh and   
               Tong-Yee Lee and   
                  Jian J. Zhang   Fast Accurate and Automatic Brushstroke
                                  Extraction . . . . . . . . . . . . . . . 44:1--44:24
                 Mythili K. and   
                Manish Narwaria   Assessment of Machine Learning-Based
                                  Audiovisual Quality Predictors: Why
                                  Uncertainty Matters  . . . . . . . . . . 45:1--45:22
                 Kenta Hama and   
          Takashi Matsubara and   
             Kuniaki Uehara and   
                    Jianfei Cai   Exploring Uncertainty Measures for
                                  Image-caption Embedding-and-retrieval
                                  Task . . . . . . . . . . . . . . . . . . 46:1--46:19
          Phuong-Anh Nguyen and   
                  Chong-Wah Ngo   Interactive Search vs. Automatic Search:
                                  an Extensive Study on Video Retrieval    47:1--47:24
                    Yang Li and   
               Guangcan Liu and   
                  Yubao Sun and   
               Qingshan Liu and   
                 Shengyong Chen   $3$D Tensor Auto-encoder with
                                  Application to Video Compression . . . . 48:1--48:18
              Abbas Mehrabi and   
            Matti Siekkinen and   
Teemu Kämäräinen and   
Antti Ylä-Jääski   Multi-Tier CloudVR: Leveraging Edge
                                  Computing in Remote Rendered Virtual
                                  Reality  . . . . . . . . . . . . . . . . 49:1--49:24
                     Lu Sun and   
           Hussein Al Osman and   
                    Jochen Lang   An Augmented Reality Online Assistance
                                  Platform for Repair Tasks  . . . . . . . 50:1--50:23
                 Meiqi Zhao and   
              Jianmin Zheng and   
                   Elvis S. Liu   Server Allocation for Massively
                                  Multiplayer Online Cloud Games Using
                                  Evolutionary Optimization  . . . . . . . 51:1--51:23
                Haiyang Wei and   
                  Zhixin Li and   
             Feicheng Huang and   
              Canlong Zhang and   
                 Huifang Ma and   
                   Zhongzhi Shi   Integrating Scene Semantic Knowledge
                                  into Image Captioning  . . . . . . . . . 52:1--52:22
               Shikha Gupta and   
             Krishan Sharma and   
        Dileep Aroor Dinesh and   
           Veena Thenkanidiyoor   Visual Semantic-Based Representation
                                  Learning Using Deep CNNs for Scene
                                  Recognition  . . . . . . . . . . . . . . 53:1--53:24
            Chun-ying Huang and   
             Yun-chen Cheng and   
           Guan-zhang Huang and   
             Ching-ling Fan and   
                 Cheng-hsin Hsu   On the Performance Comparisons of Native
                                  and Clientless Real-Time Screen-Sharing
                                  Technologies . . . . . . . . . . . . . . 54:1--54:26
                   Xin Yang and   
               Zongliang Ma and   
                  Letian Yu and   
                   Ying Cao and   
                 Baocai Yin and   
               Xiaopeng Wei and   
                Qiang Zhang and   
               Rynson W. H. Lau   Automatic Comic Generation with
                                  Stylistic Multi-page Layouts and
                                  Emotion-driven Text Balloon Generation   55:1--55:19
        Prasen Kumar Sharma and   
                Sujoy Ghosh and   
                     Arijit Sur   High-quality Frame Recurrent Video
                                  De-raining with Multi-contextual
                                  Adversarial Network  . . . . . . . . . . 56:1--56:24
              Xiangyuan Lan and   
                 Zifei Yang and   
                  Wei Zhang and   
                   Pong C. Yuen   Spatial-temporal Regularized
                                  Multi-modality Correlation Filters for
                                  Tracking with Re-detection . . . . . . . 57:1--57:16

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 17, Number 2s, June, 2021

           Amit Kumar Singh and   
                  Zhihan Lv and   
                        Hoon Ko   Introduction to the Special Issue on
                                  Recent Trends in Medical Data Security
                                  for e-Health Applications  . . . . . . . 58:1--58:3
                A. K. Singh and   
                   A. Anand and   
                      Z. Lv and   
                      H. Ko and   
                       A. Mohan   A Survey on Healthcare Data: a Security
                                  Perspective  . . . . . . . . . . . . . . 59:1--59:26
                Hongjiao Wu and   
      Ashutosh Dhar Dwivedi and   
              Gautam Srivastava   Security and Privacy of Patient
                                  Information in Medical Systems Based on
                                  Blockchain Technology  . . . . . . . . . 60:1--60:17
                  Ting Wang and   
                Xiangjun Ji and   
                 Aiguo Song and   
              Kurosh Madani and   
               Amine Chohra and   
                  Huimin Lu and   
                   Ramon Monero   Output-Bounded and RBFNN-Based Position
                                  Tracking and Adaptive Force Control for
                                  Security Tele-Surgery  . . . . . . . . . 61:1--61:15
            Lamya Alkhariji and   
              Nada Alhirabi and   
       Mansour Naser Alraja and   
           Mahmoud Barhamgi and   
                  Omer Rana and   
                 Charith Perera   Synthesising Privacy by Design Knowledge
                                  Toward Explainable Internet of Things
                                  Application Designing in Healthcare  . . 62:1--62:29
                 M. Tanveer and   
                Tarun Gupta and   
                 Miten Shah and   
For the Alzheimer's Disease Neuroimaging Initiative   Pinball Loss Twin Support Vector
                                  Clustering . . . . . . . . . . . . . . . 63:1--63:23
           Amiya Kumar Sahu and   
               Suraj Sharma and   
                  Deepak Puthal   Lightweight Multi-party Authentication
                                  and Key Agreement Protocol in IoT-based
                                  E-Healthcare Service . . . . . . . . . . 64:1--64:20
       Amitesh Singh Rajput and   
       Vishesh Kumar Tanwar and   
          Balasubramanian Raman   -Score-Based Secure Biomedical Model for
                                  Effective Skin Lesion Segmentation Over
                                  eHealth Cloud  . . . . . . . . . . . . . 65:1--65:19
               Ashima Singh and   
           Arwinder Dhillon and   
               Neeraj Kumar and   
          M. Shamim Hossain and   
            Ghulam Muhammad and   
                    Manoj Kumar   eDiaPredict: an Ensemble-based Framework
                                  for Diabetes Prediction  . . . . . . . . 66:1--66:26
                Flora Amato and   
           Valentina Casola and   
         Giovanni Cozzolino and   
   Alessandra De Benedictis and   
            Nicola Mazzocca and   
              Francesco Moscato   A Security and Privacy Validation
                                  Methodology for e-Health Systems . . . . 67:1--67:22
               Harsh Kasyap and   
              Somanath Tripathy   Privacy-preserving Decentralized
                                  Learning Framework for Healthcare System 68:1--68:24
        Pourya Shamsolmoali and   
                 Ruili Wang and   
                    A. H. Sadka   Introduction to the Special Issue on
                                  Advanced Approaches for Multiple
                                  Instance Learning on Multimedia
                                  Applications . . . . . . . . . . . . . . 69:1--69:2
                    Ruyi Ji and   
                   Zeyu Liu and   
                 Libo Zhang and   
                Jianwei Liu and   
                    Xin Zuo and   
                  Yanjun Wu and   
                  Chen Zhao and   
               Haofeng Wang and   
                       Lin Yang   Multi-peak Graph-based Multi-instance
                                  Learning for Weakly Supervised Object
                                  Detection  . . . . . . . . . . . . . . . 70:1--70:21
               Yaoling Ding and   
               Liehuang Zhu and   
                    An Wang and   
                    Yuan Li and   
              Yongjuan Wang and   
               Siu Ming Yiu and   
                       Keke Gai   A Multiple Sieve Approach Based on
                                  Artificial Intelligent Techniques and
                                  Correlation Power Analysis . . . . . . . 71:1--71:21
                 Wanting Ji and   
                     Ruili Wang   A Multi-instance Multi-label Dual
                                  Learning Approach for Video Captioning   72:1--72:18
         Masoumeh Zareapoor and   
                       Jie Yang   Equivariant Adversarial Network for
                                  Image-to-image Translation . . . . . . . 73:1--73:14
        Mazin Abed Mohammed and   
           Mohamed Elhoseny and   
  Karrar Hameed Abdulkareem and   
          Salama A. Mostafa and   
              Mashael S. Maashi   A Multi-agent Feature Selection and
                                  Hybrid Classification Model for
                                  Parkinson's Disease Diagnosis  . . . . . 74:1--74:22
                      Na An and   
                     Wei Qi Yan   Multitarget Tracking Using Siamese
                                  Neural Networks  . . . . . . . . . . . . 75:1--75:16
             Xiaochuan Tang and   
                Mingzhe Liu and   
                  Hao Zhong and   
                Yuanzhen Ju and   
                   Weile Li and   
                       Qiang Xu   MILL: Channel Attention-based Deep
                                  Multiple Instance Learning for Landslide
                                  Recognition  . . . . . . . . . . . . . . 76:1--76:11

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 17, Number 3, August, 2021

                     Yue Li and   
                     Yan Yi and   
                   Dong Liu and   
                      Li Li and   
                     Zhu Li and   
                    Houqiang Li   Neural-Network-Based Cross-Channel Intra
                                  Prediction . . . . . . . . . . . . . . . 77:1--77:23
               Zhandong Liu and   
               Wengang Zhou and   
                    Houqiang Li   MFECN: Multi-level Feature Enhanced
                                  Cumulative Network for Scene Text
                                  Detection  . . . . . . . . . . . . . . . 78:1--78:22
                Xingbo Dong and   
               Soohyong Kim and   
                    Zhe Jin and   
            Jung Yeon Hwang and   
                Sangrae Cho and   
           Andrew Beng Jin Teoh   Secure Chaff-less Fuzzy Vault for Face
                                  Identification Systems . . . . . . . . . 79:1--79:22
                  Hezhen Hu and   
               Wengang Zhou and   
                   Junfu Pu and   
                    Houqiang Li   Global-Local Enhancement Network for
                                  NMF-Aware Sign Language Recognition  . . 80:1--80:19
                   Feng Lin and   
               Wengang Zhou and   
                Jiajun Deng and   
                     Bin Li and   
                     Yan Lu and   
                    Houqiang Li   Residual Refinement Network with
                                  Attribute Guidance for Precise Saliency
                                  Detection  . . . . . . . . . . . . . . . 81:1--81:19
               Hongdi Zheng and   
               Junfeng Wang and   
             Jianping Zhang and   
                      Ruirui Li   IRTS: an Intelligent and Reliable
                                  Transmission Scheme for Screen Updates
                                  Delivery in DaaS . . . . . . . . . . . . 82:1--82:24
                   Rui Wang and   
                 Dong Liang and   
               Xiaochun Cao and   
                   Yuanfang Guo   Semantic Correspondence with Geometric
                                  Structure Analysis . . . . . . . . . . . 83:1--83:21
                Xinfang Liu and   
                Xiushan Nie and   
                 Junya Teng and   
                    Li Lian and   
                     Yilong Yin   Single-shot Semantic Matching Network
                                  for Moment Localization in Videos  . . . 84:1--84:14
                   Bechir Alaya   Payoff-based Dynamic Segment Replication
                                  and Graph Classification Method with
                                  Attribute Vectors Adapted to Urban VANET 85:1--85:22
              Chhavi Dhiman and   
   Dinesh Kumar Vishwakarma and   
                  Paras Agarwal   Part-wise Spatio-temporal Attention
                                  Driven CNN-based $3$D Human Action
                                  Recognition  . . . . . . . . . . . . . . 86:1--86:24
                    Jie Nie and   
              Zhi-Qiang Wei and   
                 Weizhi Nie and   
                      An-An Liu   PGNet: Progressive Feature Guide
                                  Learning Network for Three-dimensional
                                  Shape Recognition  . . . . . . . . . . . 87:1--87:17
               Shiguang Liu and   
                Huixin Wang and   
                   Xiaoli Zhang   Video Decolorization Based on the CNN
                                  and LSTM Neural Network  . . . . . . . . 88:1--88:18
              Zhenzhen Yang and   
                 Pengfei Xu and   
              Yongpeng Yang and   
                   Bing-Kun Bao   A Densely Connected Network Based on
                                  U-Net for Medical Image Segmentation . . 89:1--89:14
              Donglin Zhang and   
                Xiao-Jun Wu and   
                         Jun Yu   Label Consistent Flexible Matrix
                                  Factorization Hashing for Efficient
                                  Cross-modal Retrieval  . . . . . . . . . 90:1--90:18
                Jakub Lokoc and   
       Patrik Veselý and   
   Frantisek Mejzlík and   
     Gregor Kovalcík and   
        Tomás Soucek and   
              Luca Rossetto and   
          Klaus Schoeffmann and   
              Werner Bailer and   
              Cathal Gurrin and   
               Loris Sauter and   
                Jaeyub Song and   
         Stefanos Vrochidis and   
                  Jiaxin Wu and   
Björn \thornóR Jónsson   Is the Reign of Interactive Search
                                  Eternal? Findings from the Video Browser
                                  Showdown 2020  . . . . . . . . . . . . . 91:1--91:26
                  Qianli Xu and   
      Ana Garcia Del Molino and   
                    Jie Lin and   
                   Fen Fang and   
     Vigneshwaran Subbaraju and   
                  Liyuan Li and   
                   Joo-Hwee Lim   Lifelog Image Retrieval Based on
                                  Semantic Relevance Mapping . . . . . . . 92:1--92:18
                 Gaoming Du and   
                  Jiting Wu and   
               Hongfang Cao and   
                   Kun Xing and   
                 Zhenmin Li and   
                Duoli Zhang and   
                   Xiaolei Wang   A Real-Time Effective Fusion-Based Image
                                  Defogging Architecture on FPGA . . . . . 93:1--93:21
           Chenglizhao Chen and   
              Hongmeng Zhao and   
                  Huan Yang and   
                    Teng Yu and   
                 Chong Peng and   
                       Hong Qin   Full-reference Screen Content Image
                                  Quality Assessment by Fusing Multilevel
                                  Structure Similarity . . . . . . . . . . 94:1--94:21
                 Honglin Li and   
               Xiaoyang Mao and   
                  Mengdi Xu and   
                   Xiaogang Jin   Deep-based Self-refined Face-top
                                  Coordination . . . . . . . . . . . . . . 95:1--95:23
                Minxuan Lin and   
                   Fan Tang and   
               Weiming Dong and   
                    Xiao Li and   
              Changsheng Xu and   
                   Chongyang Ma   Distribution Aligned Multimodal and
                                  Multi-domain Image Stylization . . . . . 96:1--96:17
                    Yong Du and   
                Yangyang Xu and   
                Taizhong Ye and   
                  Qiang Wen and   
               Chufeng Xiao and   
                 Junyu Dong and   
               Guoqiang Han and   
                   Shengfeng He   Invertible Grayscale with Sparsity
                                  Enforcing Priors . . . . . . . . . . . . 97:1--97:17
            Shengsheng Qian and   
                     Jun Hu and   
                  Quan Fang and   
                  Changsheng Xu   Knowledge-aware Multi-modal Adaptive
                                  Graph Convolutional Networks for Fake
                                  News Detection . . . . . . . . . . . . . 98:1--98:23

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 17, Number 3s, October, 2021

              Yu-Dong Zhang and   
         Juan Manuel Gorriz and   
                 Zhengchao Dong   Introduction to the Special Issue on
                                  Explainable Deep Learning for Medical
                                  Image Computing  . . . . . . . . . . . . 99:1--99:2
               Tongguang Ni and   
                   Yan Ding and   
                   Jing Xue and   
                Kaijian Xia and   
                Xiaoqing Gu and   
                  Yizhang Jiang   Local Constraint and Label Embedding
                                  Multi-layer Dictionary Learning for
                                  Sperm Head Classification  . . . . . . . 100:1--100:16
               Bingzhi Chen and   
                  Yishu Liu and   
                Zheng Zhang and   
                Yingjian Li and   
                 Zhao Zhang and   
               Guangming Lu and   
                    Hongbing Yu   Deep Active Context Estimation for
                                  Automated COVID-19 Diagnosis . . . . . . 101:1--101:22
               Xiangbin Liu and   
                Jiesheng He and   
                Liping Song and   
                  Shuai Liu and   
              Gautam Srivastava   Medical Image Classification based on an
                                  Adaptive Size Deep Learning Model  . . . 102:1--102:18
                  Siyuan Lu and   
                      Di Wu and   
                Zheng Zhang and   
                  Shui-Hua Wang   An Explainable Framework for Diagnosis
                                  of COVID-19 Pneumonia via Transfer
                                  Learning and Discriminant Correlation
                                  Analysis . . . . . . . . . . . . . . . . 103:1--103:16
     Roohallah Alizadehsani and   
          Danial Sharifrazi and   
        Navid Hoseini Izadi and   
Javad Hassannataj Joloudari and   
             Afshin Shoeibi and   
             Juan M. Gorriz and   
              Sadiq Hussain and   
               Juan E. Arco and   
        Zahra Alizadeh Sani and   
           Fahime Khozeimeh and   
             Abbas Khosravi and   
            Saeid Nahavandi and   
Sheikh Mohammed Shariful Islam and   
            U. Rajendra Acharya   Uncertainty-Aware Semi-Supervised Method
                                  Using Large Unlabeled and Limited
                                  Labeled COVID-19 Data  . . . . . . . . . 104:1--104:24
            Ambeshwar Kumar and   
    Ramachandran Manikandan and   
                  Utku Kose and   
               Deepak Gupta and   
            Suresh C. Satapathy   Doctor's Dilemma: Evaluating an
                                  Explainable Subtractive Spatial
                                  Lightweight Convolutional Neural Network
                                  for Brain Tumor Diagnosis  . . . . . . . 105:1--105:26
                      Ge Su and   
                     Bo Lin and   
                    Wei Luo and   
                Jianwei Yin and   
             Shuiguang Deng and   
                Honghao Gao and   
                      Renjun Xu   Hypomimia Recognition in Parkinson's
                                  Disease With Semantic Features . . . . . 106:1--106:20
                     Qi Xin and   
                 Shaohao Hu and   
                Shuaiqi Liu and   
                  Ling Zhao and   
                   Shuihua Wang   WTRPNet: an Explainable Graph Feature
                                  Convolutional Neural Network for
                                  Epileptic EEG Classification . . . . . . 107:1--107:18
            Wen-Huang Cheng and   
                Jiaying Liu and   
                  Nicu Sebe and   
               Junsong Yuan and   
                 Hong-Han Shuai   Introduction to the Special Issue on
                                  Explainable AI on Multimedia Computing   108:1--108:2
                   Jiguo Li and   
              Xinfeng Zhang and   
                 Jizheng Xu and   
                   Siwei Ma and   
                        Wen Gao   Learning to Fool the Speaker Recognition 109:1--109:21
              Chenggang Yan and   
                  Tong Teng and   
                  Yutao Liu and   
             Yongbing Zhang and   
               Haoqian Wang and   
                   Xiangyang Ji   Precise No-Reference Image Quality
                                  Evaluation Based on Distortion
                                  Identification . . . . . . . . . . . . . 110:1--110:21
              Yung-Yao Chen and   
               Sin-Ye Jhong and   
            Chih-Hsien Hsia and   
                   Kai-Lung Hua   Explainable AI: a Multispectral
                                  Palm-Vein Identification System with New
                                  Augmentation Features  . . . . . . . . . 111:1--111:21
               Yu-Sheng Lin and   
                 Zhe-Yu Liu and   
                 Yu-An Chen and   
              Yu-Siang Wang and   
             Ya-Liang Chang and   
                 Winston H. Hsu   xCos: an Explainable Cosine Metric for
                                  Face Verification Task . . . . . . . . . 112:1--112:16
      Mohammad Shorfuzzaman and   
          M. Shamim Hossain and   
         Abdulmotaleb El Saddik   An Explainable Deep Learning Ensemble
                                  Model for Robust Diagnosis of Diabetic
                                  Retinopathy Grading  . . . . . . . . . . 113:1--113:24
                  Zhenyu Wu and   
               Zhaowen Wang and   
                    Ye Yuan and   
             Jianming Zhang and   
             Zhangyang Wang and   
                     Hailin Jin   Black-Box Diagnosis and Calibration on
                                  GAN Intra-Mode Collapse: a Pilot Study   114:1--114:18
                  Bohui Xia and   
               Xueting Wang and   
             Toshihiko Yamasaki   Semantic Explanation for Deep Neural
                                  Networks Using Feature Interactions  . . 115:1--115:19
                  Yang Wang and   
                   Yang Cao and   
                 Jing Zhang and   
                    Feng Wu and   
                  Zheng-Jun Zha   Leveraging Deep Statistics for
                                  Underwater Image Enhancement . . . . . . 116:1--116:20

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 17, Number 4, November, 2021

                   Junyi Wu and   
                  Yan Huang and   
                   Qiang Wu and   
                Zhipeng Gao and   
             Jianqiang Zhao and   
                    Liqin Huang   Dual-Stream Guided-Learning via a Priori
                                  Optimization for Person
                                  Re-identification  . . . . . . . . . . . 117:1--117:22
               Zhaoliang He and   
                Hongshan Li and   
                   Zhi Wang and   
                 Shutao Xia and   
                      Wenwu Zhu   Adaptive Compression for Online Computer
                                  Vision: an Edge Reinforcement Learning
                                  Approach . . . . . . . . . . . . . . . . 118:1--118:23
                Yingwei Pan and   
                   Yue Chen and   
                   Qian Bao and   
                 Ning Zhang and   
                   Ting Yao and   
                 Jingen Liu and   
                        Tao Mei   Smart Director: an Event-Driven
                                  Directing System for Live Broadcasting   119:1--119:18
                 Chunyan Xu and   
                   Rong Liu and   
                 Tong Zhang and   
                   Zhen Cui and   
                  Jian Yang and   
                    Chunlong Hu   Dual-Stream Structured Graph Convolution
                                  Network for Skeleton-Based Action
                                  Recognition  . . . . . . . . . . . . . . 120:1--120:22
                   Jie Wang and   
                Kaibin Tian and   
                Dayong Ding and   
                  Gang Yang and   
                      Xirong Li   Unsupervised Domain Expansion for Visual
                                  Categorization . . . . . . . . . . . . . 121:1--121:24
       Candy Olivia Mawalim and   
                Shogo Okada and   
               Yukiko I. Nakano   Task-independent Recognition of
                                  Communication Skills in Group
                                  Interaction Using Time-series Modeling   122:1--122:27
                   Bo Zhang and   
                  Rui Zhang and   
            Niccolo Bisagno and   
               Nicola Conci and   
  Francesco G. B. De Natale and   
                     Hongbo Liu   Where Are They Going? Predicting Human
                                  Behaviors in Crowded Scenes  . . . . . . 123:1--123:19
             Ellen P. Silva and   
      Natália Vieira and   
              Glauco Amorim and   
            Renata Mousinho and   
             Gustavo Guedes and   
          Gheorghita Ghinea and   
          Joel A. F. Dos Santos   Using Multisensory Content to Impact the
                                  Quality of Experience of Reading Digital
                                  Books  . . . . . . . . . . . . . . . . . 124:1--124:18
               Weitao Jiang and   
               Weixuan Wang and   
                     Haifeng Hu   Bi-Directional Co-Attention Network for
                                  Image Captioning . . . . . . . . . . . . 125:1--125:20
              Xiangjun Shen and   
               Jinghui Zhou and   
               Zhongchen Ma and   
                Bingkun Bao and   
                   Zhengjun Zha   Cross-Domain Object Representation via
                                  Robust Low-Rank Correlation Analysis . . 126:1--126:20
                    Xing Xu and   
                 Yifan Wang and   
                  Yixuan He and   
                  Yang Yang and   
              Alan Hanjalic and   
                  Heng Tao Shen   Cross-Modal Hybrid Feature Fusion for
                                  Image-Sentence Matching  . . . . . . . . 127:1--127:23
             Nicola Messina and   
             Giuseppe Amato and   
               Andrea Esuli and   
            Fabrizio Falchi and   
            Claudio Gennaro and   
Stéphane Marchand-Maillet   Fine-Grained Visual Textual Alignment
                                  for Cross-Modal Retrieval Using
                                  Transformer Encoders . . . . . . . . . . 128:1--128:23
                    Xuan Ma and   
              Xiaoshan Yang and   
                  Junyu Gao and   
                  Changsheng Xu   Health Status Prediction with
                                  Local-Global Heterogeneous Behavior
                                  Graph  . . . . . . . . . . . . . . . . . 129:1--129:21
              Guangtao Zhai and   
                    Wei Sun and   
               Xiongkuo Min and   
                   Jiantao Zhou   Perceptual Quality Assessment of
                                  Low-light Image Enhancement  . . . . . . 130:1--130:24
              Prerna Mishra and   
              Santosh Kumar and   
         Mithilesh Kumar Chaube   Dissimilarity-Based Regularized Learning
                                  of Charts  . . . . . . . . . . . . . . . 131:1--131:23
           Lokesh Nandanwar and   
  Palaiahnakote Shivakumara and   
            Divya Krishnani and   
    Raghavendra Ramachandra and   
                    Tong Lu and   
                Umapada Pal and   
              Mohan Kankanhalli   A New Foreground-Background based Method
                                  for Behavior-Oriented Social Media Image
                                  Classification . . . . . . . . . . . . . 132:1--132:25
          Mohannad Alahmadi and   
                Peter Pocta and   
                    Hugh Melvin   An Adaptive Bitrate Switching Algorithm
                                  for Speech Applications in Context of
                                  WebRTC . . . . . . . . . . . . . . . . . 133:1--133:21
                    Wei Gao and   
                Linjie Zhou and   
                     Lvfang Tao   A Fast View Synthesis Implementation
                                  Method for Light Field Applications  . . 134:1--134:20
              Jianhai Zhang and   
               Zhiyong Feng and   
                    Yong Su and   
                      Meng Xing   Bayesian Covariance Representation with
                                  Global Informative Prior for $3$D Action
                                  Recognition  . . . . . . . . . . . . . . 135:1--135:22
                   Anqi Zhu and   
                  Lin Zhang and   
                Juntao Chen and   
                    Yicong Zhou   Pedestrian-Aware Panoramic Video
                                  Stitching Based on a Structured Camera
                                  Array  . . . . . . . . . . . . . . . . . 136:1--136:24
                Yizhen Chen and   
                     Haifeng Hu   Y-Net: Dual-branch Joint Network for
                                  Semantic Segmentation  . . . . . . . . . 137:1--137:22
                Jinwei Wang and   
                  Wei Huang and   
              Xiangyang Luo and   
               Yun-Qing Shi and   
                  Sunil Kr. Jha   Detecting Non-Aligned Double JPEG
                                  Compression Based on Amplitude-Angle
                                  Feature  . . . . . . . . . . . . . . . . 138:1--138:18
                    Wei Jia and   
                      Li Li and   
                     Zhu Li and   
                Xiang Zhang and   
                       Shan Liu   Residual-guided In-loop Filter Using
                                  Convolution Neural Network . . . . . . . 139:1--139:19
                  Zhihan Lv and   
                   Houbing Song   Trust Mechanism of Feedback Trust Weight
                                  in Multimedia Network  . . . . . . . . . 140:1--140:26

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 18, Number 1, January, 2022

                   Peng Yao and   
                   Jieqing Feng   Sparse LIDAR Measurement Fusion with
                                  Joint Updating Cost for Fast Stereo
                                  Matching . . . . . . . . . . . . . . . . 1:1--1:18
     Theodoros Karagkioules and   
        Georgios S. Paschos and   
       Nikolaos Liakopoulos and   
         Attilio Fiandrotti and   
      Dimitrios Tsilimantos and   
                 Marco Cagnazzo   Online Learning for Adaptive Video
                                  Streaming in Mobile Networks . . . . . . 2:1--2:22
             Ching-Ling Fan and   
               Tse-Hou Hung and   
                 Cheng-Hsin Hsu   Modeling the User Experience of Watching
                                  360${}^\circ $ Videos with Head-Mounted
                                  Displays . . . . . . . . . . . . . . . . 3:1--3:23
                Baiju P. S. and   
              Sudhish N. George   TTV Regularized LRTA Technique for the
                                  Estimation of Haze Model Parameters in
                                  Video Dehazing . . . . . . . . . . . . . 4:1--4:22
               Samah Aloufi and   
         Abdulmotaleb El Saddik   MMSUM Digital Twins: a Multi-view
                                  Multi-modality Summarization Framework
                                  for Sporting Events  . . . . . . . . . . 5:1--5:25
               Zhoutao Wang and   
                   Qian Xie and   
              Mingqiang Wei and   
                   Kun Long and   
                       Jun Wang   Multi-feature Fusion VoteNet for $3$D
                                  Object Detection . . . . . . . . . . . . 6:1--6:17
             Md Azher Uddin and   
       Joolekha Bibi Joolee and   
              Young-Koo Lee and   
                  Kyung-Ah Sohn   A Novel Multi-Modal Network-Based
                                  Dynamic Scene Understanding  . . . . . . 7:1--7:19
               Shiguang Liu and   
                Huixin Wang and   
                        Min Pei   Facial-expression-aware Emotional Color
                                  Transfer Based on Convolutional Neural
                                  Network  . . . . . . . . . . . . . . . . 8:1--8:19
   Ana Daniela Peres Rebelo and   
Guedes De Oliveira Inês and   
           D. E. Verboom Damion   The Impact of Artificial Intelligence on
                                  the Creativity of Videos . . . . . . . . 9:1--9:27
               Yaguang Song and   
                  Junyu Gao and   
              Xiaoshan Yang and   
                  Changsheng Xu   Learning Hierarchical Video Graph
                                  Networks for One-Stop Video Delivery . . 10:1--10:23
                  Aihua Mao and   
                 Yuan Liang and   
                Jianbo Jiao and   
                Yongtuo Liu and   
                   Shengfeng He   Mask-Guided Deformation Adaptive Network
                                  for Human Parsing  . . . . . . . . . . . 11:1--11:20
        Lohic Fotio Tiotsop and   
               Tomas Mizdos and   
           Marcus Barkowsky and   
                Peter Pocta and   
           Antonio Servetti and   
                  Enrico Masala   Mimicking Individual Media Quality
                                  Perception with Neural Network based
                                  Artificial Observers . . . . . . . . . . 12:1--12:25
              William Thong and   
               Cees G. M. Snoek   Diversely-Supervised Visual Product
                                  Search . . . . . . . . . . . . . . . . . 13:1--13:22
             Farshid Farhat and   
      Mohammad Mahdi Kamani and   
                  James Z. Wang   CAPTAIN: Comprehensive Composition
                                  Assistance for Photo Taking  . . . . . . 14:1--14:24
         Amanda K. Holloman and   
              Chris S. Crawford   Defining Scents: a Systematic Literature
                                  Review of Olfactory-based Computing
                                  Systems  . . . . . . . . . . . . . . . . 15:1--15:22
               Xian-Hua Han and   
             Yinqiang Zheng and   
                   Yen-Wei Chen   Hyperspectral Image Reconstruction Using
                                  Multi-scale Fusion Learning  . . . . . . 16:1--16:21
                   Shuji Tasaka   An Empirical Method for Causal Inference
                                  of Constructs for QoE in
                                  Haptic-Audiovisual Communications  . . . 17:1--17:24
               Dongbao Yang and   
                    Yu Zhou and   
                    Wei Shi and   
                   Dayan Wu and   
                   Weiping Wang   RD-IOD: Two-Level
                                  Residual-Distillation-Based
                                  Triple-Network for Incremental Object
                                  Detection  . . . . . . . . . . . . . . . 18:1--18:23
               Chih-Fan Hsu and   
               Tse-Hou Hung and   
                 Cheng-Hsin Hsu   Optimizing Immersive Video Coding
                                  Configurations Using Deep Learning: a
                                  Case Study on TMIV . . . . . . . . . . . 19:1--19:25
      Rémy Siegfried and   
               Jean-Marc Odobez   Robust Unsupervised Gaze Calibration
                                  Using Conversation and Manipulation
                                  Attention Priors . . . . . . . . . . . . 20:1--20:27
                  Jing Wang and   
                Weiqing Min and   
                 Sujuan Hou and   
                Shengnan Ma and   
              Yuanjie Zheng and   
                 Shuqiang Jiang   LogoDet-3K: a Large-scale Image Dataset
                                  for Logo Detection . . . . . . . . . . . 21:1--21:19
                 Da-Chun Wu and   
                   Yu-Tsung Hsu   Authentication of LINE Chat History
                                  Files by Information Hiding  . . . . . . 22:1--22:23
              Changming Liu and   
                Xiaojing Ma and   
                 Sixing Cao and   
                  Jiayun Fu and   
                     Bin B. Zhu   Privacy-preserving Motion Detection for
                                  HEVC-compressed Surveillance Video . . . 23:1--23:27

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 18, Number 1s, February, 2022

             Shiliang Zhang and   
                 Guorong Li and   
              Weigang Zhang and   
             Qingming Huang and   
               Tiejun Huang and   
               Mubarak Shah and   
                      Nicu Sebe   Introduction to the Special Issue on
                                  Fine-Grained Visual Recognition and
                                  Re-Identification  . . . . . . . . . . . 24:1--24:3
                   La Zhang and   
                 Haiyun Guo and   
                   Kuan Zhu and   
               Honglin Qiao and   
               Gaopan Huang and   
                  Sen Zhang and   
              Huichen Zhang and   
                   Jian Sun and   
                   Jinqiao Wang   Hybrid Modality Metric Learning for
                                  Visible-Infrared Person
                                  Re-Identification  . . . . . . . . . . . 25:1--25:15
                   Sheng Xu and   
                  Chang Liu and   
             Baochang Zhang and   
              Jinhu Lü and   
                Guodong Guo and   
                 David Doermann   BiRe-ID: Binary Neural Network for
                                  Efficient Person Re-ID . . . . . . . . . 26:1--26:22
              Zhongwei Zhao and   
                   Ran Song and   
                 Qian Zhang and   
                  Peng Duan and   
                   Youmei Zhang   JoT-GAN: a Framework for Jointly
                                  Training GAN and Person
                                  Re-Identification Model  . . . . . . . . 27:1--27:18
               Liqian Liang and   
               Congyan Lang and   
                     Zun Li and   
                  Jian Zhao and   
                   Tao Wang and   
                    Songhe Feng   Seeing Crucial Parts: Vehicle Model
                                  Verification via a Discriminative
                                  Representation Model . . . . . . . . . . 28:1--28:22
              Chenggang Yan and   
                Lixuan Meng and   
                   Liang Li and   
               Jiehua Zhang and   
                  Zhan Wang and   
                   Jian Yin and   
               Jiyong Zhang and   
                  Yaoqi Sun and   
                    Bolun Zheng   Age-Invariant Face Recognition by
                                  Multi-Feature Fusionand Decomposition
                                  with Self-attention  . . . . . . . . . . 29:1--29:18
                Deming Zhai and   
                Ruifeng Shi and   
               Junjun Jiang and   
                   Xianming Liu   Rectified Meta-learning from Noisy
                                  Labels for Robust Image-based Plant
                                  Disease Classification . . . . . . . . . 30:1--30:17
                    Min Tan and   
                    Fu Yuan and   
                     Jun Yu and   
                Guijun Wang and   
                    Xiaoling Gu   Fine-grained Image Classification via
                                  Multi-scale Selective Hierarchical
                                  Biquadratic Pooling  . . . . . . . . . . 31:1--31:23
             Rita Cucchiara and   
                  Matteo Fabbri   Fine-grained Human Analysis under
                                  Occlusions and Perspective Constraints
                                  in Multimedia Surveillance . . . . . . . 32:1--32:23
                     Lei Wu and   
                 Hefei Ling and   
                 Yuxuan Shi and   
                   Baiyan Zhang   Instance Correlation Graph for
                                  Unsupervised Domain Adaptation . . . . . 33:1--33:23
             Daniele Mugnai and   
           Federico Pernici and   
         Francesco Turchini and   
              Alberto Del Bimbo   Fine-Grained Adversarial Semi-Supervised
                                  Learning . . . . . . . . . . . . . . . . 34:1--34:19
                 Dezhao Luo and   
                    Yu Zhou and   
                    Bo Fang and   
                 Yucan Zhou and   
                   Dayan Wu and   
                   Weiping Wang   Exploring Relations in Untrimmed Videos
                                  for Self-Supervised Learning . . . . . . 35:1--35:21
                 Yabin Wang and   
                 Zhiheng Ma and   
                   Xing Wei and   
                Shuai Zheng and   
                Yaowei Wang and   
                  Xiaopeng Hong   ECCNAS: Efficient Crowd Counting Neural
                                  Architecture Search  . . . . . . . . . . 36:1--36:19
                   Wenxu Li and   
                   Gang Pan and   
                  Chen Wang and   
                  Zhen Xing and   
                    Zhenjun Han   From Coarse to Fine: Hierarchical
                                  Structure-aware Video Summarization  . . 37:1--37:16
          M. Shamim Hossain and   
             Rita Cucchiara and   
            Ghulam Muhammad and   
      Diana P. Tobón and   
         Abdulmotaleb El Saddik   Special Section on AI-empowered
                                  Multimedia Data Analytics for Smart
                                  Healthcare . . . . . . . . . . . . . . . 38:1--38:2
                   Min Chen and   
               Wenjing Xiao and   
                    Miao Li and   
                  Yixue Hao and   
                    Long Hu and   
                  Guangming Tao   A Multi-feature and Time-aware-based
                                  Stress Evaluation Mechanism for Mental
                                  Status Adjustment  . . . . . . . . . . . 39:1--39:18
               Mehedi Masud and   
        Mohammed F. Alhamid and   
                      Yin Zhang   A Convolutional Neural Network Model
                                  Using Weighted Loss Function to Detect
                                  Diabetic Retinopathy . . . . . . . . . . 40:1--40:16
                  Debin Liu and   
           Laurence T. Yang and   
                Puming Wang and   
                Ruonan Zhao and   
                 Qingchen Zhang   TT-TSVD: a Multi-modal Tensor Train
                                  Decomposition with Its Application in
                                  Convolutional Neural Networks for Smart
                                  Healthcare . . . . . . . . . . . . . . . 41:1--41:17
              Chun-Wei Yang and   
            Thanh Hai Phung and   
             Hong-Han Shuai and   
                Wen-Huang Cheng   Mask or Non-Mask? Robust Face Mask
                                  Detector via Triplet-Consistency
                                  Representation Learning  . . . . . . . . 42:1--42:20
                  Zhihan Lv and   
                Zengchen Yu and   
                Shuxuan Xie and   
                    Atif Alamri   Deep Learning-based Smart Predictive
                                  Evaluation for Interactive
                                  Multimedia-enabled Smart Healthcare  . . 43:1--43:20

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 18, Number 2, May, 2022

              Hadi Amirpour and   
           Antonio Pinheiro and   
            Manuela Pereira and   
       Fernando J. P. Lopes and   
              Mohammad Ghanbari   Efficient Light Field Image Compression
                                  with Enhanced Random Access  . . . . . . 44:1--44:18
              Pedro Morillo and   
José J. Navarro-Pérez and   
      Juan M. Orduña and   
        Marcos Fernández   Evaluation of an Intervention Program
                                  Based on Mobile Apps to Learn Sexism
                                  Prevention in Teenagers  . . . . . . . . 45:1--45:20
               Yansong Tang and   
                 Xingyu Liu and   
                   Xumin Yu and   
              Danyang Zhang and   
                   Jiwen Lu and   
                       Jie Zhou   Learning from Temporal Spatial Cubism
                                  for Cross-Dataset Skeleton-based Action
                                  Recognition  . . . . . . . . . . . . . . 46:1--46:24
            Burak Kizilkaya and   
                 Enver Ever and   
         Hakan Yekta Yatbaz and   
                   Adnan Yazici   An Effective Forest Fire Detection
                                  Framework Using Heterogeneous Wireless
                                  Multimedia Sensor Networks . . . . . . . 47:1--47:21
                   Yehao Li and   
                 Jiahao Fan and   
                Yingwei Pan and   
                   Ting Yao and   
                 Weiyao Lin and   
                        Tao Mei   Uni-EDEN: Universal Encoder-Decoder
                                  Network by Multi-Granular
                                  Vision-Language Pre-training . . . . . . 48:1--48:16
              Shenming Feng and   
             Xingzhong Nong and   
                     Haifeng Hu   Cascaded Structure-Learning Network with
                                  Using Adversarial Training for Robust
                                  Facial Landmark Detection  . . . . . . . 49:1--49:20
              Sam Van Damme and   
          Maria Torres Vega and   
                 Filip De Turck   Machine Learning Based Content-Agnostic
                                  Viewport Prediction for 360-Degree Video 50:1--50:24
               Chih-Kuo Yeh and   
           Thi-Ngoc-Hanh Le and   
               Zhi-Ying Hou and   
                   Tong-Yee Lee   Generating Virtual Wire Sculptural Art
                                  from $3$D Models . . . . . . . . . . . . 51:1--51:23
                   Teng Sun and   
                  Chun Wang and   
               Xuemeng Song and   
                  Fuli Feng and   
                    Liqiang Nie   Response Generation by Jointly Modeling
                                  Personalized Linguistic Styles and
                                  Emotions . . . . . . . . . . . . . . . . 52:1--52:20
              Jobin Francis and   
                 M. Baburaj and   
              Sudhish N. George   An $ l_{1 / 2} $ and Graph Regularized
                                  Subspace Clustering Method for Robust
                                  Image Segmentation . . . . . . . . . . . 53:1--53:24
                Jiahao Wang and   
               Yunhong Wang and   
                  Nina Weng and   
               Tianrui Chai and   
                   Annan Li and   
                 Faxi Zhang and   
                       Samsi Yu   Will You Ever Become Popular? Learning
                                  to Predict Virality of Dance Clips . . . 54:1--54:24
            Sheng-Hua Zhong and   
                 Jingxu Lin and   
                Jianglin Lu and   
                Ahmed Fares and   
                    Tongwei Ren   Deep Semantic and Attentive Network for
                                  Unsupervised Video Summarization . . . . 55:1--55:21
                 Yawen Zeng and   
                     Da Cao and   
                 Shaofei Lu and   
              Hanling Zhang and   
                    Jiao Xu and   
                      Zheng Qin   Moment is Important: Language-Based
                                  Video Moment Retrieval via Adversarial
                                  Learning . . . . . . . . . . . . . . . . 56:1--56:21
                  Hanjie Wu and   
                Yongtuo Liu and   
                Hongmin Cai and   
                   Shengfeng He   Learning Transferable Perturbations for
                                  Image Captioning . . . . . . . . . . . . 57:1--57:18
                   Ziyi Sun and   
              Yunfeng Zhang and   
                Fangxun Bao and   
                  Ping Wang and   
               Xunxiang Yao and   
                  Caiming Zhang   SADnet: Semi-supervised Single Image
                                  Dehazing Method Based on an Attention
                                  Mechanism  . . . . . . . . . . . . . . . 58:1--58:23
               Feifei Zhang and   
               Mingliang Xu and   
                  Changsheng Xu   Tell, Imagine, and Search: End-to-end
                                  Learning for Composing Text and Image to
                                  Image Retrieval  . . . . . . . . . . . . 59:1--59:23
                   Haoyu Ma and   
              Bingchen Gong and   
                      Yizhou Yu   Structure-aware Meta-fusion for Image
                                  Super-resolution . . . . . . . . . . . . 60:1--60:25
               Madiha Tahir and   
                Zahid Halim and   
             Atta Ur Rahman and   
             Muhammad Waqas and   
                Shanshan Tu and   
                 Sheng Chen and   
                        Zhu Han   Non-Acted Text and Keystrokes Database
                                  and Learning Methods to Recognize
                                  Emotions . . . . . . . . . . . . . . . . 61:1--61:24
             Matteo Fincato and   
            Marcella Cornia and   
             Federico Landi and   
               Fabio Cesari and   
                 Rita Cucchiara   Transform, Warp, and Dress: a New
                                  Transformation-guided Model for Virtual
                                  Try-on . . . . . . . . . . . . . . . . . 62:1--62:24
                   Ning Han and   
              Jingjing Chen and   
                  Hao Zhang and   
               Huanwen Wang and   
                       Hao Chen   Adversarial Multi-Grained Embedding
                                  Network for Cross-Modal Text-Video
                                  Retrieval  . . . . . . . . . . . . . . . 63:1--63:23
                    Bo Pang and   
                Deming Zhai and   
               Junjun Jiang and   
                   Xianming Liu   Fully Unsupervised Person
                                  Re-Identification via Selective
                                  Contrastive Learning . . . . . . . . . . 64:1--64:15
              Wenlin Zhuang and   
                Congyi Wang and   
              Jinxiang Chai and   
               Yangang Wang and   
                  Ming Shao and   
                       Siyu Xia   Music2Dance: DanceNet for Music-Driven
                                  Dance Generation . . . . . . . . . . . . 65:1--65:21
                Eva Cetinic and   
                      James She   Understanding and Creating Art with AI:
                                  Review and Outlook . . . . . . . . . . . 66:1--66:22

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 18, Number 2s, June, 2022

                Zheng Zhang and   
              Jianning Wang and   
                    Lei Zhu and   
                   Guangming Lu   Discriminative Visual Similarity Search
                                  with Semantically Cycle-consistent
                                  Hashing Networks . . . . . . . . . . . . 114:1--114:??
                 Shiming Ge and   
                Fanzhao Lin and   
                  Chenyu Li and   
               Daichi Zhang and   
               Weiping Wang and   
                       Dan Zeng   Deepfake Video Detection via Predictive
                                  Representation Learning  . . . . . . . . 115:1--115:??
           Leonardo Galteri and   
          Lorenzo Seidenari and   
             Pietro Bongini and   
              Marco Bertini and   
              Alberto Del Bimbo   LANBIQUE: LANguage-based Blind Image
                                  QUality Evaluation . . . . . . . . . . . 116:1--116:??
                  Zhihan Lv and   
             Dongliang Chen and   
                      Haibin Lv   Smart City Construction and Management
                                  by Digital Twins and BIM Big Data in
                                  COVID-19 Scenario  . . . . . . . . . . . 117:1--117:??
               Ashima Anand and   
               Amit Kumar Singh   A Comprehensive Study of Deep
                                  Learning-based Covert Communication  . . 118:1--118:??
                 Haotian Xu and   
                 Xiaobo Jin and   
               Qiufeng Wang and   
               Amir Hussain and   
                   Kaizhu Huang   Exploiting Attention-Consistency Loss
                                  For Spatial-Temporal Stream Action
                                  Recognition  . . . . . . . . . . . . . . 119:1--119:??
                 Sara Salim and   
              Nour Moustafa and   
          Benjamin Turnbull and   
                   Imran Razzak   Perturbation-enabled Deep Federated
                                  Learning for Preserving Internet of
                                  Things-based Social Networks . . . . . . 120:1--120:??
                   An-Qi Bi and   
             Xiao-Yang Tian and   
              Shui-Hua Wang and   
                  Yu-Dong Zhang   Dynamic Transfer Exemplar based Facial
                                  Emotion Recognition Model Toward Online
                                  Video  . . . . . . . . . . . . . . . . . 121:1--121:??
          Marjan Golmaryami and   
               Rahim Taheri and   
            Zahra Pooranian and   
          Mohammad Shojafar and   
                       Pei Xiao   SETTI: a Self-supervised AdvErsarial
                                  Malware DeTection ArchiTecture in an IoT
                                  Environment  . . . . . . . . . . . . . . 122:1--122:??
                 Abbas Khan and   
                Ijaz Ul Haq and   
            Tanveer Hussain and   
              Khan Muhammad and   
             Mohammad Hijji and   
            Muhammad Sajjad and   
Victor Hugo C. De Albuquerque and   
                 Sung Wook Baik   PMAL: a Proxy Model Active Learning
                                  Approach for Vision Based Industrial
                                  Applications . . . . . . . . . . . . . . 123:1--123:??
                Chenyi Yang and   
                Xiaolong Xu and   
              Xiaokang Zhou and   
                    Lianyong Qi   Deep Q Network-Driven Task Offloading
                                  for Efficient Multimedia Data Analysis
                                  in Edge Computing-Assisted IoV . . . . . 124:1--124:??
                Arti Tiwari and   
                    Millie Pant   Optimized Deep-Neural Network for
                                  Content-based Medical Image Retrieval in
                                  a Brownfield IoMT Network  . . . . . . . 125:1--125:??
                  Wei Huang and   
                 Yuze Zhang and   
                    Shaohua Wan   A Sorting Fuzzy Min-Max Model in an
                                  Embedded System for Atrial Fibrillation
                                  Detection  . . . . . . . . . . . . . . . 126:1--126:??
                   Xun Yang and   
                Liang Zheng and   
                Elisa Ricci and   
                      Meng Wang   Introduction to the Special Section on
                                  Learning Representations, Similarity,
                                  and Associations in Dynamic Multimedia
                                  Environments . . . . . . . . . . . . . . 127:1--127:??
                     Jun He and   
               Richang Hong and   
               Xueliang Liu and   
               Mingliang Xu and   
                     Qianru Sun   Revisiting Local Descriptor for Improved
                                  Few-Shot Classification  . . . . . . . . 127:1--127:??
              Yingying Jiao and   
               Haipeng Chen and   
               Runyang Feng and   
               Haoming Chen and   
                   Sifan Wu and   
                 Yifang Yin and   
                  Zhenguang Liu   GLPose: Global-Local Representation
                                  Learning for Human Pose Estimation . . . 128:1--128:??
                   Qing Han and   
                Huiting Liu and   
                Weidong Min and   
               Tiemei Huang and   
                   Deyu Lin and   
                        Qi Wang   $3$D Skeleton and Two Streams Approach
                                  to Person Re-identification Using
                                  Optimized Region Matching  . . . . . . . 129:1--129:??
                     Xin Xu and   
                   Xin Yuan and   
                 Zheng Wang and   
                  Kai Zhang and   
                      Ruimin Hu   Rank-in-Rank Loss for Person
                                  Re-identification  . . . . . . . . . . . 130:1--130:??
                 Kunpeng Li and   
                  Chang Liu and   
                 Mike Stopa and   
                  Jun Amano and   
                         Yun Fu   Guided Graph Attention Learning for
                                  Video-Text Matching  . . . . . . . . . . 131:1--131:??
      Niccoló Biondi and   
           Federico Pernici and   
               Matteo Bruni and   
             Daniele Mugnai and   
              Alberto Del Bimbo   CL$^2$R: Compatible Lifelong Learning
                                  Representations  . . . . . . . . . . . . 132:1--132:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 18, Number 3, August, 2022

                Yonghua Pan and   
                  Zechao Li and   
                Liyan Zhang and   
                    Jinhui Tang   Causal Inference with Knowledge
                                  Distilling and Curriculum Learning for
                                  Unbiased VQA . . . . . . . . . . . . . . 67:1--67:23
             Rintaro Yanagi and   
                   Ren Togo and   
             Takahiro Ogawa and   
                  Miki Haseyama   Interactive Re-ranking via Object
                                  Entropy-Guided Question Answering for
                                  Cross-Modal Image Retrieval  . . . . . . 68:1--68:17
             Qinghongya Shi and   
              Hong-Bo Zhang and   
                     Zhe Li and   
                Ji-Xiang Du and   
                   Qing Lei and   
                   Jing-Hua Liu   Shuffle-invariant Network for Action
                                  Recognition in Videos  . . . . . . . . . 69:1--69:18
                    Di Yuan and   
              Xiaojun Chang and   
                  Zhihui Li and   
                      Zhenyu He   Learning Adaptive Spatial-Temporal
                                  Context-Aware Correlation Filters for
                                  UAV Tracking . . . . . . . . . . . . . . 70:1--70:18
                 Guofei Sun and   
              Yongkang Wong and   
       Mohan S. Kankanhalli and   
               Xiangdong Li and   
                   Weidong Geng   Enhanced $3$D Shape Reconstruction With
                                  Knowledge Graph of Category Concept  . . 71:1--71:20
                 Jinfeng Li and   
                Weifeng Liu and   
                Yicong Zhou and   
                     Jun Yu and   
                 Dapeng Tao and   
                  Changsheng Xu   Domain-invariant Graph for Adaptive
                                  Semi-supervised Domain Adaptation  . . . 72:1--72:18
                    Ran Shi and   
                    Jing Ma and   
              King Ngi Ngan and   
                 Jian Xiong and   
                      Tong Qiao   Objective Object Segmentation Visual
                                  Quality Evaluation: Quality Measure and
                                  Pooling Method . . . . . . . . . . . . . 73:1--73:19
               Linghua Zeng and   
                    Xinmei Tian   CRAR: Accelerating Stereo Matching with
                                  Cascaded Residual Regression and
                                  Adaptive Refinement  . . . . . . . . . . 74:1--74:19
              Lingxiang Yao and   
       Worapan Kusakunniran and   
                   Qiang Wu and   
                Jingsong Xu and   
                     Jian Zhang   Recognizing Gaits Across Walking and
                                  Running Speeds . . . . . . . . . . . . . 75:1--75:22
                     Qun Li and   
                    Fu Xiao and   
                  Bir Bhanu and   
                Biyun Sheng and   
                   Richang Hong   Inner Knowledge-based Img2Doc Scheme for
                                  Visual Question Answering  . . . . . . . 76:1--76:21
            Marcella Cornia and   
               Matteo Tomei and   
            Lorenzo Baraldi and   
                 Rita Cucchiara   Matching Faces and Attributes Between
                                  the Artistic and the Real Domain: the
                                  PersonArt Approach . . . . . . . . . . . 77:1--77:23
               Guanghao Yin and   
               Shouqian Sun and   
                    Dian Yu and   
                  Dejian Li and   
                    Kejun Zhang   A Multimodal Framework for Large-Scale
                                  Emotion Recognition by Fusing Music and
                                  Electrodermal Activity Signals . . . . . 78:1--78:23
         Himanshu Buckchash and   
          Balasubramanian Raman   GraSP: Local Grassmannian
                                  Spatio-Temporal Patterns for
                                  Unsupervised Pose Sequence Recognition   79:1--79:23
              Xiaoguang Zhu and   
                     Ye Zhu and   
                 Haoyu Wang and   
                Honglin Wen and   
                    Yan Yan and   
                     Peilin Liu   Skeleton Sequence and RGB Frame Based
                                  Multi-Modality Feature Fusion Network
                                  for Action Recognition . . . . . . . . . 80:1--80:24
     Debanjan Roy Chowdhury and   
              Sukumar Nandi and   
                Diganta Goswami   Distributed Gateway Selection for Video
                                  Streaming in VANET Using IP Multicast    81:1--81:24
               Bechir Alaya and   
                  Lamaa Sellami   Multilayer Video Encoding for QoS
                                  Managing of Video Streaming in VANET
                                  Environment  . . . . . . . . . . . . . . 82:1--82:19
                    Yike Wu and   
                Shiwan Zhao and   
                 Ying Zhang and   
               Xiaojie Yuan and   
                       Zhong Su   When Pairs Meet Triplets: Improving
                                  Low-Resource Captioning via
                                  Multi-Objective Optimization . . . . . . 83:1--83:20
               Kai-Wei Yang and   
              Yen-Yun Huang and   
              Jen-Wei Huang and   
                 Ya-Rou Hsu and   
              Chang-Lin Wan and   
             Hong-Han Shuai and   
               Li-Chun Wang and   
                Wen-Huang Cheng   Improving Crowd Density Estimation by
                                  Fusing Aerial Images and Radio Signals   84:1--84:23
                 Zhihua Xia and   
                   Qiuju Ji and   
                      Qi Gu and   
            Chengsheng Yuan and   
                   Fengjun Xiao   A Format-compatible Searchable
                                  Encryption Scheme for JPEG Images Using
                                  Bag-of-words . . . . . . . . . . . . . . 85:1--85:18
     Iynkaran Natgunanathan and   
     Purathani Praitheeshan and   
              Longxiang Gao and   
                 Yong Xiang and   
                        Lei Pan   Blockchain-Based Audio Watermarking
                                  Technique for Multimedia Copyright
                                  Protection in Distribution Networks  . . 86:1--86:23
                  Kehua Guo and   
                     Min Hu and   
                  Sheng Ren and   
                Fangfang Li and   
                 Jian Zhang and   
                  Haifu Guo and   
                    Xiaoyan Kui   Deep Illumination-Enhanced Face
                                  Super-Resolution Network for Low-Light
                                  Images . . . . . . . . . . . . . . . . . 87:1--87:19
               Xiaoming Liu and   
                  Shuo Wang and   
                 Ying Zhang and   
                      Quan Yuan   Scribble-Supervised Meibomian Glands
                                  Segmentation in Infrared Images  . . . . 88:1--88:23
           Kedar Nath Singh and   
               Amit Kumar Singh   Towards Integrating Image Encryption
                                  with Compression: a Survey . . . . . . . 89:1--89:21

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 18, Number 3s, October, 2022

Carlos Enrique Montenegro Marin and   
      Dinesh Jackson Samuel and   
          Nallappan Gunasekaran   Introduction to the Special Issue on 6G
                                  Enabled Interactive Multimedia
                                  Communication Systems  . . . . . . . . . 133:1--133:??
                     Ran Li and   
                    Wei Wei and   
                 Peinan Hao and   
                    Jian Su and   
                   Fengyuan Sun   Context-aware Pseudo-true Video
                                  Interpolation at 6G Edge . . . . . . . . 133:1--133:??
           Abdullah Alharbi and   
         Mohammed Aljebreen and   
                  Amr Tolba and   
      Konstantinos A. Lizos and   
          Saied Abd El-Atty and   
                   Farid Shawki   A Normalized Slicing-assigned
                                  Virtualization Method for 6G-based
                                  Wireless Communication Systems . . . . . 134:1--134:??
                  Yin Zhang and   
                Iztok Humar and   
                    Jia Liu and   
                Alireza Jolfaei   Introduction to the Special Issue on
                                  Affective Services based on
                                  Representation Learning  . . . . . . . . 135:1--135:??
                   Kexin Xu and   
               Haijun Zhang and   
                Keping Long and   
              Jianquan Wang and   
                        Lei Sun   DRL based Joint Affective Services
                                  Computing and Resource Allocation in
                                  ISTN . . . . . . . . . . . . . . . . . . 135:1--135:??
               Yazhou Zhang and   
              Prayag Tiwari and   
                    Lu Rong and   
                   Rui Chen and   
          Nojoom A. Alnajem and   
              M. Shamim Hossain   Affective Interaction: Attentive
                                  Representation Learning for Multi-Modal
                                  Sentiment Classification . . . . . . . . 136:1--136:??
               Xiaoqin Wang and   
                  Chen Chen and   
                  Rushi Lan and   
                Licheng Liu and   
               Zhenbing Liu and   
                 Huiyu Zhou and   
                    Xiaonan Luo   Binary Representation via Jointly
                                  Personalized Sparse Hashing  . . . . . . 137:1--137:??
                    Xin Jin and   
                 Xinning Li and   
                    Hao Lou and   
                 Chenyu Fan and   
                 Qiang Deng and   
                Chaoen Xiao and   
                  Shuai Cui and   
               Amit Kumar Singh   Aesthetic Attribute Assessment of Images
                                  Numerically on Mixed Multi-attribute
                                  Datasets . . . . . . . . . . . . . . . . 138:1--138:??
                    Jie Cao and   
               Youquan Wang and   
               Haicheng Tao and   
                      Xiang Guo   Sensor-based Human Activity Recognition
                                  Using Graph LSTM and Multi-task
                                  Classification Model . . . . . . . . . . 139:1--139:??
               Jiawei Huang and   
                  Qichen Su and   
                   Weihe Li and   
                Zhuoran Liu and   
                  Tao Zhang and   
                    Sen Liu and   
                 Ping Zhong and   
              Wanchun Jiang and   
                   Jianxin Wang   Opportunistic Transmission for Video
                                  Streaming over Wild Internet . . . . . . 140:1--140:??
           Zhengfang Duanmu and   
                 Wentao Liu and   
                  Diqi Chen and   
                 Zhuoran Li and   
                  Zhou Wang and   
                Yizhou Wang and   
                        Wen Gao   A Bayesian Quality-of-Experience Model
                                  for Adaptive Streaming Videos  . . . . . 141:1--141:??
                 Oana Ignat and   
            Santiago Castro and   
                Yuhang Zhou and   
                 Jiajun Bao and   
                Dandan Shan and   
                  Rada Mihalcea   When Did It Happen? Duration-informed
                                  Temporal Localization of Narrated
                                  Actions in Vlogs . . . . . . . . . . . . 142:1--142:??
                 Wuzhen Shi and   
                    Shaohui Liu   Hiding Message Using a Cycle Generative
                                  Adversarial Network  . . . . . . . . . . 143:1--143:??
                   Chen Hui and   
                Shaohui Liu and   
                 Wuzhen Shi and   
                 Feng Jiang and   
                     Debin Zhao   Spatio-Temporal Context Based Adaptive
                                  Camcorder Recording Watermarking . . . . 144:1--144:??
                  Jian Zhao and   
                Xianhui Liu and   
                   Weidong Zhao   Balanced and Accurate Pseudo-Labels for
                                  Semi-Supervised Image Classification . . 145:1--145:??
           Lorenzo Stacchio and   
             Alessia Angeli and   
           Giuseppe Lisanti and   
            Daniela Calanca and   
                 Gustavo Marfia   Toward a Holistic Approach to the
                                  Socio-historical Analysis of Vernacular
                                  Photos . . . . . . . . . . . . . . . . . 146:1--146:??
               Hui-Chu Xiao and   
               Wan-Lei Zhao and   
                    Jie Lin and   
               Yi-Geng Hong and   
                  Chong-Wah Ngo   Deeply Activated Salient Region for
                                  Instance Search  . . . . . . . . . . . . 147:1--147:??
                 Zuquan Liu and   
                  Guopu Zhu and   
                  Feng Ding and   
              Xiangyang Luo and   
                  Sam Kwong and   
                        Peng Li   Contrast-Enhanced Color Visual
                                  Cryptography for $ (k, n) $ Threshold
                                  Schemes  . . . . . . . . . . . . . . . . 148:1--148:??
                    Zhe Liu and   
                   Xian-Hua Han   Deep Self-Supervised Hyperspectral Image
                                  Reconstruction . . . . . . . . . . . . . 149:1--149:??
             Gurinder Singh and   
                   Puneet Goyal   SDCN2: a Shallow Densely Connected CNN
                                  for Multi-Purpose Image Manipulation
                                  Detection  . . . . . . . . . . . . . . . 150:1--150:??
                 Yunfei Liu and   
                      Yu Li and   
                 Shaodi You and   
                        Feng Lu   Semantic Guided Single Image Reflection
                                  Removal  . . . . . . . . . . . . . . . . 151:1--151:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 18, Number 4, November, 2022

                Jingjing Wu and   
              Jianguo Jiang and   
                  Meibin Qi and   
                Cuiqun Chen and   
                      Yimin Liu   Improving Feature Discrimination for
                                  Object Tracking by
                                  Structural-similarity-based Metric
                                  Learning . . . . . . . . . . . . . . . . 90:1--90:23
              Xiaowen Huang and   
                 Jitao Sang and   
                  Changsheng Xu   Image-Based Personality Questionnaire
                                  Design . . . . . . . . . . . . . . . . . 91:1--91:??
                 Shijie Hao and   
                     Xu Han and   
                Yanrong Guo and   
                      Meng Wang   Decoupled Low-Light Image Enhancement    92:1--92:19
                 Yibing Liu and   
               Yangyang Guo and   
                Jianhua Yin and   
               Xuemeng Song and   
                Weifeng Liu and   
                Liqiang Nie and   
                      Min Zhang   Answer Questions with Right Image
                                  Regions: a Visual Attention
                                  Regularization Approach  . . . . . . . . 93:1--93:18
                    Yang Yu and   
                Rongrong Ni and   
                  Wenjie Li and   
                       Yao Zhao   Detection of AI-Manipulated Fake Faces
                                  via Mining Generalized Features  . . . . 94:1--94:23
                Yuhao Cheng and   
              Xiaoguang Zhu and   
               Jiuchao Qian and   
                    Fei Wen and   
                     Peilin Liu   Cross-modal Graph Matching Network for
                                  Image-text Retrieval . . . . . . . . . . 95:1--95:23
              Mihai Dogariu and   
      Liviu-Daniel \cStefan and   
      Bogdan Andrei Boteanu and   
              Claudiu Lamba and   
                   Bomi Kim and   
                 Bogdan Ionescu   Generation of Realistic Synthetic
                                  Financial Time-series  . . . . . . . . . 96:1--96:27
                   Yi Zheng and   
                  Yong Zhou and   
                 Jiaqi Zhao and   
                  Ying Chen and   
                    Rui Yao and   
                   Bing Liu and   
         Abdulmotaleb El Saddik   Clustering Matters: Sphere Feature for
                                  Fully Unsupervised Person
                                  Re-identification  . . . . . . . . . . . 97:1--97:18
              Zengming Tang and   
                      Jun Huang   Harmonious Multi-branch Network for
                                  Person Re-identification with Harder
                                  Triplet Loss . . . . . . . . . . . . . . 98:1--98:21
                   Yifan Xu and   
                Kekai Sheng and   
               Weiming Dong and   
                 Baoyuan Wu and   
              Changsheng Xu and   
                    Bao-Gang Hu   Towards Corruption-Agnostic Robust
                                  Domain Adaptation  . . . . . . . . . . . 99:1--99:16
                 Jinzhi Lin and   
                  Yun Zhang and   
                      Na Li and   
                 Hongling Jiang   Joint Source-Channel Decoding of Polar
                                  Codes for HEVC-Based Video Streaming . . 100:1--100:23
                 Yongrui Li and   
                Zengfu Wang and   
                         Jun Yu   Densely Enhanced Semantic Network for
                                  Conversation System in Social Media  . . 101:1--101:24
                    Kai Lin and   
               Chuanmin Jia and   
              Xinfeng Zhang and   
               Shanshe Wang and   
                   Siwei Ma and   
                        Wen Gao   NR-CNN: Nested-Residual Guided CNN
                                  In-loop Filtering for Video Coding . . . 102:1--102:22
                 Hanbin Dai and   
                 Hailin Shi and   
                     Wu Liu and   
               Linfang Wang and   
                 Yinglu Liu and   
                        Tao Mei   FasterPose: a Faster Simple Baseline for
                                  Human Pose Estimation  . . . . . . . . . 103:1--103:16
                    Xin Man and   
             Deqiang Ouyang and   
               Xiangpeng Li and   
              Jingkuan Song and   
                       Jie Shao   Scenario-Aware Recurrent Transformer for
                                  Goal-Directed Video Captioning . . . . . 104:1--104:17
              Tianjun Zhang and   
                   Hao Deng and   
                  Lin Zhang and   
              Shengjie Zhao and   
                   Xiao Liu and   
                    Yicong Zhou   Online Correction of Camera Poses for
                                  the Surround-view System: a Sparse
                                  Direct Approach  . . . . . . . . . . . . 106:1--106:24
                  Quan Wang and   
                   Sheng Li and   
              Xinpeng Zhang and   
                    Guorui Feng   Multi-granularity Brushstrokes Network
                                  for Universal Style Transfer . . . . . . 107:1--107:17
               Nidhi Saxena and   
          Balasubramanian Raman   Pansharpening Scheme Using
                                  Bi-dimensional Empirical Mode
                                  Decomposition and Neural Network . . . . 108:1--108:22
                Jingjing Wu and   
              Jianguo Jiang and   
                  Meibin Qi and   
                Cuiqun Chen and   
                 Jingjing Zhang   An End-to-end Heterogeneous Restraint
                                  Network for RGB-D Cross-modal Person
                                  Re-identification  . . . . . . . . . . . 109:1--109:22
                 Caixia Liu and   
                 Dehui Kong and   
               Shaofan Wang and   
                 Jinghua Li and   
                     Baocai Yin   A Spatial Relationship Preserving
                                  Adversarial Network for $3$D
                                  Reconstruction from a Single Depth View  110:1--110:22
                 Ruyong Ren and   
              Shaozhang Niu and   
                    Hua Ren and   
               Shubin Zhang and   
                Tengyue Han and   
                   Xiaohai Tong   ESRNet: Efficient Search and Recognition
                                  Network for Image Manipulation Detection 111:1--111:23
              Mingxing Duan and   
                   Kenli Li and   
                Jiayan Deng and   
                   Bin Xiao and   
                        Qi Tian   A Novel Multi-Sample Generation Method
                                  for Adversarial Attacks  . . . . . . . . 112:1--112:21
                   Yang Guo and   
                    Wei Gao and   
                   Siwei Ma and   
                          Ge Li   Accelerating Transform Algorithm
                                  Implementation for Efficient Intra
                                  Coding of 8K UHD Videos  . . . . . . . . 113:1--113:20

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 19, Number 1, January, 2023

                  Xuan Shao and   
                  Ying Shen and   
                  Lin Zhang and   
              Shengjie Zhao and   
                 Dandan Zhu and   
                    Yicong Zhou   SLAM for Indoor Parking: a Comprehensive
                                  Benchmark Dataset and a Tightly Coupled
                                  Semantic Framework . . . . . . . . . . . 1:1--1:??
              Prasen Sharma and   
                  Ira Bisht and   
                     Arijit Sur   Wavelength-based Attributed Deep Neural
                                  Network for Underwater Image Restoration 2:1--2:??
                     Jie Li and   
                   Ling Han and   
                Chong Zhang and   
                   Qiyue Li and   
                        Zhi Liu   Spherical Convolution Empowered Viewport
                                  Prediction in 360 Video Multicast with
                                  Limited FoV Feedback . . . . . . . . . . 3:1--3:??
           Thi-Ngoc-Hanh Le and   
               Chih-Kuo Yeh and   
               Ying-Chi Lin and   
                   Tong-Yee Lee   Animating Still Natural Images Using
                                  Warping  . . . . . . . . . . . . . . . . 4:1--4:??
                Lizhi Xiong and   
                   Xiao Han and   
            Ching-Nung Yang and   
                     Zhihua Xia   RDH-DES: Reversible Data Hiding over
                                  Distributed Encrypted-Image Servers
                                  Based on Secret Sharing  . . . . . . . . 5:1--5:??
               Peining Zhen and   
                 Shuqi Wang and   
               Suming Zhang and   
                Xiaotao Yan and   
                   Wei Wang and   
                 Zhigang Ji and   
                   Hai-Bao Chen   Towards Accurate Oriented Object
                                  Detection in Aerial Images with Adaptive
                                  Multi-level Feature Fusion . . . . . . . 6:1--6:??
                   Yue Song and   
                   Hao Tang and   
                  Nicu Sebe and   
                       Wei Wang   Disentangle Saliency Detection into
                                  Cascaded Detail Modeling and Body
                                  Filling  . . . . . . . . . . . . . . . . 7:1--7:??
                 Yong Zhang and   
                Yingwei Pan and   
                   Ting Yao and   
                  Rui Huang and   
                    Tao Mei and   
                 Chang-Wen Chen   Boosting Scene Graph Generation with
                                  Visual Relation Saliency . . . . . . . . 8:1--8:??
               Jingwen Chen and   
                Jianjie Luo and   
                Yingwei Pan and   
                   Yehao Li and   
                   Ting Yao and   
              Hongyang Chao and   
                        Tao Mei   Boosting Vision-and-Language Navigation
                                  with Direction Guiding and Backtracing   9:1--9:??
                  Yunbo Rao and   
               Ziqiang Yang and   
              Shaoning Zeng and   
                Qifeng Wang and   
                      Jiansu Pu   Dual Projective Zero-Shot Learning Using
                                  Text Descriptions  . . . . . . . . . . . 10:1--10:??
                    Hang Yu and   
              Chilam Cheang and   
                  Yanwei Fu and   
                  Xiangyang Xue   Multi-view Shape Generation for a $3$D
                                  Human-like Body  . . . . . . . . . . . . 11:1--11:??
               Weidong Chen and   
                 Guorong Li and   
              Xinfeng Zhang and   
                Shuhui Wang and   
                   Liang Li and   
                 Qingming Huang   Weakly Supervised Text-based
                                  Actor-Action Video Segmentation by
                                  Clip-level Multi-instance Learning . . . 12:1--12:??
               Feihong Shen and   
                        Jun Liu   Quantum Fourier Convolutional Network    13:1--13:??
                Xiaotian Wu and   
                       Peng Yao   Boolean-based Two-in-One Secret Image
                                  Sharing by Adaptive Pixel Grouping . . . 14:1--14:??
               Ashima Yadav and   
       Dinesh Kumar Vishwakarma   A Deep Multi-level Attentive Network for
                                  Multimodal Sentiment Analysis  . . . . . 15:1--15:??
                Honghao Gao and   
                 Baobin Dai and   
               Huaikou Miao and   
              Xiaoxian Yang and   
     Ramon J. Duran Barroso and   
                Hussain Walayat   A Novel GAPG Approach to Automatic
                                  Property Generation for Formal
                                  Verification: The GAN Perspective  . . . 16:1--16:??
               Pengyi Zhang and   
              Huanzhang Dou and   
                Wenhu Zhang and   
                 Yuhan Zhao and   
                  Zequn Qin and   
                Dongping Hu and   
                    Yi Fang and   
                          Xi Li   A Large-Scale Synthetic Gait Dataset
                                  Towards in-the-Wild Simulation and
                                  Comparison Study . . . . . . . . . . . . 17:1--17:??
                   Wei Zhou and   
                  Zhiwu Xia and   
                   Peng Dou and   
                     Tao Su and   
                     Haifeng Hu   Double Attention Based on Graph
                                  Attention Network for Image Multi-Label
                                  Classification . . . . . . . . . . . . . 18:1--18:??
              Xianlin Zhang and   
              Mengling Shen and   
                 Xueming Li and   
                   Xiaojie Wang   AABLSTM: a Novel Multi-task Based
                                  CNN-RNN Deep Model for Fashion Analysis  19:1--19:??
                  Deyin Liu and   
            Lin (Yuanbo) Wu and   
               Richang Hong and   
                Zongyuan Ge and   
                Jialie Shen and   
             Farid Boussaid and   
             Mohammed Bennamoun   Generative Metric Learning for
                                  Adversarially Robust Open-world Person
                                  Re-Identification  . . . . . . . . . . . 20:1--20:??
                  Shuo Wang and   
                 Huixia Ben and   
                 Yanbin Hao and   
                Xiangnan He and   
                      Meng Wang   Boosting Hyperspectral Image
                                  Classification with Dual Hierarchical
                                  Learning . . . . . . . . . . . . . . . . 21:1--21:??
                   Dayan Wu and   
                     Qi Dai and   
                      Bo Li and   
                   Weiping Wang   Deep Uncoupled Discrete Hashing via
                                  Similarity Matrix Decomposition  . . . . 22:1--22:??
                Ming Cheung and   
                 Weiwei Sun and   
                  James She and   
                   Jiantao Zhou   Social Network Analytic-Based Online
                                  Counterfeit Seller Detection using User
                                  Shared Images  . . . . . . . . . . . . . 23:1--23:??
                 Lu Feihong and   
                  Chen Hang and   
                    Li Kang and   
               Deng Qiliang and   
                  Zhao Jian and   
              Zhang Kaipeng and   
                       Han Hong   Toward High-quality Face-Mask Occluded
                                  Restoration  . . . . . . . . . . . . . . 24:1--24:??
                 Yajing Liu and   
               Zhiwei Xiong and   
                      Ya Li and   
                  Yuning Lu and   
                Xinmei Tian and   
                  Zheng-Jun Zha   Category-Stitch Learning for Union
                                  Domain Generalization  . . . . . . . . . 25:1--25:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 19, Number 1s, February, 2023

            Claudio Ferrari and   
         Federico Becattini and   
           Leonardo Galteri and   
              Alberto Del Bimbo   (Compress and Restore) N: a Robust
                                  Defense Against Adversarial Attacks on
                                  Image Classification . . . . . . . . . . 26:1--26:??
               Yaguang Song and   
              Xiaoshan Yang and   
                  Changsheng Xu   Self-supervised Calorie-aware
                                  Heterogeneous Graph Networks for Food
                                  Recommendation . . . . . . . . . . . . . 27:1--27:??
                   Feng Xue and   
                  Tian Yang and   
                   Kang Liu and   
                 Zikun Hong and   
                Mingwei Cao and   
                    Dan Guo and   
                   Richang Hong   LCSNet: End-to-end Lipreading with
                                  Channel-aware Feature Selection  . . . . 28:1--28:??
                  Zilong Fu and   
                Hongtao Xie and   
             Shancheng Fang and   
                 Yuxin Wang and   
              Mengting Xing and   
                 Yongdong Zhang   Learning Pixel Affinity Pyramid for
                                  Arbitrary-Shaped Text Detection  . . . . 29:1--29:??
João Baptista Cardia Neto and   
            Claudio Ferrari and   
    Aparecido Nilceu Marana and   
           Stefano Berretti and   
              Alberto Del Bimbo   Learning Streamed Attention Network from
                                  Descriptor Images for Cross-Resolution
                                  $3$D Face Recognition  . . . . . . . . . 30:1--30:??
                      Xin Huang   On Teaching Mode of MTI Translation
                                  Workshop Based on IPT Corpus for Tibetan
                                  Areas of China . . . . . . . . . . . . . 31:1--31:??
                  Liming Xu and   
               Xianhua Zeng and   
                Weisheng Li and   
                  Bochuan Zheng   MFGAN: Multi-modal Feature-fusion for CT
                                  Metal Artifact Reduction Using GANs  . . 32:1--32:??
                 Yuzhang Hu and   
                Wenhan Yang and   
                Jiaying Liu and   
                   Zongming Guo   Deep Inter Prediction with
                                  Error-Corrected Auto-Regressive Network
                                  for Video Coding . . . . . . . . . . . . 33:1--33:??
                     Yue Li and   
                   Li Zhang and   
                      Kai Zhang   iDAM: Iteratively Trained Deep In-loop
                                  Filter with Adaptive Model Selection . . 34:1--34:??
        Rahul Kumar Jaiswal and   
             Rajesh Kumar Dubey   CAQoE: a Novel No-Reference
                                  Context-aware Speech Quality Prediction
                                  Metric . . . . . . . . . . . . . . . . . 35:1--35:??
                  Tao Xiang and   
              Honghong Zeng and   
                 Biwen Chen and   
                   Shangwei Guo   BMIF: Privacy-preserving
                                  Blockchain-based Medical Image Fusion    36:1--36:??
                 Xiaoke Zhu and   
               Changlong Li and   
               Xiaopan Chen and   
                Xinyu Zhang and   
                 Xiao-Yuan Jing   Distance and Direction Based Deep
                                  Discriminant Metric Learning for Kinship
                                  Verification . . . . . . . . . . . . . . 37:1--37:??
             Weiming Zhuang and   
                    Xin Gan and   
               Yonggang Wen and   
                    Shuai Zhang   Optimizing Performance of Federated
                                  Person Re-identification: Benchmarking
                                  and Analysis . . . . . . . . . . . . . . 38:1--38:??
        Lavinia De Divitiis and   
         Federico Becattini and   
            Claudio Baecchi and   
              Alberto Del Bimbo   Disentangling Features for Fashion
                                  Recommendation . . . . . . . . . . . . . 39:1--39:??
                Ka-Hou Chan and   
                     Sio-Kei Im   Using Four Hypothesis Probability
                                  Estimators for CABAC in Versatile Video
                                  Coding . . . . . . . . . . . . . . . . . 40:1--40:??
                Mengqi Yuan and   
               Bing-Kun Bao and   
                  Zhiyi Tan and   
                  Changsheng Xu   Adaptive Text Denoising Network for
                                  Image Caption Editing  . . . . . . . . . 41:1--41:??
               Xiaoyu Zhang and   
                    Wei Gao and   
                      Ge Li and   
              Qiuping Jiang and   
                    Runmin Cong   Image Quality Assessment-driven
                                  Reinforcement Learning for Mixed
                                  Distorted Image Restoration  . . . . . . 42:1--42:??
              Chongyang Bai and   
            Maksim Bolonkin and   
             Viney Regunath and   
             V. S. Subrahmanian   DIPS: a Dyadic Impression Prediction
                                  System for Group Interaction Videos  . . 43:1--43:??
                 Yuqing Liu and   
              Xinfeng Zhang and   
               Shanshe Wang and   
                   Siwei Ma and   
                        Wen Gao   Sequential Hierarchical Learning with
                                  Distribution Transformation for Image
                                  Super-Resolution . . . . . . . . . . . . 44:1--44:??
               Haidong Wang and   
                    Xuan He and   
                 Zhiyong Li and   
                   Jin Yuan and   
                      Shutao Li   JDAN: Joint Detection and Association
                                  Network for Real-Time Online
                                  Multi-Object Tracking  . . . . . . . . . 45:1--45:??
               Mengyao Xiao and   
                Xiaolong Li and   
                   Yao Zhao and   
                     Bin Ma and   
                    Guodong Guo   A Novel Reversible Data Hiding Scheme
                                  Based on Pixel-Residual Histogram  . . . 46:1--46:??
                 Jiazhi Liu and   
                       Feng Liu   Modified $2$D-Ghost-Free Stereoscopic
                                  Display with Depth-of-Field Effects  . . 47:1--47:??
               Jingwen Chen and   
                Yingwei Pan and   
                   Yehao Li and   
                   Ting Yao and   
              Hongyang Chao and   
                        Tao Mei   Retrieval Augmented Convolutional
                                  Encoder-decoder Networks for Video
                                  Captioning . . . . . . . . . . . . . . . 48:1--48:??
                 Guanyu Zhu and   
                  Yong Zhou and   
                    Rui Yao and   
               Hancheng Zhu and   
                     Jiaqi Zhao   Cyclic Self-attention for Point Cloud
                                  Recognition  . . . . . . . . . . . . . . 49:1--49:??
               Dinghao Yang and   
                    Wei Gao and   
                      Ge Li and   
                   Hui Yuan and   
                 Junhui Hou and   
                      Sam Kwong   Exploiting Manifold Feature
                                  Representation for Efficient
                                  Classification of $3$D Point Clouds  . . 50:1--50:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 19, Number 2, March, 2023

                Xiaohan Lan and   
                Yitian Yuan and   
                   Xin Wang and   
                   Zhi Wang and   
                      Wenwu Zhu   A Survey on Temporal Sentence Grounding
                                  in Videos  . . . . . . . . . . . . . . . 51:1--51:??
                    Yu Qiao and   
                  Yuhao Liu and   
                   Ziqi Wei and   
                 Yuxin Wang and   
                  Qiang Cai and   
              Guofeng Zhang and   
                       Xin Yang   Hierarchical and Progressive Image
                                  Matting  . . . . . . . . . . . . . . . . 52:1--52:??
                   Fei Peng and   
               Wenyan Jiang and   
                       Min Long   A Low Distortion and
                                  Steganalysis-resistant Reversible Data
                                  Hiding for $2$D Engineering Graphics . . 53:1--53:??
                  Sijie Mai and   
              Songlong Xing and   
                 Jiaxuan He and   
                  Ying Zeng and   
                     Haifeng Hu   Multimodal Graph for Unaligned
                                  Multimodal Sequence Analysis via Graph
                                  Convolution and Graph Pooling  . . . . . 54:1--54:??
                   Qi Zheng and   
              Jianfeng Dong and   
                  Xiaoye Qu and   
                   Xun Yang and   
                Yabing Wang and   
                   Pan Zhou and   
                Baolong Liu and   
                       Xun Wang   Progressive Localization Networks for
                                  Language-Based Moment Localization . . . 55:1--55:??
                  Yue Zhang and   
              Fanghui Zhang and   
                     Yi Jin and   
                 Yigang Cen and   
         Viacheslav Voronin and   
                    Shaohua Wan   Local Correlation Ensemble with GCN
                                  Based on Attention Features for
                                  Cross-domain Person Re-ID  . . . . . . . 56:1--56:??
           Jacob Chakareski and   
              Mahmudur Khan and   
           Tanguy Ropitault and   
                 Steve Blandino   Millimeter Wave and Free-space-optics
                                  for Future Dual-connectivity 6DOF Mobile
                                  Multi-user VR Streaming  . . . . . . . . 57:1--57:??
               Yun-Shao Lin and   
               Yi-Ching Liu and   
                   Chi-Chun Lee   An Interaction-process-guided Framework
                                  for Small-group Performance Prediction   58:1--58:??
                   Na Zheng and   
               Xuemeng Song and   
                  Tianyu Su and   
                Weifeng Liu and   
                    Yan Yan and   
                    Liqiang Nie   Egocentric Early Action Prediction via
                                  Adversarial Knowledge Distillation . . . 59:1--59:??
                    Li Wang and   
                      Ke Li and   
              Jingjing Tang and   
                   Yuying Liang   Image Super-Resolution via Lightweight
                                  Attention-Directed Feature Aggregation
                                  Network  . . . . . . . . . . . . . . . . 60:1--60:??
                Jiaying Lin and   
                    Xin Tan and   
                      Ke Xu and   
                Lizhuang Ma and   
               Rynson W. H. Lau   Frequency-aware Camouflaged Object
                                  Detection  . . . . . . . . . . . . . . . 61:1--61:??
               Shuang Liang and   
                  Anjie Zhu and   
             Jiasheng Zhang and   
                       Jie Shao   Hyper-node Relational Graph Attention
                                  Network for Multi-modal Knowledge Graph
                                  Completion . . . . . . . . . . . . . . . 62:1--62:??
                   Yaya Shi and   
                 Haiyang Xu and   
              Chunfeng Yuan and   
                    Bing Li and   
                 Weiming Hu and   
                  Zheng-Jun Zha   Learning Video-Text Aligned
                                  Representations for Video Captioning . . 63:1--63:??
                  Yang Yang and   
               Yingqiu Ding and   
                 Ming Cheng and   
                  Weiming Zhang   No-reference Quality Assessment for
                                  Contrast-distorted Images Based on Gray
                                  and Color-gray-difference Space  . . . . 64:1--64:??
                   Jia Wang and   
               Jingcheng Ke and   
             Hong-Han Shuai and   
                Yung-Hui Li and   
                Wen-Huang Cheng   Referring Expression Comprehension Via
                                  Enhanced Cross-modal Graph Attention
                                  Networks . . . . . . . . . . . . . . . . 65:1--65:??
             Dengyong Zhang and   
                   Pu Huang and   
             Xiangling Ding and   
                    Feng Li and   
                 Wenjie Zhu and   
                   Yun Song and   
                     Gaobo Yang   L$^2$BEC$^2$: Local Lightweight
                                  Bidirectional Encoding and Channel
                                  Attention Cascade for Video Frame
                                  Interpolation  . . . . . . . . . . . . . 66:1--66:??
                Yushu Zhang and   
                   Qing Tan and   
                  Shuren Qi and   
                     Mingfu Xue   PRNU-based Image Forgery Localization
                                  with Deep Multi-scale Fusion . . . . . . 67:1--67:??
              Shanshan Dong and   
                 Tianzi Niu and   
                    Xin Luo and   
                     Wu Liu and   
                     Xinshun Xu   Semantic Embedding Guided Attention with
                                  Explicit Visual Feature Fusion for Video
                                  Captioning . . . . . . . . . . . . . . . 68:1--68:??
                 Shunxin Xu and   
                     Ke Sun and   
                   Dong Liu and   
               Zhiwei Xiong and   
                  Zheng-Jun Zha   Synergy between Semantic Segmentation
                                  and Image Denoising via Alternate
                                  Boosting . . . . . . . . . . . . . . . . 69:1--69:??
                   Dan Song and   
             Chu-Meng Zhang and   
             Xiao-Qian Zhao and   
                  Teng Wang and   
                Wei-Zhi Nie and   
                 Xuan-Ya Li and   
                      An-An Liu   Self-supervised Image-based $3$D Model
                                  Retrieval  . . . . . . . . . . . . . . . 70:1--70:??
            Stavros Nousias and   
        Gerasimos Arvanitis and   
                 Aris Lalos and   
         Konstantinos Moustakas   Deep Saliency Mapping for $3$D Meshes
                                  and Applications . . . . . . . . . . . . 71:1--71:??
                    Yun Liu and   
                Xiaohua Yin and   
                Zuliang Wan and   
               Guanghui Yue and   
                      Zhi Zheng   Toward A No-reference Omnidirectional
                                  Image Quality Evaluation by Using
                                  Multi-perceptual Features  . . . . . . . 72:1--72:??
                     Hua Wu and   
                     Xin Li and   
                  Gang Wang and   
                Guang Cheng and   
                     Xiaoyan Hu   Resolution Identification of Encrypted
                                  Video Streaming Based on HTTP/2 Features 73:1--73:??
                   Qipu Qin and   
                  Cheolkon Jung   Quality Enhancement of Compressed $
                                  360$-Degree Videos Using Viewport-based
                                  Deep Neural Networks . . . . . . . . . . 74:1--74:??
                   Wei Zhou and   
                  Zhiwu Xia and   
                   Peng Dou and   
                     Tao Su and   
                     Haifeng Hu   Aligning Image Semantics and Label
                                  Concepts for Image Multi-Label
                                  Classification . . . . . . . . . . . . . 75:1--75:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 19, Number 2s, April, 2023

            Summaira Jabeen and   
                      Xi Li and   
        Muhammad Shoib Amin and   
              Omar Bourahla and   
                Songyuan Li and   
                   Abdul Jabbar   A Review on Methods and Applications in
                                  Multimodal Deep Learning . . . . . . . . 76:1--76:??
           Sophie C. C. Sun and   
              Yongkang Zhao and   
                Fang-Wei Fu and   
                      Yawei Ren   Improved Random Grid-based Cheating
                                  Prevention Visual Cryptography Using
                                  Latin Square . . . . . . . . . . . . . . 77:1--77:??
                 Jiong Dong and   
                  Kaoru Ota and   
                 Mianxiong Dong   Video Frame Interpolation: a
                                  Comprehensive Survey . . . . . . . . . . 78:1--78:??
                Gaofeng Cao and   
                   Fei Zhou and   
                Kanglin Liu and   
                 Anjie Wang and   
                    Leidong Fan   A Decoupled Kernel Prediction Network
                                  Guided by Soft Mask for Single Image HDR
                                  Reconstruction . . . . . . . . . . . . . 79:1--79:??
                 Yipeng Liu and   
                    Qi Yang and   
                  Yiling Xu and   
                        Le Yang   Point Cloud Quality Assessment: Dataset
                                  Construction and Learning-based
                                  No-reference Metric  . . . . . . . . . . 80:1--80:??
                   Cheng Xu and   
                 Zejun Chen and   
                 Jiajie Mai and   
                 Xuemiao Xu and   
                   Shengfeng He   Pose- and Attribute-consistent Person
                                  Image Synthesis  . . . . . . . . . . . . 81:1--81:??
              Jae Hyun Park and   
               Sanghoon Kim and   
               Joo Chan Lee and   
                   Jong Hwan Ko   Scalable Color Quantization for
                                  Task-centric Image Compression . . . . . 82:1--82:??
 Joan Manuel Marqu\`es Puig and   
         Helena Rif\`a-Pous and   
                 Samia Oukemeni   From False-Free to Privacy-Oriented
                                  Communitarian Microblogging Social
                                  Networks . . . . . . . . . . . . . . . . 83:1--83:??
                Yiming Tang and   
                          Yi Yu   Query-Guided Prototype Learning with
                                  Decoder Alignment and Dynamic Fusion in
                                  Few-Shot Segmentation  . . . . . . . . . 84:1--84:??
                Zhiming Liu and   
                    Kai Niu and   
                    Zhiqiang He   ML-CookGAN: Multi-Label Generative
                                  Adversarial Network for Food Image
                                  Generation . . . . . . . . . . . . . . . 85:1--85:??
            Basheer Alwaely and   
            Charith Abhayaratne   GHOSM: Graph-based Hybrid Outline and
                                  Skeleton Modelling for Shape Recognition 86:1--86:??
        Sankaraganesh Jonna and   
             Moushumi Medhi and   
             Rajiv Ranjan Sahay   Distill-DBDGAN: Knowledge Distillation
                                  and Adversarial Learning Framework for
                                  Defocus Blur Detection . . . . . . . . . 87:1--87:??
                Xuewei Ding and   
                Yingwei Pan and   
                   Yehao Li and   
                   Ting Yao and   
                   Dan Zeng and   
                        Tao Mei   Boosting Relationship Detection in
                                  Images with Multi-Granular
                                  Self-Supervised Learning . . . . . . . . 88:1--88:??
                 Binfei Chu and   
                 Yiting Lin and   
               Bineng Zhong and   
               Zhenjun Tang and   
                Xianxian Li and   
                      Jing Wang   Robust Long-Term Tracking via Localizing
                                  Occluders  . . . . . . . . . . . . . . . 89:1--89:??
                   Huisi Wu and   
                Zhaoze Wang and   
                Zhuoying Li and   
                Zhenkun Wen and   
                       Jing Qin   Context Prior Guided Semantic Modeling
                                  for Biomedical Image Segmentation  . . . 90:1--90:??
                     Jun Wu and   
              Tianliang Zhu and   
                 Jiahui Zhu and   
                  Tianyi Li and   
                   Chunzhi Wang   A Optimized BERT for Multimodal
                                  Sentiment Analysis . . . . . . . . . . . 91:1--91:??
                Yongzong Xu and   
               Zhijing Yang and   
              Tianshui Chen and   
                     Kai Li and   
                   Chunmei Qing   Progressive Transformer Machine for
                                  Natural Character Reenactment  . . . . . 92:1--92:??
             Chong Hong Tan and   
              Koksheik Wong and   
       Vishnu Monn Baskaran and   
           Kiki Adhinugraha and   
                   David Taniar   Is it Violin or Viola? Classifying the
                                  Instruments' Music Pieces using
                                  Descriptive Statistics . . . . . . . . . 93:1--93:??
                   KN Singh and   
                   OP Singh and   
           Amit Kumar Singh and   
            Amrit Kumar Agrawal   EiMOL: a Secure Medical Image Encryption
                                  Algorithm based on Optimization and the
                                  Lorenz System  . . . . . . . . . . . . . 94:1--94:??
                Ziteng Qiao and   
                 Dianxi Shi and   
                Xiaodong Yi and   
                 Yanyan Shi and   
                Yuhui Zhang and   
                   Yangyang Liu   UEFPN: Unified and Enhanced Feature
                                  Pyramid Networks for Small Object
                                  Detection  . . . . . . . . . . . . . . . 95:1--95:??
                 Linwei Zhu and   
                  Yun Zhang and   
                      Na Li and   
               Gangyi Jiang and   
                      Sam Kwong   Deep Learning-Based Intra Mode
                                  Derivation for Versatile Video Coding    96:1--96:??
               Donghuo Zeng and   
                Jianming Wu and   
                Gen Hattori and   
                    Rong Xu and   
                          Yi Yu   Learning Explicit and Implicit Dual
                                  Common Subspaces for Audio-visual
                                  Cross-modal Retrieval  . . . . . . . . . 97:1--97:??
                   Qiqi Gao and   
                     Jie Li and   
                Tiejun Zhao and   
                    Yadong Wang   Real-time Image Enhancement with
                                  Attention Aggregation  . . . . . . . . . 98:1--98:??
                Yucheng Zhu and   
               Xiongkuo Min and   
                 Dandan Zhu and   
              Guangtao Zhai and   
              Xiaokang Yang and   
               Wenjun Zhang and   
                      Ke Gu and   
                   Jiantao Zhou   Toward Visual Behavior and Attention
                                  Understanding for Augmented 360 Degree
                                  Videos . . . . . . . . . . . . . . . . . 99:1--99:??
                Haiyang Mei and   
                  Letian Yu and   
                      Ke Xu and   
                  Yang Wang and   
                   Xin Yang and   
               Xiaopeng Wei and   
               Rynson W. H. Lau   Mirror Segmentation via Semantic-aware
                                  Contextual Contrasted Feature Learning   100:1--100:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 19, Number 3, May, 2023

                   Yi Zhang and   
               Fang-Yi Chao and   
          Wassim Hamidouche and   
               Olivier Deforges   PAV-SOD: a New Task towards Panoramic
                                  Audiovisual Saliency Detection . . . . . 101:1--101:??
                    Chi Xie and   
               Zikun Zhuang and   
              Shengjie Zhao and   
                   Shuang Liang   Temporal Dropout for Weakly Supervised
                                  Action Localization  . . . . . . . . . . 102:1--102:??
               Yangyang Guo and   
                Liqiang Nie and   
                Harry Cheng and   
              Zhiyong Cheng and   
          Mohan Kankanhalli and   
              Alberto Del Bimbo   On Modality Bias Recognition and
                                  Reduction  . . . . . . . . . . . . . . . 103:1--103:??
                    Kang Xu and   
                  Weixin Li and   
                   Xia Wang and   
                 Xiaoyan Hu and   
                     Ke Yan and   
               Xiaojie Wang and   
                      Xuan Dong   CUR Transformer: a Convolutional
                                  Unbiased Regional Transformer for Image
                                  Denoising  . . . . . . . . . . . . . . . 104:1--104:??
               Wenxin Huang and   
                 Xuemei Jia and   
                 Xian Zhong and   
                  Xiao Wang and   
                  Kui Jiang and   
                     Zheng Wang   Beyond the Parts: Learning
                                  Coarse-to-Fine Adaptive Alignment
                                  Representation for Person Search . . . . 105:1--105:??
               Hongchuan Yu and   
             Mengqing Huang and   
                 Jian Jun Zhang   Domain Adaptation Problem in Sketch
                                  Based Image Retrieval  . . . . . . . . . 106:1--106:??
                    Han Yan and   
               Haijun Zhang and   
               Jianyang Shi and   
               Jianghong Ma and   
                     Xiaofei Xu   Toward Intelligent Fashion Design: a
                                  Texture and Shape Disentangled
                                  Generative Adversarial Network . . . . . 107:1--107:??
                   Peng Dou and   
                  Ying Zeng and   
               Zhuoqun Wang and   
                     Haifeng Hu   Multiple Temporal Pooling Mechanisms for
                                  Weakly Supervised Temporal Action
                                  Localization . . . . . . . . . . . . . . 108:1--108:??
                     Lei Li and   
               Zhiyuan Zhou and   
                  Suping Wu and   
                   Yongrong Cao   Multi-scale Edge-guided Learning for
                                  $3$D Reconstruction  . . . . . . . . . . 109:1--109:??
              Zhengxue Wang and   
               Guangwei Gao and   
                Juncheng Li and   
                    Hui Yan and   
                  Hao Zheng and   
                      Huimin Lu   Lightweight Feature De-redundancy and
                                  Self-calibration Network for Efficient
                                  Image Super-resolution . . . . . . . . . 110:1--110:??
               Zhijie Huang and   
                    Jun Sun and   
                   Xiaopeng Guo   FastCNN: Towards Fast and Accurate
                                  Spatiotemporal Network for HEVC
                                  Compressed Video Enhancement . . . . . . 111:1--111:??
               Xiaohan Wang and   
                Linchao Zhu and   
                     Fei Wu and   
                        Yi Yang   A Differentiable Parallel Sampler for
                                  Efficient Video Classification . . . . . 112:1--112:??
                  Junjie Li and   
                   Jin Yuan and   
                     Zhiyong Li   TP-FER: an Effective Three-phase
                                  Noise-tolerant Recognizer for Facial
                                  Expression Recognition . . . . . . . . . 113:1--113:??
               Baojin Huang and   
             Zhongyuan Wang and   
            Guangcheng Wang and   
                   Zhen Han and   
                      Kui Jiang   Local Eyebrow Feature Attention Network
                                  for Masked Face Recognition  . . . . . . 114:1--114:??
             Bin-Cheng Yang and   
                    Gangshan Wu   Efficient Single-image Super-resolution
                                  Using Dual path Connections with
                                  Multiple scale Learning  . . . . . . . . 115:1--115:??
                   Wei Zhou and   
                  Yanke Hou and   
                  Dihu Chen and   
                 Haifeng Hu and   
                         Tao Su   Attention-Augmented Memory Network for
                                  Image Multi-Label Classification . . . . 116:1--116:??
             Shuaixiong Hui and   
                  Qiang Guo and   
                Xiaoyu Geng and   
                  Caiming Zhang   Multi-Guidance CNNs for Salient Object
                                  Detection  . . . . . . . . . . . . . . . 117:1--117:??
                   Kai Xing and   
                     Tao Li and   
                   Xuanhan Wang   ProposalVLAD with Proposal-Intra
                                  Exploring for Temporal Action Proposal
                                  Generation . . . . . . . . . . . . . . . 118:1--118:??
                   Hao Tang and   
                   Lei Ding and   
                Songsong Wu and   
                    Bin Ren and   
                  Nicu Sebe and   
                     Paolo Rota   Deep Unsupervised Key Frame Extraction
                                  for Efficient Video Classification . . . 119:1--119:??
                 Ling Zhang and   
            Chengjiang Long and   
             Xiaolong Zhang and   
                   Chunxia Xiao   Exploiting Residual and Illumination
                                  with GANs for Shadow Detection and
                                  Shadow Removal . . . . . . . . . . . . . 120:1--120:??
                Yushu Zhang and   
                   Nuo Chen and   
                  Shuren Qi and   
                 Mingfu Xue and   
                   Zhongyun Hua   Detection of Recolored Image by Texture
                                  Features in Chrominance Components . . . 121:1--121:??
                    Han Xue and   
                   Jun Ling and   
                  Anni Tang and   
                    Li Song and   
                   Rong Xie and   
                   Wenjun Zhang   High-Fidelity Face Reenactment Via
                                  Identity-Matched Correspondence Learning 122:1--122:??
                Haozhe Chen and   
                  Hang Zhou and   
                  Jie Zhang and   
              Dongdong Chen and   
              Weiming Zhang and   
               Kejiang Chen and   
                   Gang Hua and   
                     Nenghai Yu   Perceptual Hashing of Deep Convolutional
                                  Neural Networks for Model Copy Detection 123:1--123:??
                   Wei Duan and   
                      Yi Yu and   
               Xulong Zhang and   
                 Suhua Tang and   
                     Wei Li and   
                    Keizo Oyama   Melody Generation from Lyrics with Local
                                  Interpretability . . . . . . . . . . . . 124:1--124:??
               Shiguang Liu and   
                    Huixin Wang   Talking Face Generation via Facial
                                  Anatomy  . . . . . . . . . . . . . . . . 125:1--125:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 19, Number 3s, June, 2023

                Zengri Zeng and   
               Baokang Zhao and   
             Han-Chieh Chao and   
                  Ilsun You and   
                Kuo-Hui Yeh and   
                    Weizhi Meng   Towards Intelligent Attack Detection
                                  Using DNA Computing  . . . . . . . . . . 126:1--126:??
                Jinxia Wang and   
                   Rui Chen and   
                      Zhihan Lv   DNA Computing-Based Multi-Source Data
                                  Storage Model in Digital Twins . . . . . 127:1--127:??
                Fawad Ahmed and   
           Muneeb Ur Rehman and   
                Jawad Ahmad and   
      Muhammad Shahbaz Khan and   
              Wadii Boulila and   
          Gautam Srivastava and   
         Jerry Chun-Wei Lin and   
            William J. Buchanan   A DNA Based Colour Image Encryption
                                  Scheme Using A Convolutional Autoencoder 128:1--128:??
           Vignesh V. Menon and   
              Hadi Amirpour and   
          Mohammad Ghanbari and   
             Christian Timmerer   EMES: Efficient Multi-encoding Schemes
                                  for HEVC-based Adaptive Bitrate
                                  Streaming  . . . . . . . . . . . . . . . 129:1--129:??
                Jiwei Zhang and   
                      Yi Yu and   
                 Suhua Tang and   
                Jianming Wu and   
                         Wei Li   Variational Autoencoder with CCA for
                                  Audio-Visual Cross-modal Retrieval . . . 130:1--130:??
           Thi-Ngoc-Hanh Le and   
              Ya-Hsuan Chen and   
                   Tong-Yee Lee   Structure-aware Video Style Transfer
                                  with Map Art . . . . . . . . . . . . . . 131:1--131:??
                 Sirui Zhao and   
               Hongyu Jiang and   
                Hanqing Tao and   
                    Rui Zha and   
                  Kun Zhang and   
                    Tong Xu and   
                    Enhong Chen   PEDM: a Multi-task Learning Model for
                                  Persona-aware Emoji-embedded Dialogue
                                  Generation . . . . . . . . . . . . . . . 132:1--132:??
                 Heyu Huang and   
                Runmin Cong and   
                Lianhe Yang and   
                    Ling Du and   
                  Cong Wang and   
                      Sam Kwong   Feedback Chain Network for Hippocampus
                                  Segmentation . . . . . . . . . . . . . . 133:1--133:??
               Xuanrong Yao and   
                   Xin Wang and   
                    Yue Liu and   
                      Wenwu Zhu   Continual Recognition with Adaptive
                                  Memory Update  . . . . . . . . . . . . . 134:1--134:??
               Jingyao Wang and   
                Luntian Mou and   
                     Lei Ma and   
               Tiejun Huang and   
                        Wen Gao   AMSA: Adaptive Multimodal Learning for
                                  Sentiment Analysis . . . . . . . . . . . 135:1--135:??
              Shaoning Zeng and   
                  Yunbo Rao and   
                  Bob Zhang and   
                        Yong Xu   Joint Augmented and Compressed
                                  Dictionaries for Robust Image
                                  Classification . . . . . . . . . . . . . 136:1--136:??
              Yuyang Wanyan and   
              Xiaoshan Yang and   
                    Xuan Ma and   
                  Changsheng Xu   Dual Scene Graph Convolutional Network
                                  for Motivation Prediction  . . . . . . . 137:1--137:??
                    Fei Lei and   
                Zhongqi Cao and   
                Yuning Yang and   
                  Yibo Ding and   
                     Cong Zhang   Learning the User's Deeper Preferences
                                  for Multi-modal Recommendation Systems   138:1--138:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 19, Number 4, July, 2023

                  Xuehu Yan and   
                Longlong Li and   
                    Lei Sun and   
                   Jia Chen and   
                   Shudong Wang   Fake and Dishonest Participant Immune
                                  Secret Image Sharing . . . . . . . . . . 139:1--139:??
                  Song Yang and   
                   Qiang Li and   
                  Wenhui Li and   
                 Xuan-Ya Li and   
                    Ran Jin and   
                      Bo Lv and   
                   Rui Wang and   
                       Anan Liu   Semantic Completion and Filtration for
                                  Image-Text Retrieval . . . . . . . . . . 140:1--140:??
                    Xuan Ma and   
              Xiaoshan Yang and   
                  Changsheng Xu   Multi-Source Knowledge Reasoning Graph
                                  Network for Multi-Modal Commonsense
                                  Inference  . . . . . . . . . . . . . . . 141:1--141:??
                 Shangxi Wu and   
                 Jitao Sang and   
                 Kaiyuan Xu and   
              Jiaming Zhang and   
                        Jian Yu   Attention, Please! Adversarial Defense
                                  via Activation Rectification and
                                  Preservation . . . . . . . . . . . . . . 142:1--142:??
                   Kan Wang and   
             Changxing Ding and   
               Jianxin Pang and   
                    Xiangmin Xu   Context Sensing Attention Network for
                                  Video-based Person Re-identification . . 143:1--143:??
               Wenjing Wang and   
                 Lilang Lin and   
                  Zejia Fan and   
                    Jiaying Liu   Semi-supervised Learning for Mars
                                  Imagery Classification and Segmentation  144:1--144:??
                    Hui Liu and   
                Shanshan Li and   
                Jicheng Zhu and   
                   Kai Deng and   
                   Meng Liu and   
                    Liqiang Nie   DDIFN: a Dual-discriminator Multi-modal
                                  Medical Image Fusion Network . . . . . . 145:1--145:??
                 Xintian Wu and   
                Huanyu Wang and   
                  Yiming Wu and   
                          Xi Li   D$^3$T-GAN: Data-Dependent Domain
                                  Transfer GANs for Image Generation with
                                  Limited Data . . . . . . . . . . . . . . 146:1--146:??
                 Dandan Zhu and   
                  Xuan Shao and   
            Qiangqiang Zhou and   
               Xiongkuo Min and   
              Guangtao Zhai and   
                  Xiaokang Yang   A Novel Lightweight Audio-visual
                                  Saliency Model for Videos  . . . . . . . 147:1--147:??
             Amr Abdussalam and   
                 Zhongfu Ye and   
              Ammar Hawbani and   
             Majjed Al-Qatf and   
                    Rashid Khan   NumCap: a Number-controlled
                                  Multi-caption Image Captioning Network   148:1--148:??
                    Hao Liu and   
                 Zhaoyu Yan and   
                   Bing Liu and   
                 Jiaqi Zhao and   
                  Yong Zhou and   
         Abdulmotaleb El Saddik   Distilled Meta-learning for Multi-Class
                                  Incremental Learning . . . . . . . . . . 149:1--149:??
                   Jin Yuan and   
                Shikai Chen and   
                  Yao Zhang and   
              Zhongchao Shi and   
                   Xin Geng and   
               Jianping Fan and   
                       Yong Rui   Graph Attention Transformer Network for
                                  Multi-label Image Classification . . . . 150:1--150:??
                 Guojia Hou and   
                  Yuxuan Li and   
                  Huan Yang and   
                 Kunqian Li and   
                   Zhenkuan Pan   UID2021: an Underwater Image Dataset for
                                  Evaluation of No-Reference Quality
                                  Assessment Metrics . . . . . . . . . . . 151:1--151:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 19, Number 5, September, 2023

            Niklas Carlsson and   
                    Derek Eager   Cross-User Similarities in Viewing
                                  Behavior for 360${}^\circ $ Video and
                                  Caching Implications . . . . . . . . . . 152:1--152:??
                 Ziqiang Li and   
                Pengfei Xia and   
                    Xue Rui and   
                         Bin Li   Exploring the Effect of High-frequency
                                  Components in GANs Training  . . . . . . 153:1--153:??
                Haibing Yin and   
               Hongkui Wang and   
                      Li Yu and   
               Junhui Liang and   
                  Guangtao Zhai   Feedforward and Feedback Modulations
                                  Based Foveated JND Estimation for Images 154:1--154:??
                Taocun Yang and   
               Yaping Huang and   
                 Yanlin Xie and   
                  Junbo Liu and   
                 Shengchun Wang   MixOOD: Improving Out-of-distribution
                                  Detection with Enhanced Data Mixup . . . 155:1--155:??
                    Hao Wei and   
                       Rui Chen   A Multi-Level Consistency Network for
                                  High-Fidelity Virtual Try-On . . . . . . 156:1--156:??
               Jiachang Hao and   
                Haifeng Sun and   
                Pengfei Ren and   
               Yiming Zhong and   
                Jingyu Wang and   
                      Qi Qi and   
                   Jianxin Liao   Fine-Grained Text-to-Video Temporal
                                  Grounding from Coarse Boundary . . . . . 157:1--157:??
                  Weixin Li and   
               Tiantian Cao and   
                  Chang Liu and   
                   Xue Tian and   
                      Ya Li and   
               Xiaojie Wang and   
                      Xuan Dong   Dual-Lens HDR using Guided $3$D Exposure
                                  CNN and Guided Denoising Transformer . . 158:1--158:??
                   Xin Yang and   
                 Hengrui Li and   
               Xiaochuan Li and   
                         Tao Li   HIFGAN: a High-Frequency
                                  Information-Based Generative Adversarial
                                  Network for Image Super-Resolution . . . 159:1--159:??
                        Yang Li   Detection of Moving Object Using
                                  Superpixel Fusion Network  . . . . . . . 160:1--160:??
                Yingwei Pan and   
                   Yehao Li and   
                   Ting Yao and   
                        Tao Mei   Bottom-up and Top-down Object Inference
                                  Networks for Image Captioning  . . . . . 161:1--161:??
                Duoduo Feng and   
               Xiangteng He and   
                     Yuxin Peng   MKVSE: Multimodal Knowledge Enhanced
                                  Visual-semantic Embedding for Image-text
                                  Retrieval  . . . . . . . . . . . . . . . 162:1--162:??
                Mengyi Zhao and   
                   Hao Tang and   
                    Pan Xie and   
                Shuling Dai and   
                  Nicu Sebe and   
                       Wei Wang   Bidirectional Transformer GAN for
                                  Long-term Human Motion Prediction  . . . 163:1--163:??
                  Jian Wang and   
                 Qiang Ling and   
                      Peiyan Li   Robust Video Stabilization based on
                                  Motion Decomposition . . . . . . . . . . 164:1--164:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 19, Number 5s, October, 2023

           Pasi Fränti and   
                    Nancy Fazal   Design Principles for Content Creation
                                  in Location-Based Games  . . . . . . . . 165:1--165:??
              Chenchi Zhang and   
                   Wenbo Ma and   
                   Jun Xiao and   
              Hanwang Zhang and   
                  Jian Shao and   
             Yueting Zhuang and   
                      Long Chen   VL-NMS: Breaking Proposal Bottlenecks in
                                  Two-stage Visual-language Matching . . . 166:1--166:??
        Micha\l Ma\'ckowski and   
               Piotr Brzoza and   
            Mateusz Kawulok and   
              Rafa\l Meisel and   
               Dominik Spinczyk   Multimodal Presentation of Interactive
                                  Audio-Tactile Graphics Supporting the
                                  Perception of Visual Information by
                                  Blind People . . . . . . . . . . . . . . 167:1--167:??
                    Xin Man and   
                   Jie Shao and   
                 Feiyu Chen and   
             Mingxing Zhang and   
                  Heng Tao Shen   TEVL: Trilinear Encoder for
                                  Video-language Representation Learning   168:1--168:??
               Simone Ricci and   
           Tiberio Uricchio and   
              Alberto Del Bimbo   Meta-learning Advisor Networks for
                                  Long-tail and Noisy Labels in Social
                                  Image Classification . . . . . . . . . . 169:1--169:??
                    Chen Li and   
                    Li Song and   
                   Rong Xie and   
                   Wenjun Zhang   Local Bidirection Recurrent Network for
                                  Efficient Video Deblurring with the
                                  Fused Temporal Merge Module  . . . . . . 170:1--170:??
                Tian-Zi Niu and   
              Zhen-Duo Chen and   
                    Xin Luo and   
             Peng-Fei Zhang and   
                   Zi Huang and   
                    Xin-Shun Xu   Video Captioning by Learning from Global
                                  Sentence and Looking Ahead . . . . . . . 171:1--171:??
                  Yang Wang and   
                    Bo Dong and   
                      Ke Xu and   
                Haiyin Piao and   
                 Yufei Ding and   
                 Baocai Yin and   
                       Xin Yang   A Geometrical Approach to Evaluate the
                                  Adversarial Robustness of Deep Neural
                                  Networks . . . . . . . . . . . . . . . . 172:1--172:??
             Suncheng Xiang and   
                Dahong Qian and   
              Mengyuan Guan and   
                 Binjie Yan and   
                   Ting Liu and   
                  Yuzhuo Fu and   
                    Guanjie You   Less Is More: Learning from Synthetic
                                  Data with Fine-Grained Attributes for
                                  Person Re-Identification . . . . . . . . 173:1--173:??
            Matti Siekkinen and   
Teemu Kämäräinen   Neural Network Assisted Depth Map
                                  Packing for Compression Using Standard
                                  Hardware Video Codecs  . . . . . . . . . 174:1--174:??
 Bianca Jansen van Rensburg and   
            Pauline Puteaux and   
              William Puech and   
            Jean-Pierre Pedeboy   $3$D Object Watermarking from Data
                                  Hiding in the Homomorphic Encrypted
                                  Domain . . . . . . . . . . . . . . . . . 175:1--175:??
                    Hao Liu and   
              Xiaoshan Yang and   
                  Changsheng Xu   Counterfactual Scenario-relevant
                                  Knowledge-enriched Multi-modal Emotion
                                  Reasoning  . . . . . . . . . . . . . . . 176:1--176:??
             Melika Ayoughi and   
              Pascal Mettes and   
                     Paul Groth   Self-contained Entity Discovery from
                                  Captioned Videos . . . . . . . . . . . . 177:1--177:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 19, Number 6, November, 2023

                    Jin Xie and   
                Yanwei Pang and   
                   Jing Pan and   
                   Jing Nie and   
                  Jiale Cao and   
                    Jungong Han   Complementary Feature Pyramid Network
                                  for Object Detection . . . . . . . . . . 178:1--178:??
                Tianyi Wang and   
                Harry Cheng and   
               Kam Pui Chow and   
                    Liqiang Nie   Deep Convolutional Pooling Transformer
                                  for Deepfake Detection . . . . . . . . . 179:1--179:??
         Patrick P. K. Chan and   
                 Xiaoman Hu and   
                Haorui Song and   
                  Peng Peng and   
                      Keke Chen   Learning Disentangled Features for
                                  Person Re-identification under Clothes
                                  Changing . . . . . . . . . . . . . . . . 180:1--180:??
               Rongfei Zeng and   
                     Mai Su and   
                  Ruiyun Yu and   
                   Xingwei Wang   CD$^2$: Fine-grained $3$D Mesh
                                  Reconstruction with Twice Chamfer
                                  Distance . . . . . . . . . . . . . . . . 181:1--181:??
                Tian-Zi Niu and   
             Shan-Shan Dong and   
              Zhen-Duo Chen and   
                    Xin Luo and   
               Shanqing Guo and   
                   Zi Huang and   
                    Xin-Shun Xu   Semantic Enhanced Video Captioning with
                                  Multi-feature Fusion . . . . . . . . . . 182:1--182:??
                     Kun Li and   
                  Jiaxiu Li and   
                    Dan Guo and   
                   Xun Yang and   
                      Meng Wang   Transformer-Based Visual Grounding with
                                  Cross-Modality Interaction . . . . . . . 183:1--183:??
                Jiayuan Xie and   
                 Jiali Chen and   
                     Yi Cai and   
              Qingbao Huang and   
                        Qing Li   Visual Paraphrase Generation with Key
                                  Information Retained . . . . . . . . . . 184:1--184:??
              Bingzheng Liu and   
                Jianjun Lei and   
                    Bo Peng and   
                 Chuanbo Yu and   
                 Wanqing Li and   
                       Nam Ling   Novel View Synthesis from a Single
                                  Unposed Image via Unsupervised Learning  186:1--186:??
             Mingliang Zhou and   
               Hongyue Leng and   
                   Bin Fang and   
                  Tao Xiang and   
                 Xuekai Wei and   
                     Weijia Jia   Low-light Image Enhancement via a
                                  Frequency-based Model with Structure and
                                  Texture Decomposition  . . . . . . . . . 187:1--187:??
              Hongguang Zhu and   
                Yunchao Wei and   
                   Yao Zhao and   
              Chunjie Zhang and   
                  Shujuan Huang   AMC: Adaptive Multi-expert Collaborative
                                  Network for Text-guided Image Retrieval  188:1--188:??
           Tomaso Fontanini and   
                Luca Donati and   
           Massimo Bertozzi and   
                   Andrea Prati   Unsupervised Discovery and Manipulation
                                  of Continuous Disentangled Factors of
                                  Variation  . . . . . . . . . . . . . . . 189:1--189:??
               Puneet Kumar and   
               Gaurav Bhatt and   
                Omkar Ingle and   
                Daksh Goyal and   
          Balasubramanian Raman   Affective Feedback Synthesis Towards
                                  Multimodal Text and Image Data . . . . . 190:1--190:??
                   Yikun Xu and   
               Xingxing Wei and   
                Pengwen Dai and   
                   Xiaochun Cao   A$^2$SC: Adversarial Attacks on Subspace
                                  Clustering . . . . . . . . . . . . . . . 191:1--191:??
               Xianhua Zeng and   
               Saiyuan Chen and   
                  Yicai Xie and   
                  Tianxing Liao   3V3D: Three-View Contextual Cross-slice
                                  Difference Three-dimensional Medical
                                  Image Segmentation Adversarial Network   192:1--192:??
         Federico Becattini and   
             Pietro Bongini and   
                Luana Bulla and   
          Alberto Del Bimbo and   
         Ludovica Marinucci and   
         Misael Mongiov\`\i and   
             Valentina Presutti   VISCOUNTH: a Large-scale Multilingual
                                  Visual Question Answering Dataset for
                                  Cultural Heritage  . . . . . . . . . . . 193:1--193:??
                Wei-Yen Hsu and   
                   Pei-Wen Jian   Recurrent Multi-scale
                                  Approximation-Guided Network for Single
                                  Image Super-Resolution . . . . . . . . . 194:1--194:??
                      Bo Li and   
                 Yong Zhang and   
            Chengyang Zhang and   
               Xinglin Piao and   
                     Baocai Yin   Hypergraph Association Weakly Supervised
                                  Crowd Counting . . . . . . . . . . . . . 195:1--195:??
                 Yichun Tai and   
                 Hailin Shi and   
                   Dan Zeng and   
                    Hang Du and   
                    Yibo Hu and   
              Zicheng Zhang and   
             Zhijiang Zhang and   
                        Tao Mei   Multi-Agent Semi-Siamese Training for
                                  Long-Tail and Shallow Face Learning  . . 196:1--196:??
                     Rui Li and   
              Baopeng Zhang and   
                    Wei Liu and   
                   Zhu Teng and   
                   Jianping Fan   PANet: an End-to-end Network Based on
                                  Relative Motion for Online Multi-object
                                  Tracking . . . . . . . . . . . . . . . . 197:1--197:??
                    Ye Yuan and   
                   Jiawan Zhang   Shot Boundary Detection Using Color
                                  Clustering and Attention Mechanism . . . 198:1--198:??
                 Cong Huang and   
               Xiulian Peng and   
                   Dong Liu and   
                         Yan Lu   Text Image Super-Resolution Guided by
                                  Text Structure and Embedding Priors  . . 199:1--199:??
                    Jie Zhu and   
                    Bo Peng and   
                 Wanqing Li and   
               Haifeng Shen and   
             Qingming Huang and   
                    Jianjun Lei   Modeling Long-range Dependencies and
                                  Epipolar Geometry for Multi-view Stereo  200:1--200:??
                Xiumei Chen and   
             Xiangtao Zheng and   
                   Xiaoqiang Lu   Identity Feature Disentanglement for
                                  Visible-Infrared Person
                                  Re-Identification  . . . . . . . . . . . 201:1--201:??
                 Zhenyu Shu and   
                   Ling Gao and   
                    Shun Yi and   
                  Fangyu Wu and   
                   Xin Ding and   
                   Ting Wan and   
                    Shiqing Xin   Context-Aware $3$D Points of Interest
                                  Detection via Spatial Attention
                                  Mechanism  . . . . . . . . . . . . . . . 202:1--202:??
                  Zhen Chen and   
                  Ming Yang and   
                 Shiliang Zhang   Complementary Coarse-to-Fine Matching
                                  for Video Object Segmentation  . . . . . 203:1--203:??
         Kankanala Srinivas and   
          Ashish Kumar Bhandari   Context-Based Novel Histogram Bin
                                  Stretching Algorithm for Automatic
                                  Contrast Enhancement . . . . . . . . . . 204:1--204:??
               Zhenjun Tang and   
               Zhiyuan Chen and   
                  Zhixin Li and   
               Bineng Zhong and   
             Xianquan Zhang and   
                  Xinpeng Zhang   Unifying Dual-Attention and Siamese
                                  Transformer Network for Full-Reference
                                  Image Quality Assessment . . . . . . . . 205:1--205:??
                  Geyu Tang and   
                 Xingyu Gao and   
                    Zhenyu Chen   Learning Semantic Representation on
                                  Visual Attribute Graph for Person
                                  Re-identification and Beyond . . . . . . 206:1--206:??
                 Zijun Deng and   
               Xiangteng He and   
                     Yuxin Peng   LFR-GAN: Local Feature Refinement based
                                  Generative Adversarial Network for
                                  Text-to-Image Generation . . . . . . . . 207:1--207:??
                Yongchao Du and   
                   Min Wang and   
                  Zhenbo Lu and   
               Wengang Zhou and   
                    Houqiang Li   Weakly Supervised Hashing with
                                  Reconstructive Cross-modal Attention . . 208:1--208:??
                  Meng Wang and   
                 Jizheng Xu and   
                   Li Zhang and   
                   Junru Li and   
                  Kai Zhang and   
                 Shiqi Wang and   
                       Siwei Ma   Compressed Screen Content Image Super
                                  Resolution . . . . . . . . . . . . . . . 209:1--209:??
                 Boqiang Xu and   
                 Jian Liang and   
                Lingxiao He and   
                  Jinlin Wu and   
                   Chao Fan and   
                     Zhenan Sun   Color-Unrelated Head-Shoulder Networks
                                  for Fine-Grained Person
                                  Re-identification  . . . . . . . . . . . 210:1--210:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 20, Number 1, January, 2024

                  Zhenbo Xu and   
                Hai-Miao Hu and   
                    Liu Liu and   
             Dongping Zhang and   
              Shifeng Zhang and   
                    Wenming Tan   Instance-Based Continual Learning: a
                                  Real-World Dataset and Baseline for
                                  Fresh Recognition  . . . . . . . . . . . 1:1--1:??
             Xiaoping Liang and   
               Zhenjun Tang and   
                  Zhixin Li and   
                 Mengzhu Yu and   
               Hanyun Zhang and   
                 Xianquan Zhang   Robust Hashing via Global and Local
                                  Invariant Features for Image Copy
                                  Detection  . . . . . . . . . . . . . . . 2:1--2:??
             Sandipan Sarma and   
                     Arijit Sur   DiRaC-I: Identifying Diverse and Rare
                                  Training Classes for Zero-Shot Learning  3:1--3:??
              Chengyu Zheng and   
                  Ning Song and   
                Ruoyu Zhang and   
                  Lei Huang and   
               Zhiqiang Wei and   
                        Jie Nie   Scale-Semantic Joint Decoupling Network
                                  for Image-Text Retrieval in Remote
                                  Sensing  . . . . . . . . . . . . . . . . 4:1--4:??
                 Jiankai Li and   
               Yunhong Wang and   
                      Weixin Li   Zero-shot Scene Graph Generation via
                                  Triplet Calibration and Reduction  . . . 5:1--5:??
                Abid Yaqoob and   
           Gabriel-Miro Muntean   Advanced Predictive Tile Selection Using
                                  Dynamic Tiling for Prioritized
                                  360${}^\circ $ Video VR Streaming  . . . 6:1--6:??
                   Jia Wang and   
             Hong-Han Shuai and   
                Yung-Hui Li and   
                Wen-Huang Cheng   Language-guided Residual Graph Attention
                                  Network and Data Augmentation for Visual
                                  Grounding  . . . . . . . . . . . . . . . 7:1--7:??
                Haoran Wang and   
                 Yajie Wang and   
                Baosheng Yu and   
                Yibing Zhan and   
              Chunfeng Yuan and   
                    Wankou Yang   Attentional Composition Networks for
                                  Long-Tailed Human Action Recognition . . 8:1--8:??
              Zi-Chao Zhang and   
              Zhen-Duo Chen and   
                Zhen-Yu Xie and   
                    Xin Luo and   
                    Xin-Shun Xu   S3Mix: Same Category Same Semantics
                                  Mixing for Augmenting Fine-grained
                                  Images . . . . . . . . . . . . . . . . . 9:1--9:??
                Mingkui Tan and   
                Zhiquan Wen and   
                Leyuan Fang and   
                          Qi Wu   Transformer-Based Relational Inference
                                  Network for Complex Visual Relational
                                  Reasoning  . . . . . . . . . . . . . . . 10:1--10:??
                Yiming Yang and   
                 Weipeng Hu and   
                     Haifeng Hu   Syncretic Space Learning Network for
                                  NIR-VIS Face Recognition . . . . . . . . 11:1--11:??
                Chenghua Li and   
                  Zongze Li and   
                   Jing Sun and   
                  Yun Zhang and   
             Xiaoping Jiang and   
                      Fan Zhang   Dynamic Weighted Gradient Reversal
                                  Network for Visible-infrared Person
                                  Re-identification  . . . . . . . . . . . 12:1--12:??
                Jiajun Song and   
                    Zhuo Li and   
                Weiqing Min and   
                 Shuqiang Jiang   Towards Food Image Retrieval via
                                  Generalization-Oriented Sampling and
                                  Loss Function Design . . . . . . . . . . 13:1--13:??
                 Yiting Jin and   
                     Jie Wu and   
              Wanliang Wang and   
                 Yidong Yan and   
               Jiawei Jiang and   
                  Jianwei Zheng   Cascading Blend Network for Image
                                  Inpainting . . . . . . . . . . . . . . . 14:1--14:??
                  Kehua Guo and   
                 Liang Chen and   
              Xiangyuan Zhu and   
                Xiaoyan Kui and   
                 Jian Zhang and   
                     Heyuan Shi   Double-Layer Search and Adaptive Pooling
                                  Fusion for Reference-Based Image
                                  Super-Resolution . . . . . . . . . . . . 15:1--15:??
                  Jing Zhao and   
                     Bin Li and   
                  Jiahao Li and   
               Ruiqin Xiong and   
                         Yan Lu   A Universal Optimization Framework for
                                  Learning-based Image Codec . . . . . . . 16:1--16:??
               Liping Zhang and   
                Shukai Chen and   
                    Fei Lin and   
                    Wei Ren and   
     Kim-Kwang Raymond Choo and   
                     Geyong Min   $1$DIEN: Cross-session Electrocardiogram
                                  Authentication Using $1$D Integrated
                                  EfficientNet . . . . . . . . . . . . . . 17:1--17:??
                 Baian Chen and   
                Zhilei Chen and   
                 Xiaowei Hu and   
                     Jun Xu and   
                 Haoran Xie and   
                   Jing Qin and   
                  Mingqiang Wei   Dynamic Message Propagation Network for
                                  RGB-D and Video Salient Object Detection 18:1--18:??
                  Xiang Gao and   
                     Wei Hu and   
                     Guo-Jun Qi   Self-supervised Multi-view Learning via
                                  Auto-encoding $3$D Transformations . . . 19:1--19:??
                Dewang Wang and   
                 Gaobo Yang and   
                Zhiqing Guo and   
                     Jiyou Chen   Enhancing Adversarial Embedding based
                                  Image Steganography via Clustering
                                  Modification Directions  . . . . . . . . 20:1--20:??
               Xiaojia Zhao and   
                Tingting Xu and   
            Qiangqiang Shen and   
                  Youfa Liu and   
              Yongyong Chen and   
                    Jingyong Su   Double High-Order Correlation Preserved
                                  Robust Multi-View Ensemble Clustering    21:1--21:??
                   Shuji Tasaka   Usefulness of QoS in Multidimensional
                                  QoE Prediction for Haptic-Audiovisual
                                  Communications . . . . . . . . . . . . . 22:1--22:??
            Ching-Nung Yang and   
                Xiaotian Wu and   
                 Min-Jung Chung   Enhancement of Information Carrying and
                                  Decoding for Visual Cryptography with
                                  Error Correction . . . . . . . . . . . . 23:1--23:??
               Yuqing Zhang and   
                 Yong Zhang and   
               Shaofan Wang and   
                  Yun Liang and   
                     Baocai Yin   Semi-supervised Video Object
                                  Segmentation Via an Edge Attention Gated
                                  Graph Convolutional Network  . . . . . . 24:1--24:??
                Wenying Wen and   
              Minghui Huang and   
                Yushu Zhang and   
                Yuming Fang and   
                      Yifan Zuo   Visual Security Index Combining CNN and
                                  Filter for Perceptually Encrypted Light
                                  Field Images . . . . . . . . . . . . . . 25:1--25:??
                 Linlin Liu and   
               Haijun Zhang and   
                     Qun Li and   
               Jianghong Ma and   
                     Zhao Zhang   Collocated Clothing Synthesis with GANs
                                  Aided by Textual Information: a
                                  Multi-Modal Framework  . . . . . . . . . 26:1--26:??
                  Xulei Lou and   
                 Tinghui Wu and   
                 Haifeng Hu and   
                      Dihu Chen   Self-Supervised Consistency Based on
                                  Joint Learning for Unsupervised Person
                                  Re-identification  . . . . . . . . . . . 27:1--27:??
                Yichi Zhang and   
              Gongchun Ding and   
                Dandan Ding and   
                    Zhan Ma and   
                         Zhu Li   On Content-Aware Post-Processing:
                                  Adapting Statistically Learned Models to
                                  Dynamic Content  . . . . . . . . . . . . 28:1--28:??
                    Jing Xu and   
                   Bing Liu and   
                  Yong Zhou and   
               Mingming Liu and   
                    Rui Yao and   
                    Zhiwen Shao   Diverse Image Captioning via Conditional
                                  Variational Autoencoder and Dual
                                  Contrastive Learning . . . . . . . . . . 29:1--29:??
                   Cong Zou and   
                   Rui Wang and   
                  Cheng Jin and   
                Sanyi Zhang and   
                       Xin Wang   S$^2$CL-LeafNet: Recognizing Leaf Images
                                  Like Human Botanists . . . . . . . . . . 30:1--30:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 20, Number 2, February, 2024

            Suyel Namasudra and   
              Pascal Lorenz and   
            Seifedine Kadry and   
        Syed Ahmad Chan Bukhari   Introduction to the Special Issue on
                                  DNA-centric Modeling and Practice for
                                  Next-generation Computing and
                                  Communication Systems  . . . . . . . . . 31:1--31:??
                Shaohua Wan and   
                     Yi Jin and   
               Guangdong Xu and   
                  Michele Nappi   Editorial to Special Issue on Multimedia
                                  Cognitive Computing for Intelligent
                                  Transportation System  . . . . . . . . . 32:1--32:??
                Ruonan Zhao and   
           Laurence T. Yang and   
                  Debin Liu and   
                   Wanli Lu and   
                 Chenlu Zhu and   
                    Yiheng Ruan   Tensor-Empowered LSTM for
                                  Communication-Efficient and
                                  Privacy-Enhanced Cognitive Federated
                                  Learning in Intelligent Transportation
                                  Systems  . . . . . . . . . . . . . . . . 33:1--33:??
               Hongjian Shi and   
                   Hao Wang and   
                   Ruhui Ma and   
                   Yang Hua and   
                   Tao Song and   
                Honghao Gao and   
                   Haibing Guan   Robust Searching-Based Gradient
                                  Collaborative Management in Intelligent
                                  Transportation System  . . . . . . . . . 34:1--34:??
                 Zejia Weng and   
                  Zuxuan Wu and   
                 Hengduo Li and   
              Jingjing Chen and   
                  Yu-Gang Jiang   HCMS: Hierarchical and Conditional
                                  Modality Selection for Efficient Video
                                  Recognition  . . . . . . . . . . . . . . 35:1--35:??
             Shixiong Zhang and   
                Wenmin Wang and   
                 Honglei Li and   
                 Shenyong Zhang   E-detector: Asynchronous Spatio-temporal
                                  for Event-based Object Detection in
                                  Intelligent Transportation System  . . . 36:1--36:??
           Ram Prasad Padhy and   
            Pankaj Kumar Sa and   
             Fabio Narducci and   
             Carmen Bisogni and   
                  Sambit Bakshi   Monocular Vision-aided Depth Measurement
                                  from RGB Images for Autonomous UAV
                                  Navigation . . . . . . . . . . . . . . . 37:1--37:??
                  Zhihan Lv and   
               Fabio Poiesi and   
                    Qi Dong and   
               Jaime Lloret and   
                   Houbing Song   Special Issue on Deep Learning for
                                  Intelligent Human Computer Interaction   38:1--38:??
               Wenjuan Gong and   
                  Yue Zhang and   
                   Wei Wang and   
                 Peng Cheng and   
               Jordi Gonz\`alez   Meta-MMFNet: Meta-learning-based
                                  Multi-model Fusion Network for
                                  Micro-expression Recognition . . . . . . 39:1--39:??
            Youcef Djenouri and   
               Asma Belhadi and   
          Gautam Srivastava and   
             Jerry Chun-Wei Lin   An Efficient and Accurate GPU-based Deep
                                  Learning Model for Multimedia
                                  Recommendation . . . . . . . . . . . . . 40:1--40:??
              Gaur Loveleen and   
             Bhandari Mohan and   
      Bhadwal Singh Shikhar and   
                 Jhanjhi Nz and   
      Mohammad Shorfuzzaman and   
                   Mehedi Masud   Explanation-Driven HCI Model to Examine
                                  the Mini-Mental State for Alzheimer's
                                  Disease  . . . . . . . . . . . . . . . . 41:1--41:??
                      Mi Li and   
                  Wei Zhang and   
                     Bin Hu and   
               Jiaming Kang and   
                  Yuqi Wang and   
                     Shengfu Lu   Automatic Assessment of Depression and
                                  Anxiety through Encoding Pupil-wave from
                                  HCI in VR Scenes . . . . . . . . . . . . 42:1--42:??
               Abdul Qayyum and   
               Imran Razzak and   
                 M. Tanveer and   
                   Moona Mazher   Spontaneous Facial Behavior Analysis
                                  Using Deep Transformer-based Framework
                                  for Child-computer Interaction . . . . . 43:1--43:??
               Xiaowei Chen and   
                 Xiao Jiang and   
              Lishuang Zhan and   
                 Shihui Guo and   
              Qunsheng Ruan and   
               Guoliang Luo and   
              Minghong Liao and   
                     Yipeng Qin   Full-body Human Motion Reconstruction
                                  with Sparse Joint Tracking Using
                                  Flexible Sensors . . . . . . . . . . . . 44:1--44:??
               Shanbao Qiao and   
              Neal N. Xiong and   
                Yongbin Gao and   
                Zhijun Fang and   
                  Wenjun Yu and   
                 Juan Zhang and   
                  Xiaoyan Jiang   Self-Supervised Learning of Depth and
                                  Ego-Motion for $3$D Perception in Human
                                  Computer Interaction . . . . . . . . . . 45:1--45:??
                   Yan Kang and   
                     Bin Pu and   
                 Yongqi Kou and   
                   Yun Yang and   
               Jianguo Chen and   
              Khan Muhammad and   
                    Po Yang and   
                    Lida Xu and   
                 Mohammad Hijji   A Deep Graph Network with Multiple
                                  Similarity for User Clustering in
                                  Human-Computer Interaction . . . . . . . 46:1--46:??
               Bahar Mahmud and   
                  Guan Hong and   
                   Bernard Fong   A Study of Human--AI Symbiosis for
                                  Creative Work: Recent Developments and
                                  Future Directions in Deep Learning . . . 47:1--47:??
                Xiaoling Gu and   
                  Jie Huang and   
              Yongkang Wong and   
                     Jun Yu and   
               Jianping Fan and   
                   Pai Peng and   
           Mohan S. Kankanhalli   PAINT: Photo-realistic Fashion Design
                                  Synthesis  . . . . . . . . . . . . . . . 48:1--48:??
               Qingfeng Dai and   
              Yongkang Wong and   
                 Guofei Sun and   
                Yanwei Wang and   
                  Zhou Zhou and   
       Mohan S. Kankanhalli and   
               Xiangdong Li and   
                   Weidong Geng   Unsupervised Domain Adaptation by Causal
                                  Learning for Biometric Signal-based HCI  49:1--49:??
                    Yi Xiao and   
                   Tong Liu and   
                     Yu Han and   
                    Yue Liu and   
                  Yongtian Wang   Realtime Recognition of Dynamic Hand
                                  Gestures in Practical Applications . . . 50:1--50:??
               Jianping Gou and   
                 Liyuan Sun and   
                Baosheng Yu and   
                Shaohua Wan and   
                    Dacheng Tao   Hierarchical Multi-Attention Transfer
                                  for Knowledge Distillation . . . . . . . 51:1--51:??
            Subhrajyoti Deb and   
               Abhilash Das and   
                   Nirmalya Kar   An Applied Image Cryptosystem on Moore's
                                  Automaton Operating on $ \delta (q_k) /
                                  F_2 $  . . . . . . . . . . . . . . . . . 52:1--52:??
                   Sisi You and   
                  Yukun Zuo and   
                 Hantao Yao and   
                  Changsheng Xu   Incremental Audio-Visual Fusion for
                                  Person Recognition in Earthquake Scene   53:1--53:??
                  Shiqi Sun and   
               Danlan Huang and   
               Xiaoming Tao and   
              Chengkang Pan and   
                Guangyi Liu and   
                  Changwen Chen   Boosting Scene Graph Generation with
                                  Contextual Information . . . . . . . . . 54:1--54:??
              Jianwei Zheng and   
                     Yu Liu and   
                Yuchao Feng and   
                 Honghui Xu and   
                    Meiyu Zhang   Contrastive Attention-guided Multi-level
                                  Feature Registration for Reference-based
                                  Super-resolution . . . . . . . . . . . . 55:1--55:??
                 Shangxi Wu and   
                 Jitao Sang and   
                  Kaiyan Xu and   
              Guanhua Zheng and   
                  Changsheng Xu   Adaptive Adversarial Logits Pairing  . . 56:1--56:??
                  Ying Chen and   
                    Rui Yao and   
                  Yong Zhou and   
                 Jiaqi Zhao and   
                   Bing Liu and   
         Abdulmotaleb El Saddik   Black-box Attack against Self-supervised
                                  Video Object Segmentation Models with
                                  Contrastive Loss . . . . . . . . . . . . 57:1--57:??
               Shuang Liang and   
                  Wentao Ma and   
                        Chi Xie   Relation with Free Objects for Action
                                  Recognition  . . . . . . . . . . . . . . 58:1--58:??
                 Qiaolin He and   
               Zhijie Zheng and   
                     Haifeng Hu   A Feature Map is Worth a Video Frame:
                                  Rethinking Convolutional Features for
                                  Visible-Infrared Person
                                  Re-identification  . . . . . . . . . . . 59:1--59:??
              Wuliang Huang and   
               Yiqiang Chen and   
              Xinlong Jiang and   
                 Teng Zhang and   
                      Qian Chen   GJFusion: a Channel-Level Correlation
                                  Construction Method for Multimodal
                                  Physiological Signal Fusion  . . . . . . 60:1--60:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 20, Number 3, March, 2024

               Chengji Shen and   
              Zhenjiang Liu and   
                    Xin Gao and   
                Zunlei Feng and   
                    Mingli Song   Self-Adaptive Clothing Mapping Based
                                  Virtual Try-on . . . . . . . . . . . . . 61:1--61:??
           Alberto Baldrati and   
              Marco Bertini and   
           Tiberio Uricchio and   
              Alberto Del Bimbo   Composed Image Retrieval using
                                  Contrastive Learning and Task-oriented
                                  CLIP-based Features  . . . . . . . . . . 62:1--62:??
                   Yan Wang and   
                   Peize Li and   
                  Qingyi Si and   
               Hanwen Zhang and   
                 Wenyu Zang and   
                  Zheng Lin and   
                        Peng Fu   Cross-modality Multiple Relations
                                  Learning for Knowledge-based Visual
                                  Question Answering . . . . . . . . . . . 63:1--63:??
                  Qiang Guo and   
                  Zhi Zhang and   
             Mingliang Zhou and   
                   Hong Yue and   
                  Huayan Pu and   
                        Jun Luo   Image Defogging Based on Regional
                                  Gradient Constrained Prior . . . . . . . 64:1--64:??
                 Jintao Guo and   
                     Lei Qi and   
               Yinghuan Shi and   
                       Yang Gao   PLACE Dropout: a Progressive Layer-wise
                                  and Channel-wise Dropout for Domain
                                  Generalization . . . . . . . . . . . . . 65:1--65:??
                 Yuan Xiong and   
                Jingru Wang and   
                     Zhong Zhou   VirtualLoc: Large-scale Visual
                                  Localization Using Virtual Images  . . . 66:1--66:??
               Yiheng Zhang and   
                   Ting Yao and   
                Zhaofan Qiu and   
                        Tao Mei   Explaining Cross-domain Recognition with
                                  Interpretable Deep Classifier  . . . . . 67:1--67:??
                Ruimin Wang and   
               Fasheng Wang and   
                  Yiming Su and   
                   Jing Sun and   
                 Fuming Sun and   
                      Haojie Li   Attention-guided Multi-modality
                                  Interaction Network for RGB-D Salient
                                  Object Detection . . . . . . . . . . . . 68:1--68:??
                Jemily Rime and   
           Alan Archer-Boyd and   
                    Tom Collins   How Will You Pod? Implications of
                                  Creators' Perspectives for Designing
                                  Innovative Podcasting Tools  . . . . . . 69:1--69:??
                    Ming Cheung   Learning from the Past: Fast NAS for
                                  Tasks and Datasets . . . . . . . . . . . 70:1--70:??
                  Xinyue Li and   
                 Haiyong Xu and   
               Gangyi Jiang and   
                     Mei Yu and   
                   Ting Luo and   
                Xuebo Zhang and   
                   Hongwei Ying   Underwater Image Quality Assessment from
                                  Synthetic to Real-world: Dataset and
                                  Objective Method . . . . . . . . . . . . 71:1--71:??
                 Sujuan Hou and   
                Jiacheng Li and   
                Weiqing Min and   
                  Qiang Hou and   
                 Yanna Zhao and   
              Yuanjie Zheng and   
                 Shuqiang Jiang   Deep Learning for Logo Detection: a
                                  Survey . . . . . . . . . . . . . . . . . 72:1--72:??
                Yunjie Peng and   
                  Jinlin Wu and   
                 Boqiang Xu and   
               Chunshui Cao and   
                     Xu Liu and   
                 Zhenan Sun and   
                    Zhiqiang He   Deep Learning Based Occluded Person
                                  Re-Identification: a Survey  . . . . . . 73:1--73:??
    Muhammad Arslan Manzoor and   
              Sarah Albarri and   
                Ziting Xian and   
               Zaiqiao Meng and   
              Preslav Nakov and   
                Shangsong Liang   Multimodality Representation Learning: a
                                  Survey on Evolution, Pretraining and Its
                                  Applications . . . . . . . . . . . . . . 74:1--74:??
                 Yanyan Shi and   
                Shaowu Yang and   
               Wenjing Yang and   
                 Dianxi Shi and   
                      Xuehui Li   Boosting Few-shot Object Detection with
                                  Discriminative Representation and Class
                                  Margin . . . . . . . . . . . . . . . . . 75:1--75:??
                Harry Cheng and   
               Yangyang Guo and   
                Tianyi Wang and   
                      Qi Li and   
              Xiaojun Chang and   
                    Liqiang Nie   Voice-Face Homogeneity Tells Deepfake    76:1--76:??
                     Jin Ye and   
                   Meng Dan and   
                  Wenchao Jiang   A Visual Sensitivity Aware ABR Algorithm
                                  for DASH via Deep Reinforcement Learning 77:1--77:??
                  Jian Wang and   
                  Xiao Wang and   
                  Guosheng Zhao   Task Recommendation via Heterogeneous
                                  Multi-modal Features and Decision Fusion
                                  in Mobile Crowdsensing . . . . . . . . . 78:1--78:??
                Si-Chao Lei and   
              Yue-Jiao Gong and   
              Xiao-Lin Xiao and   
               Yi-cong Zhou and   
                      Jun Zhang   Boosting Diversity in Visual Search with
                                  Pareto Non-Dominated Re-Ranking  . . . . 79:1--79:??
               Huijie Zhang and   
                      Pu Li and   
                Xiaobai Liu and   
              Xianfeng Yang and   
                          Li An   An Iterative Semi-supervised Approach
                                  with Pixel-wise Contrastive Loss for
                                  Road Extraction in Aerial Images . . . . 80:1--80:??
                  Jing Fang and   
                   Yinbo Yu and   
             Zhongyuan Wang and   
                   Xin Ding and   
                      Ruimin Hu   An Image Arbitrary-Scale
                                  Super-Resolution Network Using
                                  Frequency-domain Information . . . . . . 81:1--81:??
                   Xiao Luo and   
                     Wei Ju and   
                  Yiyang Gu and   
                 Yifang Qin and   
                    Siyu Yi and   
                  Daqing Wu and   
                 Luchen Liu and   
                     Ming Zhang   Toward Effective Semi-supervised Node
                                  Classification with Hybrid Curriculum
                                  Pseudo-labeling  . . . . . . . . . . . . 82:1--82:??
                    Wen Guo and   
                Wuzhou Quan and   
                  Junyu Gao and   
              Tianzhu Zhang and   
                  Changsheng Xu   Feature Disentanglement Network:
                                  Multi-Object Tracking Needs More
                                  Differentiated Features  . . . . . . . . 83:1--83:??
           Mohammed Khaleel and   
                Azeez Idris and   
        Wallapak Tavanapong and   
             Jacob R. Pratt and   
                Junghwan Oh and   
               Piet C. de Groen   VisActive: Visual-concept-based Active
                                  Learning for Image Classification under
                                  Class Imbalance  . . . . . . . . . . . . 84:1--84:??
               Honghua Chen and   
                   Zhiqi Li and   
               Mingqing Wei and   
                       Jun Wang   Geometric and Learning-Based Mesh
                                  Denoising: a Comprehensive Survey  . . . 85:1--85:??
                   Ning Han and   
                 Yawen Zeng and   
                 Chuhao Shi and   
               Guangyi Xiao and   
                   Hao Chen and   
                  Jingjing Chen   BiC-Net: Learning Efficient
                                  Spatio-temporal Relation for Text-Video
                                  Retrieval  . . . . . . . . . . . . . . . 86:1--86:??
                  Yuan Feng and   
                  Yaojun Hu and   
               Pengfei Fang and   
                  Sheng Liu and   
               Yanhong Yang and   
                 Shengyong Chen   Asymmetric Dual-Decoder U-Net for Joint
                                  Rain and Haze Removal  . . . . . . . . . 87:1--87:??
                  Yurui Xie and   
                      Ling Guan   Sparsity-guided Discriminative Feature
                                  Encoding for Robust Keypoint Detection   88:1--88:??
              Nicolas Beuve and   
          Wassim Hamidouche and   
        Olivier Déforges   Hierarchical Learning and Dummy Triplet
                                  Loss for Efficient Deepfake Detection    89:1--89:??
             Suncheng Xiang and   
                Dahong Qian and   
              Jingsheng Gao and   
                Zirui Zhang and   
                   Ting Liu and   
                      Yuzhuo Fu   Rethinking Person Re-Identification via
                                  Semantic-based Pretraining . . . . . . . 90:1--90:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 20, Number 4, April, 2024

                   Min Peng and   
                Xiaohu Shao and   
                     Yu Shi and   
                 Xiangdong Zhou   Hierarchical Synergy-Enhanced Multimodal
                                  Relational Network for Video Question
                                  Answering  . . . . . . . . . . . . . . . 91:1--91:??
                    Bin Ren and   
                   Hao Tang and   
               Fanyang Meng and   
                Ding Runwei and   
          Philip H. S. Torr and   
                      Nicu Sebe   Cloth Interactive Transformer for
                                  Virtual Try-On . . . . . . . . . . . . . 92:1--92:??
                Xiushan Nie and   
                   Yang Shi and   
                  Ziyu Meng and   
                  Jin Huang and   
                 Weili Guan and   
                     Yilong Yin   Complex Scenario Image Retrieval via
                                  Deep Similarity-aware Hashing  . . . . . 93:1--93:??
                 Jiawei Tan and   
              Hongxing Wang and   
                   Junsong Yuan   Characters Link Shots: Character
                                  Attention Network for Movie Scene
                                  Segmentation . . . . . . . . . . . . . . 94:1--94:??
             Mingliang Zhou and   
                Xinwen Zhao and   
                 Futing Luo and   
                    Jun Luo and   
                  Huayan Pu and   
                      Tao Xiang   Robust RGB-T Tracking via Adaptive
                                  Modality Weight Correlation Filters and
                                  Cross-modality Learning  . . . . . . . . 95:1--95:??
              Zicheng Zhang and   
                    Wei Sun and   
               Yingjie Zhou and   
                    Jun Jia and   
              Zhichao Zhang and   
                   Jing Liu and   
               Xiongkuo Min and   
                  Guangtao Zhai   Subjective and Objective Quality
                                  Assessment for in-the-Wild Computer
                                  Graphics Images  . . . . . . . . . . . . 96:1--96:??
               Shuvendu Roy and   
                     Ali Etemad   Contrastive Learning of View-invariant
                                  Representations for Facial Expressions
                                  Recognition  . . . . . . . . . . . . . . 97:1--97:??
                    Jun Liu and   
               Jiantao Zhou and   
                  Haiwei Wu and   
                 Weiwei Sun and   
                     Jinyu Tian   Generating Robust Adversarial Examples
                                  against Online Social Networks (OSNs)    98:1--98:??
                    Tao Yao and   
                    Yiru Li and   
                    Ying Li and   
               Yingying Zhu and   
                  Gang Wang and   
                        Jun Yue   Cross-modal Semantically Augmented
                                  Network for Image-text Matching  . . . . 99:1--99:??
               Ahmed Telili and   
            Sid Ahmed Fezza and   
          Wassim Hamidouche and   
   Hanene F. Z. Brachemi Meftah   2BiVQA: Double Bi-LSTM-based Video
                                  Quality Assessment of UGC Videos . . . . 100:1--100:??
              Hongzhou Chen and   
                Haihan Duan and   
              Maha Abdallah and   
                 Yufeng Zhu and   
               Yonggang Wen and   
     Abdulmotaleb El Saddik and   
                        Wei Cai   Web3 Metaverse: State-of-the-Art and
                                  Vision . . . . . . . . . . . . . . . . . 101:1--101:??
                Lilong Wang and   
                 Yunhui Shi and   
                   Jin Wang and   
                Shujun Chen and   
                 Baocai Yin and   
                       Nam Ling   Graph Based Cross-Channel Transform for
                                  Color Image Compression  . . . . . . . . 102:1--102:??
                    Kai Han and   
                     Yu Liu and   
                  Rukai Wei and   
                    Ke Zhou and   
                  Jinhui Xu and   
                       Kun Long   Supervised Hierarchical Online Hashing
                                  for Cross-modal Retrieval  . . . . . . . 103:1--103:??
                  Fengyi Fu and   
             Shancheng Fang and   
               Weidong Chen and   
                   Zhendong Mao   Sentiment-Oriented Transformer-Based
                                  Variational Autoencoder Network for Live
                                  Video Commenting . . . . . . . . . . . . 104:1--104:??
               Yuxiang Peng and   
                   Chong Fu and   
                Guixing Cao and   
                   Wei Song and   
                Junxin Chen and   
                 Chiu-Wing Sham   JPEG-compatible Joint Image Compression
                                  and Encryption Algorithm with File Size
                                  Preservation . . . . . . . . . . . . . . 105:1--105:??
                Daizong Liu and   
                  Xiaoye Qu and   
              Jianfeng Dong and   
                   Pan Zhou and   
                 Zichuan Xu and   
               Haozhao Wang and   
                    Xing Di and   
                 Weining Lu and   
                       Yu Cheng   Transform-Equivariant Consistency
                                  Learning for Temporal Sentence Grounding 106:1--106:??
                   Yijie Hu and   
                   Bin Dong and   
               Kaizhu Huang and   
                   Lei Ding and   
                   Wei Wang and   
              Xiaowei Huang and   
                  Qiu-Feng Wang   Scene Text Recognition via Dual-path
                                  Network with Shape-driven Attention
                                  Alignment  . . . . . . . . . . . . . . . 107:1--107:??
             Rongjiao Liang and   
              Shichao Zhang and   
              Wenzhen Zhang and   
              Guixian Zhang and   
                    Jinyun Tang   Nonlocal Hybrid Network for Long-tailed
                                  Image Classification . . . . . . . . . . 108:1--108:??
                   Piao Shi and   
                     Min Hu and   
                Xuefeng Shi and   
                       Fuji Ren   Deep Modular Co-Attention Shifting
                                  Network for Multimodal Sentiment
                                  Analysis . . . . . . . . . . . . . . . . 109:1--109:??
                 Jing Zhang and   
                    Dan Guo and   
                   Xun Yang and   
                Peipei Song and   
                      Meng Wang   Visual-linguistic-stylistic Triple
                                  Reward for Cross-lingual Image
                                  Captioning . . . . . . . . . . . . . . . 110:1--110:??
               Zhaoyang Jia and   
                     Yan Lu and   
                    Houqiang Li   Exploring Neighbor Correspondence
                                  Matching for Multiple-hypotheses Video
                                  Frame Synthesis  . . . . . . . . . . . . 111:1--111:??
                 Sheng Zhou and   
                    Dan Guo and   
                   Xun Yang and   
              Jianfeng Dong and   
                      Meng Wang   Graph Pooling Inference Network for
                                  Text-based VQA . . . . . . . . . . . . . 112:1--112:??
                Hengtong Hu and   
                 Lingxi Xie and   
                 Xinyue Huo and   
               Richang Hong and   
                        Qi Tian   One-Bit Supervision for Image
                                  Classification: Problem, Solution, and
                                  Beyond . . . . . . . . . . . . . . . . . 113:1--113:??
                  Hang Yuan and   
                    Wei Gao and   
                   Siwei Ma and   
                    Yiqiang Yan   Divide-and-conquer-based RDO-free CU
                                  Partitioning for 8K Video Compression    114:1--114:??
                  Mingyu Li and   
                   Tao Zhou and   
                 Zhuo Huang and   
                  Jian Yang and   
                   Jie Yang and   
                      Chen Gong   Dynamic Weighted Adversarial Learning
                                  for Semi-Supervised Classification under
                                  Intersectional Class Mismatch  . . . . . 115:1--115:??
                  Hui Huang and   
                    Di Xiao and   
                      Jia Liang   Secure Low-complexity Compressive
                                  Sensing with Preconditioning Prior
                                  Regularization Reconstruction  . . . . . 116:1--116:??
             Nathan Clement and   
                Alan Schoen and   
         Arnold Boedihardjo and   
                 Andrew Jenkins   Synthetic Data and Hierarchical Object
                                  Detection in Overhead Imagery  . . . . . 117:1--117:??
                 Jiang Bian and   
                  Xuhong Li and   
                   Tao Wang and   
             Qingzhong Wang and   
                  Jun Huang and   
                   Chen Liu and   
                   Jun Zhao and   
                Feixiang Lu and   
                 Dejing Dou and   
                    Haoyi Xiong   P$^2$ANet: a Large-Scale Benchmark for
                                  Dense Action Detection from Table Tennis
                                  Match Broadcasting Videos  . . . . . . . 118:1--118:??
                 Jifan Yang and   
             Zhongyuan Wang and   
            Guangcheng Wang and   
               Baojin Huang and   
                Yuhong Yang and   
                     Weiping Tu   Auxiliary Information Guided
                                  Self-attention for Image Quality
                                  Assessment . . . . . . . . . . . . . . . 119:1--119:??
              Zhanzhou Feng and   
                 Jiaming Xu and   
                     Lei Ma and   
                 Shiliang Zhang   Efficient Video Transformers via
                                  Spatial-temporal Token Merging for
                                  Action Recognition . . . . . . . . . . . 120:1--120:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 20, Number 5, May, 2024

               Shupei Zhang and   
               Chenqiu Zhao and   
                      Anup Basu   Principal Component Approximation
                                  Network for Image Compression  . . . . . 121:1--121:??
               Tianyu Zhang and   
                Weiqing Min and   
                    Tao Liu and   
             Shuqiang Jiang and   
                       Yong Rui   Toward Egocentric Compositional Action
                                  Anticipation with Adaptive Semantic
                                  Debiasing  . . . . . . . . . . . . . . . 122:1--122:??
                     Yu Liu and   
                Mingbo Zhao and   
                 Zhao Zhang and   
                 Yuping Liu and   
                  Shuicheng Yan   Arbitrary Virtual Try-on Network:
                                  Characteristics Preservation and
                                  Tradeoff between Body and Clothing . . . 123:1--123:??
              Shih-Wei Yang and   
             Li-Hsiang Shen and   
             Hong-Han Shuai and   
                   Kai-Ten Feng   CMAF: Cross-Modal Augmentation via
                                  Fusion for Underwater Acoustic Image
                                  Recognition  . . . . . . . . . . . . . . 124:1--124:??
               Yazhou Zhang and   
                    Yang Yu and   
               Mengyao Wang and   
                  Min Huang and   
              M. Shamim Hossain   Self-Adaptive Representation Learning
                                  Model for Multi-Modal Sentiment and
                                  Sarcasm Joint Analysis . . . . . . . . . 125:1--125:??
                     Lei Qi and   
                  Peng Dong and   
                  Tan Xiong and   
                    Hui Xue and   
                       Xin Geng   DoubleAUG: Single-domain Generalized
                                  Object Detector in Urban via Color
                                  Perturbation and Dual-style Memory . . . 126:1--126:??
                    Dan Shi and   
                    Lei Zhu and   
                Jingjing Li and   
                Guohua Dong and   
                 Huaxiang Zhang   Incomplete Cross-Modal Retrieval with
                                  Deep Correlation Transfer  . . . . . . . 127:1--127:??
               Xianhua Zeng and   
                 Xinyu Wang and   
                      Yicai Xie   Multiple Pseudo-Siamese Network with
                                  Supervised Contrast Learning for Medical
                                  Multi-modal Retrieval  . . . . . . . . . 128:1--128:??
                   Sisi You and   
                 Hantao Yao and   
               Bing-Kun Bao and   
                  Changsheng Xu   Multi-object Tracking with
                                  Spatial-Temporal Tracklet Association    129:1--129:??
 Gülnaziye Bingöl and   
               Simone Porcu and   
          Alessandro Floris and   
                   Luigi Atzori   QoE Estimation of WebRTC-based
                                  Audio-visual Conversations from Facial
                                  and Speech Features  . . . . . . . . . . 130:1--130:??
                 Heqian Qiu and   
               Hongliang Li and   
                  Qingbo Wu and   
                Hengcan Shi and   
               Lanxiao Wang and   
                Fanman Meng and   
                     Linfeng Xu   Learning Offset Probability Distribution
                                  for Accurate Object Detection  . . . . . 131:1--131:??
          Alessandro Floris and   
               Simone Porcu and   
                   Luigi Atzori   Controlling Media Player with Hands: a
                                  Transformer Approach and a Quality of
                                  Experience Assessment  . . . . . . . . . 132:1--132:??
                  Jingyu Li and   
               Zhendong Mao and   
                     Hao Li and   
               Weidong Chen and   
                 Yongdong Zhang   Exploring Visual Relationships via
                                  Transformer-based Graphs for Enhanced
                                  Image Captioning . . . . . . . . . . . . 133:1--133:??
                    Zeyu Ma and   
                 Siwei Wang and   
                   Xiao Luo and   
                Zhonghui Gu and   
                 Chong Chen and   
                 Jinxing Li and   
             Xian-Sheng Hua and   
                   Guangming Lu   HARR: Learning Discriminative and
                                  High-Quality Hash Codes for Image
                                  Retrieval  . . . . . . . . . . . . . . . 134:1--134:??
            Chengyang Zhang and   
                 Yong Zhang and   
                      Bo Li and   
               Xinglin Piao and   
                     Baocai Yin   CrowdGraph: Weakly supervised Crowd
                                  Counting via Pure Graph Neural Network   135:1--135:??
                   Jie Wang and   
                Guoqiang Li and   
                    Jie Shi and   
                      Jinwen Xi   Weighted Guided Optional Fusion Network
                                  for RGB-T Salient Object Detection . . . 136:1--136:??
                 Yibo Zhang and   
                 Weiguo Lin and   
                     Junfeng Xu   Joint Audio-Visual Attention with
                                  Contrastive Learning for More General
                                  Deepfake Detection . . . . . . . . . . . 137:1--137:??
                 Depei Wang and   
                 Ruifeng Xu and   
             Lianglun Cheng and   
                   Zhuowei Wang   Knowledge-integrated Multi-modal Movie
                                  Turning Point Identification . . . . . . 138:1--138:??
                 Chunpu Liu and   
              Guanglei Yang and   
               Wangmeng Zuo and   
                    Tianyi Zang   DPDFormer: a Coarse-to-Fine Model for
                                  Monocular Depth Estimation . . . . . . . 139:1--139:??
                 Yunyao Yan and   
              Guoqing Xiang and   
                 Huizhu Jia and   
                   Jie Chen and   
             Xiaofeng Huang and   
                   Xiaodong Xie   Two-Stage Perceptual Quality Oriented
                                  Rate Control Algorithm for HEVC  . . . . 140:1--140:??
                  Zongyi Li and   
                 Yuxuan Shi and   
                 Hefei Ling and   
              Jiazhong Chen and   
                 Boyuan Liu and   
              Runsheng Wang and   
                  Chengxin Zhao   Viewpoint Disentangling and Generation
                                  for Unsupervised Object Re-ID  . . . . . 141:1--141:??
                   Kuai Dai and   
                   Xutao Li and   
                 Huiwei Lin and   
                  Yin Jiang and   
                Xunlai Chen and   
                 Yunming Ye and   
                        Di Xian   TinyPredNet: a Lightweight Framework for
                                  Satellite Image Sequence Prediction  . . 142:1--142:??
                 Yingnan Ma and   
               Chenqiu Zhao and   
              Bingran Huang and   
                  Xudong Li and   
                      Anup Basu   RAST: Restorable Arbitrary Style
                                  Transfer . . . . . . . . . . . . . . . . 143:1--143:??
                Wei-Yen Hsu and   
                  Hsien-Wen Lin   Context-detail-aware United Network for
                                  Single Image Deraining . . . . . . . . . 144:1--144:??
                    Yao Liu and   
               Gangfeng Cui and   
                 Jiahui Luo and   
              Xiaojun Chang and   
                       Lina Yao   Two-stream Multi-level Dynamic Point
                                  Transformer for Two-person Interaction
                                  Recognition  . . . . . . . . . . . . . . 145:1--145:??
              Chengxin Chen and   
                 Pengyuan Zhang   Modality-collaborative Transformer with
                                  Hybrid Feature Reconstruction for Robust
                                  Emotion Recognition  . . . . . . . . . . 146:1--146:??
              Jiafeng Huang and   
              Tianjun Zhang and   
              Shengjie Zhao and   
                  Lin Zhang and   
                    Yicong Zhou   An Underwater Organism Image Dataset and
                                  a Lightweight Module Designed for Object
                                  Detection Networks . . . . . . . . . . . 147:1--147:??
                   Jing Liu and   
                Litao Shang and   
                  Yuting Su and   
                 Weizhi Nie and   
                    Xin Wen and   
                       Anan Liu   Privacy-preserving Multi-source
                                  Cross-domain Recommendation Based on
                                  Knowledge Graph  . . . . . . . . . . . . 148:1--148:??
                 Xingyu Liu and   
               Zhongyun Hua and   
                  Shuang Yi and   
                Yushu Zhang and   
                    Yicong Zhou   Bi-directional Block Encoding for
                                  Reversible Data Hiding over Encrypted
                                  Images . . . . . . . . . . . . . . . . . 149:1--149:??
                    Peng Yi and   
             Zhongyuan Wang and   
                 Laigan Luo and   
                  Kui Jiang and   
                   Zheng He and   
               Junjun Jiang and   
                     Tao Lu and   
                       Jiayi Ma   Omniscient Video Super-Resolution with
                                  Explicit-Implicit Alignment  . . . . . . 150:1--150:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 20, Number 6, June, 2024

           Amit Kumar Singh and   
               Deepa Kundur and   
                    Mauro Conti   Introduction to the Special Issue on
                                  Integrity of Multimedia and Multimodal
                                  Data in Internet of Things . . . . . . . 151:1--151:??
               Wenyuan Yang and   
                Shaocong Wu and   
                Jianwei Fei and   
              Xianwang Zeng and   
                Yuemin Ding and   
                     Zhihua Xia   A Bitcoin-based Secure Outsourcing
                                  Scheme for Optimization Problem in
                                  Multimedia Internet of Things  . . . . . 152:1--152:??
                Qingzhi Liu and   
               Yuchen Huang and   
                Chenglu Jin and   
               Xiaohan Zhou and   
                   Ying Mao and   
              Cagatay Catal and   
                     Long Cheng   Privacy and Integrity Protection for IoT
                                  Multimodal Data Using Machine Learning
                                  and Blockchain . . . . . . . . . . . . . 153:1--153:??
               Simon Jonker and   
            Malthe Jelstrup and   
                Weizhi Meng and   
                   Brooke Lampe   Detecting Post Editing of Multimedia
                                  Images using Transfer Learning and Fine
                                  Tuning . . . . . . . . . . . . . . . . . 154:1--154:??
             Carmen Bisogni and   
              Lucia Cascone and   
              Michele Nappi and   
                    Chiara Pero   IoT-enabled Biometric Security:
                                  Enhancing Smart Car Safety with
                                  Depth-based Head Pose Estimation . . . . 155:1--155:??
              Saif E. Nouma and   
                Attila A. Yavuz   Trustworthy and Efficient Digital Twins
                                  in Post-Quantum Era with Hybrid
                                  Hardware-Assisted Signatures . . . . . . 156:1--156:??
                     Fan Li and   
              Yanxiang Chen and   
                Haiyang Liu and   
                Zuxing Zhao and   
                Yuanzhi Yao and   
                       Xin Liao   Vocoder Detection of Spoofing Speech
                                  Based on GAN Fingerprints and Domain
                                  Generalization . . . . . . . . . . . . . 157:1--157:??
                   Jing Gao and   
                    Peng Li and   
           Asif Ali Laghari and   
          Gautam Srivastava and   
     Thippa Reddy Gadekallu and   
                Sidra Abbas and   
                  Jianing Zhang   Incomplete Multiview Clustering via
                                  Semidiscrete Optimal Transport for
                                  Multimedia Data Mining in IoT  . . . . . 158:1--158:??
                 Zhenyu Liu and   
                      Da Li and   
                Xinyu Zhang and   
                Zhang Zhang and   
                 Peng Zhang and   
               Caifeng Shan and   
                    Jungong Han   Pedestrian Attribute Recognition via
                                  Spatio-temporal Relationship Learning
                                  for Visual Surveillance  . . . . . . . . 159:1--159:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 20, Number 7, July, 2024

      Roberto García and   
                 Ana Cediel and   
     Merc\`e Teixidó and   
                       Rosa Gil   Semantics and Non-fungible Tokens for
                                  Copyright Management on the Metaverse
                                  and Beyond . . . . . . . . . . . . . . . 186:1--186:??
                Tianxiu Xie and   
                   Keke Gai and   
               Liehuang Zhu and   
                  Shuo Wang and   
                   Zijian Zhang   RAC-Chain: an Asynchronous
                                  Consensus-based Cross-chain Approach to
                                  Scalable Blockchain for Metaverse  . . . 187:1--187:??
                Yongjun Ren and   
                 Zhiying Lv and   
              Neal N. Xiong and   
                       Jin Wang   HCNCT: a Cross-chain Interaction Scheme
                                  for the Blockchain-based Metaverse . . . 188:1--188:??
             Shuangmin Chen and   
                     Rui Xu and   
                    Jian Xu and   
                Shiqing Xin and   
                 Changhe Tu and   
              Chenglei Yang and   
                         Lin Lu   QuickCSGModeling: Quick CSG Operations
                                  Based on Fusing Signed Distance Fields
                                  for VR Modeling  . . . . . . . . . . . . 189:1--189:??
               Qinnan Zhang and   
                Zehui Xiong and   
               Jianming Zhu and   
                  Sheng Gao and   
                   Wanting Yang   A Privacy-preserving Auction Mechanism
                                  for Learning Model as an NFT in
                                  Blockchain-driven Metaverse  . . . . . . 190:1--190:??
                   Han Wang and   
                     Hui Li and   
                 Abla Smahi and   
                  Feng Zhao and   
                    Yao Yao and   
           Ching Chuen Chan and   
                 Shiyu Wang and   
               Wenyuan Yang and   
             Shuo-Yen Robert Li   MIS: a Multi-Identifier Management and
                                  Resolution System in the Metaverse . . . 191:1--191:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 20, Number 8, August, 2024

               Jinliang Liu and   
              Zhedong Zheng and   
               Zongxin Yang and   
                        Yi Yang   High Fidelity Makeup via $2$D and $3$D
                                  Identity Preservation Net  . . . . . . . 230:1--230:??
              Junjian Huang and   
                    Hao Ren and   
                 Shulin Liu and   
                   Yong Liu and   
                 Chuanlu Lv and   
                  Jiawen Lu and   
              Changyong Xie and   
                        Hong Lu   Real-Time Attentive Dilated $U$-Net for
                                  Extremely Dark Image Enhancement . . . . 231:1--231:??
               Mingfu Xiong and   
                 Kaikang Hu and   
                 Zhihan Lyu and   
                   Fei Fang and   
             Zhongyuan Wang and   
                  Ruimin Hu and   
                  Khan Muhammad   Inter-camera Identity Discrimination for
                                  Unsupervised Person Re-identification    232:1--232:??
                   Jiaqi Yu and   
                Jinhai Yang and   
                   Hua Yang and   
                 Renjie Pan and   
                Pingrui Lai and   
                  Guangtao Zhai   Psychology-Guided Environment Aware
                                  Network for Discovering Social
                                  Interaction Groups from Videos . . . . . 233:1--233:??
                     Qi Liu and   
                Xinchen Liu and   
                    Kun Liu and   
                 Xiaoyan Gu and   
                         Wu Liu   SigFormer: Sparse Signal-guided
                                  Transformer for Multi-modal Action
                                  Segmentation . . . . . . . . . . . . . . 234:1--234:??
                    Jun Lyu and   
                Shouang Yan and   
              M. Shamim Hossain   DBGAN: Dual Branch Generative
                                  Adversarial Network for Multi-Modal MRI
                                  Translation  . . . . . . . . . . . . . . 235:1--235:??
                Dejun Zhang and   
                 Mian Zhang and   
                Xuefeng Tan and   
                        Jun Liu   Bridging the Domain Gap in Scene Flow
                                  Estimation via Hierarchical Smoothness
                                  Refinement . . . . . . . . . . . . . . . 236:1--236:??
                  Ning Chen and   
              Zhipeng Cheng and   
                  Xuwei Fan and   
                  Zhang Liu and   
             Bangzhen Huang and   
                Yifeng Zhao and   
              Lianfen Huang and   
               Xiaojiang Du and   
                 Mohsen Guizani   Integrated Sensing, Communication, and
                                  Computing for Cost-effective Multimodal
                                  Federated Perception . . . . . . . . . . 237:1--237:??
                 Jiayu Yang and   
               Chunhui Yang and   
                  Fei Xiong and   
                Yongqi Zhai and   
                  Ronggang Wang   Learned Video Compression with Adaptive
                                  Temporal Prior and Decoded Motion-aided
                                  Quality Enhancement  . . . . . . . . . . 238:1--238:??
                Xiaoling Gu and   
                 Junkai Zhu and   
              Yongkang Wong and   
                  Zizhao Wu and   
                     Jun Yu and   
               Jianping Fan and   
              Mohan Kankanhalli   Recurrent Appearance Flow for
                                  Occlusion-Free Virtual Try-On  . . . . . 239:1--239:??
                Yuanjie Lyu and   
               Penggang Qin and   
                    Tong Xu and   
                   Chen Zhu and   
                    Enhong Chen   InteractNet: Social Interaction
                                  Recognition for Semantic-rich Videos . . 240:1--240:??
      Mrinmoy Bhattacharjee and   
    Prasanna Mahadeva S. R. and   
                Prithwijit Guha   Exploration of Speech and Music
                                  Information for Movie Genre
                                  Classification . . . . . . . . . . . . . 241:1--241:??
                 Sara Sarto and   
            Marcella Cornia and   
            Lorenzo Baraldi and   
        Alessandro Nicolosi and   
                 Rita Cucchiara   Towards Retrieval-Augmented
                                  Architectures for Image Captioning . . . 242:1--242:??
                Kaihui Yang and   
                 Junwei Han and   
                Guangyu Guo and   
               Chaowei Fang and   
                 Yingzi Fan and   
               Lechao Cheng and   
                  Dingwen Zhang   Progressive Adapting and Pruning:
                                  Domain-Incremental Learning for Saliency
                                  Prediction . . . . . . . . . . . . . . . 243:1--243:??
                    Lv Tang and   
                  Xinfeng Zhang   High Efficiency Deep-learning Based
                                  Video Compression  . . . . . . . . . . . 244:1--244:??
    Pedro de Medeiros Gomes and   
               Silvia Rossi and   
                     Laura Toni   AGAR --- Attention Graph-RNN for
                                  Adaptative Motion Prediction of Point
                                  Clouds of Deformable Objects . . . . . . 245:1--245:??
                   Jiabo Ye and   
               Junfeng Tian and   
                   Ming Yan and   
                 Haiyang Xu and   
                 Qinghao Ye and   
                   Yaya Shi and   
              Xiaoshan Yang and   
                  Xuwu Wang and   
                   Ji Zhang and   
                   Liang He and   
                        Xin Lin   UniQRNet: Unifying Referring Expression
                                  Grounding and Segmentation with QRNet    246:1--246:??
                   Wei Zhou and   
                    Qi Yang and   
                    Wu Chen and   
              Qiuping Jiang and   
              Guangtao Zhai and   
                      Weisi Lin   Blind Quality Assessment of Dense $3$D
                                  Point Clouds with Structure Guided
                                  Resampling . . . . . . . . . . . . . . . 247:1--247:??
                  Yuli Zhao and   
                  Yin Zhang and   
          Francis C. M. Lau and   
                     Hai Yu and   
               Zhiliang Zhu and   
                      Bin Zhang   Expanding-Window Zigzag Decodable
                                  Fountain Codes for Scalable Multimedia
                                  Transmission . . . . . . . . . . . . . . 248:1--248:??
                 Xuanyu Jin and   
                      Ni Li and   
               Wanzeng Kong and   
                Jiajia Tang and   
                      Bing Yang   Unbiased Semantic Representation
                                  Learning Based on Causal Disentanglement
                                  for Domain Generalization  . . . . . . . 249:1--249:??
                    Bo Peng and   
                    Lin Sun and   
                Jianjun Lei and   
              Bingzheng Liu and   
               Haifeng Shen and   
                 Wanqing Li and   
                 Qingming Huang   Self-Supervised Monocular Depth
                                  Estimation via Binocular Geometric
                                  Correlation Learning . . . . . . . . . . 250:1--250:??
                  Yang Yang and   
              Shuailong Qiu and   
               Lanling Zeng and   
                    Zhigeng Pan   Detail-preserving Joint Image Upsampling 251:1--251:??
                  Xiao Kang and   
                 Xingbo Liu and   
                    Wen Xue and   
                Xiushan Nie and   
                     Yilong Yin   Online Cross-modal Hashing With Dynamic
                                  Prototype  . . . . . . . . . . . . . . . 252:1--252:??
                Yuqing Yang and   
            Boris Joukovsky and   
José Oramas Mogrovejo and   
           Tinne Tuytelaars and   
              Nikos Deligiannis   SNIPPET: a Framework for Subjective
                                  Evaluation of Visual Explanations
                                  Applied to DeepFake Detection  . . . . . 253:1--253:??
                Jinwang Pan and   
               Xianming Liu and   
               Yuanchao Bai and   
                Deming Zhai and   
               Junjun Jiang and   
                     Debin Zhao   Illumination-Aware Low-Light Image
                                  Enhancement with Transformer and
                                  Auto-Knee Curve  . . . . . . . . . . . . 254:1--254:??
        Lohic Fotio Tiotsop and   
           Antonio Servetti and   
                Peter Pocta and   
       Glenn Van Wallendael and   
           Marcus Barkowsky and   
                  Enrico Masala   Multiple Image Distortion DNN Modeling
                                  Individual Subject Quality Assessment    255:1--255:??
                  Yunhui Xu and   
                   Youru Li and   
                   Muhao Xu and   
               Zhenfeng Zhu and   
                       Yao Zhao   HKA: a Hierarchical Knowledge Alignment
                                  Framework for Multimodal Knowledge Graph
                                  Completion . . . . . . . . . . . . . . . 256:1--256:??
                    Li Zhou and   
                 Zhenyu Liu and   
                  Yutong Li and   
                 Yuchi Duan and   
                  Huimin Yu and   
                         Bin Hu   Multi Fine-Grained Fusion Network for
                                  Depression Detection . . . . . . . . . . 257:1--257:??
                 Chenlei Lv and   
                  Dan Zhang and   
             Shengling Geng and   
                 Zhongke Wu and   
                      Hui Huang   Color Transfer for Images: a Survey  . . 258:1--258:??
               Zhihao Zhang and   
                   Jun Wang and   
                Shengjie Li and   
                    Lei Jin and   
                     Hao Wu and   
                  Jian Zhao and   
                       Bo Zhang   Review and Analysis of RGBT Single
                                  Object Tracking Methods: a Fusion
                                  Perspective  . . . . . . . . . . . . . . 259:1--259:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 20, Number 9, September, 2024

             Yuantong Zhang and   
                Daiqin Yang and   
             Zhenzhong Chen and   
                   Wenpeng Ding   Continuous Space-Time Video
                                  Super-Resolution with Multi-Stage Motion
                                  Information Reorganization . . . . . . . 273:1--273:??
                Caijuan Shi and   
              Yuanfan Zheng and   
                      Zhen Chen   Domain Adaptive Thermal Object Detection
                                  with Unbiased Granularity Alignment  . . 274:1--274:??
                   Ziyi Liu and   
                   You Yang and   
                   Kejun Wu and   
                  Qiong Liu and   
                 Xinghua Xu and   
                Xiaoxuan Ma and   
                     Jiang Tang   ASIFusion: an Adaptive Saliency
                                  Injection-Based Infrared and Visible
                                  Image Fusion Network . . . . . . . . . . 275:1--275:??
                      Xu Wu and   
                 Zhihui Lai and   
                   Jie Zhou and   
                 Xianxu Hou and   
             Witold Pedrycz and   
                    Linlin Shen   Light-Aware Contrastive Learning for
                                  Low-Light Image Enhancement  . . . . . . 276:1--276:??
             Yoanes Bandung and   
   Mokhamad Arfan Wicaksono and   
               Sean Pribadi and   
         Armein Z. R. Langi and   
                   Dion Tanjung   IoT Video Delivery Optimization through
                                  Machine Learning-Based Frame Resolution
                                  Adjustment . . . . . . . . . . . . . . . 277:1--277:??
                 Jingwei Ma and   
              Kangkang Bian and   
                    Yang Xu and   
                        Lei Zhu   ANAGL: a Noise-Resistant and Anti-Sparse
                                  Graph Learning for Micro-Video
                                  Recommendation . . . . . . . . . . . . . 278:1--278:??
                Wuyang Chen and   
                 Boqing Zhu and   
                    Kele Xu and   
                   Yong Dou and   
                     Dawei Feng   VoiceStyle: Voice-Based Face Generation
                                  via Cross-Modal Prototype Contrastive
                                  Learning . . . . . . . . . . . . . . . . 279:1--279:??
                   Chen Cai and   
                Kim-Hui Yap and   
                    Suchen Wang   Toward Attribute-Controlled Fashion
                                  Image Captioning . . . . . . . . . . . . 280:1--280:??
                     Kai Lv and   
                 Haobo Chen and   
               Chuyang Zhao and   
                     Kai Tu and   
                 Junru Chen and   
                  Yadong Li and   
                   Boxun Li and   
                    Youfang Lin   Style Variable and Irrelevant Learning
                                  for Generalizable Person
                                  Re-identification  . . . . . . . . . . . 281:1--281:??
                 Mengran Li and   
              Ronghui Zhang and   
                 Yong Zhang and   
               Xinglin Piao and   
                 Shiyu Zhao and   
                     Baocai Yin   SCAE: Structural Contrastive
                                  Auto-Encoder for Incomplete Multi-View
                                  Representation Learning  . . . . . . . . 282:1--282:??
              Hanzhang Wang and   
                Deming Zhai and   
                 Xiong Zhou and   
               Junjun Jiang and   
                   Xianming Liu   Mix-DDPM: Enhancing Diffusion Models
                                  through Fitting Mixture Noise with
                                  Global Stochastic Offset . . . . . . . . 283:1--283:??
                Wenxuan Hou and   
                Guangyao Li and   
                Yapeng Tian and   
                          Di Hu   Toward Long Form Audio-Visual Video
                                  Understanding  . . . . . . . . . . . . . 284:1--284:??
                 Encheng Yu and   
                Jianer Zhou and   
                  Zhenyu Li and   
               Gareth Tyson and   
                 Weichao Li and   
                Xinyi Zhang and   
                  Zhiwei Xu and   
                    Gaogang Xie   Mustang: Improving QoE for Real-Time
                                  Video in Cellular Networks by Masking
                                  Jitter . . . . . . . . . . . . . . . . . 285:1--285:??
                     Yan Li and   
              Xiangyuan Lan and   
               Haifeng Chen and   
                      Ke Lu and   
                  Dongmei Jiang   Multimodal PEAR Chain-of-Thought
                                  Reasoning for Multimodal Sentiment
                                  Analysis . . . . . . . . . . . . . . . . 286:1--286:??
               Zechen Liang and   
              Yuan-Gen Wang and   
                     Wei Lu and   
                   Xiaochun Cao   Boosting Semi-Supervised Learning with
                                  Dual-Threshold Screening and Similarity
                                  Learning . . . . . . . . . . . . . . . . 287:1--287:??
                  Chen Chen and   
                Lingfeng Qu and   
              Hadi Amirpour and   
               Xingjun Wang and   
         Christian Timmerer and   
                   Zhihong Tian   On the Security of Selectively Encrypted
                                  HEVC Video Bitstreams  . . . . . . . . . 288:1--288:??
                    Tai Qin and   
                      Ge Li and   
                    Wei Gao and   
                       Shan Liu   Multi-Grained Point Cloud Geometry
                                  Compression via Dual-Model Prediction
                                  with Extended Octree . . . . . . . . . . 289:1--289:??
               Jiehua Zhang and   
                   Liang Li and   
              Chenggang Yan and   
                  Zhan Wang and   
              Changliang Xu and   
               Jiyong Zhang and   
                   Chuqiao Chen   Learning Domain Invariant Features for
                                  Unsupervised Indoor Depth Estimation
                                  Adaptation . . . . . . . . . . . . . . . 290:1--290:??
                  Yiling Xu and   
                Yujie Zhang and   
                    Qi Yang and   
               Xiaozhong Xu and   
                       Shan Liu   Compressed Point Cloud Quality Index by
                                  Combining Global Appearance and Local
                                  Details  . . . . . . . . . . . . . . . . 291:1--291:??
                 Zhilei Liu and   
               Xiaoxing Liu and   
                   Sen Chen and   
                Jiaxing Liu and   
              Longbiao Wang and   
                     Chongke Bi   Multimodal Fusion for Talking Face
                                  Generation Utilizing Speech-Related
                                  Facial Action Units  . . . . . . . . . . 292:1--292:??
                  Zizhao Wu and   
                   Siyu Liu and   
                 Peioyan Lu and   
                  Ping Yang and   
              Yongkang Wong and   
                Xiaoling Gu and   
           Mohan S. Kankanhalli   KF-VTON: Keypoints-Driven Flow Based
                                  Virtual Try-On Network . . . . . . . . . 293:1--293:??
                Linhai Zhuo and   
                  Yuqian Fu and   
              Jingjing Chen and   
                  Yixin Cao and   
                  Yu-Gang Jiang   Unified View Empirical Study for Large
                                  Pretrained Model on Cross-Domain
                                  Few-Shot Learning  . . . . . . . . . . . 294:1--294:??
                 Ruifan Zuo and   
              Chaoqun Zheng and   
                Fengling Li and   
                    Lei Zhu and   
                    Zheng Zhang   Privacy-Enhanced Prototype-Based
                                  Federated Cross-Modal Hashing for
                                  Cross-Modal Retrieval  . . . . . . . . . 295:1--295:??
                   Xue Song and   
              Jingjing Chen and   
                    Bin Zhu and   
                  Yu-Gang Jiang   Text-Driven Video Prediction . . . . . . 296:1--296:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 20, Number 10, October, 2024

            Walayat Hussain and   
                Honghao Gao and   
               Rafiul Karim and   
         Abdulmotaleb El Saddik   Seventeen Years of the \booktitleACM
                                  Transactions on Multimedia Computing,
                                  Communications and Applications: a
                                  Bibliometric Overview  . . . . . . . . . 297:1--297:??
                 Bowen Yuan and   
                  Jiahao Lu and   
                   Sisi You and   
                   Bing-Kun Bao   Unbiased Feature Learning with Causal
                                  Intervention for Visible-Infrared Person
                                  Re-Identification  . . . . . . . . . . . 298:1--298:??
                Sixian Chan and   
              Xianpeng Zeng and   
                Xinhua Wang and   
                     Jie Hu and   
                       Cong Bai   Auxiliary Feature Fusion and Noise
                                  Suppression for HOI Detection  . . . . . 299:1--299:??
                   Yefan Li and   
                Fuqing Duan and   
                          Ke Lu   Gated Multi-Modal Edge Refinement
                                  Network for Light Field Salient Object
                                  Detection  . . . . . . . . . . . . . . . 300:1--300:??
                 Dongze Hao and   
                 Qunbo Wang and   
                 Xinxin Zhu and   
                       Jing Liu   HCCL: Hierarchical Counterfactual
                                  Contrastive Learning for Robust Visual
                                  Question Answering . . . . . . . . . . . 301:1--301:??
                    Jun Jia and   
               Zhongpai Gao and   
                 Yiwei Yang and   
                    Wei Sun and   
                 Dandan Zhu and   
               Xiaohong Liu and   
               Xiongkuo Min and   
                  Guangtao Zhai   Hidden Barcode in Sub-Images with
                                  Invisible Locating Marker  . . . . . . . 302:1--302:??
                  Junxin Lu and   
                Yongbin Gao and   
                 Jieyu Chen and   
            Jeng-Neng Hwang and   
              Hamido Fujita and   
                    Zhijun Fang   Monocular Depth and Ego-motion
                                  Estimation with Scale Based on
                                  Superpixel and Normal Constraints  . . . 303:1--303:??
              Zhenjiang Guo and   
                 Xiaohai He and   
                    Yu Yang and   
                 Linbo Qing and   
                  Honggang Chen   DAG-YOLO: a Context-Feature Adaptive
                                  fusion Rotating Detection Network in
                                  Remote Sensing Images  . . . . . . . . . 304:1--304:??
                  Yong Zhou and   
                 Zeming Xie and   
                 Jiaqi Zhao and   
                Wenliang Du and   
                    Rui Yao and   
         Abdulmotaleb El Saddik   Multi-Modal LiDAR Point Cloud Semantic
                                  Segmentation with Salience Refinement
                                  and Boundary Perception  . . . . . . . . 305:1--305:??
              Yuanyuan Wang and   
                   Meng Liu and   
               Xuemeng Song and   
                    Liqiang Nie   Harnessing Representative
                                  Spatial-Temporal Information for Video
                                  Question Answering . . . . . . . . . . . 306:1--306:??
               Guibiao Liao and   
                        Wei Gao   Rethinking Feature Mining for Light
                                  Field Salient Object Detection . . . . . 307:1--307:??
                 Chao Liang and   
                Linchao Zhu and   
               Zongxin Yang and   
                   Wei Chen and   
                        Yi Yang   Noise-Tolerant Hybrid Prototypical
                                  Learning with Noisy Web Data . . . . . . 308:1--308:??
                 Yitao Peng and   
                Lianghua He and   
                     Die Hu and   
                 Yihang Liu and   
              Longzhen Yang and   
                  Shaohua Shang   Decoupling Deep Learning for Enhanced
                                  Image Recognition Interpretability . . . 309:1--309:??
                  Baoli Sun and   
                 Yanjun Guo and   
               Tiantian Yan and   
                 Xinchen Ye and   
                Zhihui Wang and   
                  Haojie Li and   
                   Zhiyong Wang   Digging into Depth and Color Spaces: a
                                  Mapping Constraint Network for Depth
                                  Super-Resolution . . . . . . . . . . . . 310:1--310:??
            Michael Seufert and   
       Marius Spangenberger and   
      Fabian Poignée and   
             Florian Wamser and   
             Werner Robitza and   
         Christian Timmerer and   
           Tobias Hoßfeld   COBIRAS: Offering a Continuous Bit Rate
                                  Slide to Maximize DASH Streaming
                                  Bandwidth Utilization  . . . . . . . . . 311:1--311:??
             Zhangyong Tang and   
                Tianyang Xu and   
                Xiao-Jun Wu and   
                  Josef Kittler   Multi-Level Fusion for Robust RGBT
                                  Tracking via Enhanced Thermal
                                  Representation . . . . . . . . . . . . . 312:1--312:??
                  Hanyue Tu and   
                      Li Li and   
               Wengang Zhou and   
                    Houqiang Li   Reconstruction-Free Image Compression
                                  for Machine Vision via Knowledge
                                  Transfer . . . . . . . . . . . . . . . . 313:1--313:??
                  Gai Zhang and   
              Xinfeng Zhang and   
                        Lv Tang   Unified and Scalable Deep Image
                                  Compression Framework for Human and
                                  Machine  . . . . . . . . . . . . . . . . 314:1--314:??
                Fengyong Li and   
                Huajun Zhai and   
                   Teng Liu and   
              Xinpeng Zhang and   
                      Chuan Qin   Learning Compressed Artifact for JPEG
                                  Manipulation Localization Using
                                  Wide-Receptive-Field Network . . . . . . 315:1--315:??
                Shukang Yin and   
                 Sirui Zhao and   
                   Hao Wang and   
                    Tong Xu and   
                    Enhong Chen   Exploiting Instance-level Relationships
                                  in Weakly Supervised Text-to-Video
                                  Retrieval  . . . . . . . . . . . . . . . 316:1--316:??
          Kayhan Latifzadeh and   
             Nima Gozalpour and   
           V. Javier Traver and   
            Tuukka Ruotsalo and   
 Aleksandra Kawala-Sterniuk and   
                  Luis A. Leiva   Efficient Decoding of Affective States
                                  from Video-elicited EEG Signals: an
                                  Empirical Investigation  . . . . . . . . 317:1--317:??
                   Ziyue Wu and   
                  Junyu Gao and   
             Shucheng Huang and   
                  Changsheng Xu   Learning Commonsense-aware Moment-Text
                                  Alignment for Fast Video Temporal
                                  Grounding  . . . . . . . . . . . . . . . 318:1--318:??
            Daniele Lorenzi and   
          Farzad Tashtarian and   
         Hermann Hellwagner and   
             Christian Timmerer   MEDUSA: a Dynamic Codec Switching
                                  Approach in HTTP Adaptive Streaming  . . 319:1--319:??
                  Ruoyan Pi and   
                    Peng Wu and   
               Xiangteng He and   
                     Yuxin Peng   EOGT: Video Anomaly Detection with
                                  Enhanced Object Information and Global
                                  Temporal Dependency  . . . . . . . . . . 320:1--320:??
               Shengbin Yue and   
                  Yunbin Tu and   
                   Liang Li and   
             Shengxiang Gao and   
                    Zhengtao Yu   Multi-Grained Representation Aggregating
                                  Transformer with Gating Cycle for Change
                                  Captioning . . . . . . . . . . . . . . . 321:1--321:??
                Jingjing Wu and   
                    Xi Zhou and   
                Xiaohong Li and   
                    Hao Liu and   
                  Meibin Qi and   
                   Richang Hong   Asymmetric Deformable Spatio-temporal
                                  Framework for Infrared Object Tracking   322:1--322:??
                  Zhenyu Li and   
               Shanshan Gao and   
                 Deqian Mao and   
               Shouwen Song and   
                     Lei Li and   
                  Yuanfeng Zhou   Deep Plug-and-Play Non-Iterative Cluster
                                  for $3$D Global Feature Extraction . . . 323:1--323:??
                 Mingfu Xue and   
                 Yinghao Wu and   
               Leo Yu Zhang and   
                  Dujuan Gu and   
                Yushu Zhang and   
                   Weiqiang Liu   SSAT: Active Authorization Control and
                                  User's Fingerprint Tracking Framework
                                  for DNN IP Protection  . . . . . . . . . 324:1--324:??
                Yongkang Li and   
                Qifan Liang and   
                   Zhen Han and   
                 Wenjun Mai and   
                 Zhongyuan Wang   Few-Shot Face Sketch-to-Photo Synthesis
                                  via Global-Local Asymmetric
                                  Image-to-Image Translation . . . . . . . 325:1--325:??
                Shuqin Chen and   
                 Xian Zhong and   
                   Yi Zhang and   
                    Lei Zhu and   
                    Ping Li and   
              Xiaokang Yang and   
                      Bin Sheng   Action-aware Linguistic Skeleton
                                  Optimization Network for
                                  Non-autoregressive Video Captioning  . . 326:1--326:??
                Yancun Yang and   
                Weiqing Min and   
                Jingru Song and   
               Guorui Sheng and   
                  Lili Wang and   
                 Shuqiang Jiang   Lightweight Food Recognition via
                                  Aggregation Block and Feature Encoding   327:1--327:??
                Huaijin Liu and   
                 Jixiang Du and   
                 Yong Zhang and   
               Hongbo Zhang and   
                  Jiandian Zeng   MSSA: Multi-Representation
                                  Semantics-Augmented Set Abstraction for
                                  $3$D Object Detection  . . . . . . . . . 328:1--328:??
        Vinicius Sato Kawai and   
       Lucas Pascotti Valem and   
        Alexandro Baldassin and   
                Edson Borin and   
Daniel Carlos Guimarães Pedronette and   
             Longin Jan Latecki   Rank-based Hashing for Effective and
                                  Efficient Nearest Neighbor Search for
                                  Image Retrieval  . . . . . . . . . . . . 329:1--329:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 20, Number 11, November, 2024

                Ritesh Vyas and   
              Michele Nappi and   
          Alberto Del Bimbo and   
                  Sambit Bakshi   Introduction to Special Issue on
                                  ``Recent Trends in Multimedia
                                  Forensics''  . . . . . . . . . . . . . . 330:1--330:??
          Vincenzo Carletti and   
            Pasquale Foggia and   
              Antonio Greco and   
            Alessia Saggese and   
                    Mario Vento   Facial Soft-biometrics Obfuscation
                                  through Adversarial Attacks  . . . . . . 331:1--331:??
                Hanrui Wang and   
                  Shuo Wang and   
               Cunjian Chen and   
         Massimo Tistarelli and   
                        Zhe Jin   A Multi-Task Adversarial Attack against
                                  Face Authentication  . . . . . . . . . . 332:1--332:??
                    Tian Wu and   
                 Rongbo Zhu and   
                    Shaohua Wan   Semantic Map Guided Identity Transfer
                                  GAN for Person Re-identification . . . . 333:1--333:??
                D. K. Mahto and   
                A. K. Singh and   
                K. N. Singh and   
                O. P. Singh and   
                  A. K. Agrawal   Robust Copyright Protection Technique
                                  with High-embedding Capacity for Color
                                  Images . . . . . . . . . . . . . . . . . 334:1--334:??
               Shitharth S. and   
      Hariprasath Manoharan and   
           Alaa O. Khadidos and   
             Achyut Shankar and   
              Carsten Maple and   
           Adil O. Khadidos and   
                  Shahid Mumtaz   Improved Security for Multimedia Data
                                  Visualization using Hierarchical
                                  Clustering Algorithm . . . . . . . . . . 335:1--335:??
               Youqiang Sun and   
                 Jianyi Liu and   
                       Ru Zhang   Generative Image Steganography Based on
                                  Guidance Feature Distribution  . . . . . 336:1--336:??
            Paarth Neekhara and   
           Shehzeen Hussain and   
              Xinqiao Zhang and   
                   Ke Huang and   
             Julian McAuley and   
             Farinaz Koushanfar   FaceSigns: Semi-fragile Watermarks for
                                  Media Authentication . . . . . . . . . . 337:1--337:??
                  Jing Zhao and   
               Hongwei Yang and   
                     Hui He and   
                   Jie Peng and   
               Weizhe Zhang and   
                Jiangqun Ni and   
        Arun Kumar Sangaiah and   
            Aniello Castiglione   Backdoor Two-Stream Video Models on
                                  Federated Learning . . . . . . . . . . . 338:1--338:??
             Farkhund Iqbal and   
               Ahmed Abbasi and   
         Abdul Rehman Javed and   
             Ahmad Almadhor and   
               Zunera Jalil and   
                Sajid Anwar and   
                      Imad Rida   Data Augmentation-based Novel Deep
                                  Learning Method for Deepfaked Images
                                  Detection  . . . . . . . . . . . . . . . 339:1--339:??
                 Kaihan Lin and   
                Weihong Han and   
                 Shudong Li and   
                Zhaoquan Gu and   
                Huimin Zhao and   
                   Yangyang Mei   Detecting Deepfake Videos using
                                  Spatiotemporal Trident Network . . . . . 340:1--340:??
                Ijaz Ul Haq and   
       Khalid Mahmood Malik and   
                  Khan Muhammad   Multimodal Neurosymbolic Approach for
                                  Explainable Deepfake Detection . . . . . 341:1--341:??
         Federico Becattini and   
             Carmen Bisogni and   
              Vincenzo Loia and   
                Chiara Pero and   
                        Fei Hao   Head Pose Estimation Patterns as
                                  Deepfake Detectors . . . . . . . . . . . 342:1--342:??
              Luca Guarnera and   
             Oliver Giudice and   
            Sebastiano Battiato   Mastering Deepfake Detection: a
                                  Cutting-edge Approach to Distinguish GAN
                                  and Diffusion-model Images . . . . . . . 343:1--343:??
    Aakash Varma Nadimpalli and   
                  Ajita Rattani   ProActive DeepFake Detection using
                                  GAN-based Visible Watermarking . . . . . 344:1--344:??
              Bachir Kaddar and   
            Sid Ahmed Fezza and   
               Zahid Akhtar and   
          Wassim Hamidouche and   
             Abdenour Hadid and   
     Joan Serra-Sagristá   Deepfake Detection Using Spatiotemporal
                                  Transformer  . . . . . . . . . . . . . . 345:1--345:??
                 Shuai Xiao and   
                 Zhuo Zhang and   
               Jiachen Yang and   
                 Jiabao Wen and   
                        Yang Li   Forgery Detection by Weighted
                                  Complementarity between Significant
                                  Invariance and Detail Enhancement  . . . 346:1--346:??
              Paola Capasso and   
          Giuseppe Cattaneo and   
               Maria De Marsico   A Comprehensive Survey on Methods for
                                  Image Integrity  . . . . . . . . . . . . 347:1--347:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 20, Number 12, December, 2024

               Hongbin Wang and   
                   Rui Tang and   
                         Fan Li   Hypercube Pooling for Visual Semantic
                                  Embedding  . . . . . . . . . . . . . . . 363:1--363:??
                   Fei Wang and   
                 Liang Ding and   
                    Jun Rao and   
                     Ye Liu and   
                    Li Shen and   
                 Changxing Ding   Can Linguistic Knowledge Improve
                                  Multimodal Alignment in Vision-Language
                                  Pretraining? . . . . . . . . . . . . . . 364:1--364:??
                 Caixia Liu and   
                  Yali Chen and   
                Minhong Zhu and   
                Chenhui Hao and   
                Haisheng Li and   
                 Xiaochuan Wang   DEGAN: Detail-Enhanced Generative
                                  Adversarial Network for Monocular
                                  Depth-Based $3$D Reconstruction  . . . . 365:1--365:??
                   Dan Song and   
                Shumeng Huo and   
                  Xinwei Fu and   
              Chumeng Zhang and   
                  Wenhui Li and   
                      An-An Liu   Cross-Modal Contrastive Learning with a
                                  Style-Mixed Bridge for Single Image $3$D
                                  Shape Retrieval  . . . . . . . . . . . . 366:1--366:??
               Ting-Lan Lin and   
                Bing-Wei Su and   
              Po-Cheng Shen and   
             Ding-Yuan Chen and   
               Chi-Fu Liang and   
             Yan-Cheng Chen and   
               Yangming Wen and   
                Mohammad Shahid   Upsampling Algorithm for V-PCC-Coded
                                  $3$D Point Clouds  . . . . . . . . . . . 367:1--367:??
               Yuanzhi Wang and   
                    Yong Li and   
               Xiaoya Zhang and   
                    Xin Liu and   
                   Anbo Dai and   
             Antoni B. Chan and   
                       Zhen Cui   Edit Temporal-Consistent Videos with
                                  Image Diffusion Model  . . . . . . . . . 368:1--368:??
               Luis Alvarez and   
    Agustín Trujillo and   
       Nelson Monzón and   
              Jean-Michel Morel   Generation and Editing of $2$D Shapes
                                  Using a Branched Representation  . . . . 369:1--369:??
                 Xingbo Liu and   
                  Jiamin Li and   
                Xiushan Nie and   
              Xuening Zhang and   
                     Yilong Yin   Fast Unsupervised Cross-Modal Hashing
                                  with Robust Factorization and Dual
                                  Projection . . . . . . . . . . . . . . . 370:1--370:??
             Yongheng Zhang and   
              Yuanqiang Cai and   
                Danfeng Yan and   
                   Rongheng Lin   Real-World Scene Image Enhancement with
                                  Contrastive Domain Adaptation Learning   371:1--371:??
               Chunqiang Yu and   
              Shichao Cheng and   
             Xianquan Zhang and   
              Xinpeng Zhang and   
                   Zhenjun Tang   Reversible Data Hiding in Shared JPEG
                                  Images . . . . . . . . . . . . . . . . . 372:1--372:??
                 Boqian Liu and   
                  Haojie Li and   
                Zhihui Wang and   
                    Tianfan Xue   Transparent Depth Completion Using
                                  Segmentation Features  . . . . . . . . . 373:1--373:??
               Yongtang Bao and   
                Chunjian Su and   
                  Yutong Qi and   
               Yanbing Geng and   
                      Haojie Li   Category-Level Pose Estimation and
                                  Iterative Refinement for Monocular RGB-D
                                  Image  . . . . . . . . . . . . . . . . . 374:1--374:??
                Kuiyuan Sun and   
               Xiaolong Liu and   
                Xiaolong Li and   
                   Yao Zhao and   
                       Wei Wang   Multi-Modal Driven Pose-Controllable
                                  Talking Head Generation  . . . . . . . . 375:1--375:??
                   Bing Liu and   
                   Jinfu Lu and   
               Mingming Liu and   
                    Hao Liu and   
                  Yong Zhou and   
                  Dongping Yang   Diverse Image Captioning via Panoptic
                                  Segmentation and Sequential Conditional
                                  Variational Transformer  . . . . . . . . 376:1--376:??
         Veronika Stephanie and   
             Ibrahim Khalil and   
           Mohammed Atiquzzaman   Weight-Based Privacy-Preserving
                                  Asynchronous SplitFed for Multimedia
                                  Healthcare Data  . . . . . . . . . . . . 377:1--377:??
                Chuanhao Li and   
              Chenchen Jing and   
                    Zhen Li and   
                   Yuwei Wu and   
                      Yunde Jia   Adversarial Sample Synthesis for Visual
                                  Question Answering . . . . . . . . . . . 378:1--378:??
                Shipeng Zhu and   
                   Jun Fang and   
               Pengfei Fang and   
                        Hui Xue   Improving Scene Text Retrieval via
                                  Stylized Middle Modality . . . . . . . . 379:1--379:??
                 Xiao Liang and   
                 Erkun Yang and   
                 Cheng Deng and   
                    Yanhua Yang   CrossFormer: Cross-Modal Representation
                                  Learning via Heterogeneous Graph
                                  Transformer  . . . . . . . . . . . . . . 380:1--380:??
                  Jiayu Lin and   
                  Yuan-Gen Wang   TSFormer: Tracking Structure Transformer
                                  for Image Inpainting . . . . . . . . . . 381:1--381:??
                  Yixuan Li and   
                Peilin Chen and   
                 Hanwei Zhu and   
                 Keyan Ding and   
                   Leida Li and   
                     Shiqi Wang   Deep Shape-Texture Statistics for
                                  Completely Blind Image Quality
                                  Evaluation . . . . . . . . . . . . . . . 382:1--382:??
                Zhenyu Zhou and   
                  Qing Liao and   
                    Lei Luo and   
                Xinwang Liu and   
                         En Zhu   ProtoRefine: Enhancing Prototypes with
                                  Similar Structure in Few-Shot Learning   383:1--383:??
                 Mengzhu Yu and   
               Zhenjun Tang and   
             Xiaoping Liang and   
             Xianquan Zhang and   
                  Zhixin Li and   
                  Xinpeng Zhang   Robust Hashing with Deep Features and
                                  Meixner Moments for Image Copy Detection 384:1--384:??
                 Jiabei Liu and   
             Weiming Zhuang and   
               Yuanyuan Liu and   
               Yonggang Wen and   
                  Jun Huang and   
                        Wei Lin   Personalized Federated Mutual Learning
                                  for Unsupervised Camera-Aware Person
                                  Re-Identification  . . . . . . . . . . . 385:1--385:??
                  Yiyang Ma and   
               Haowei Kuang and   
                  Huan Yang and   
                Jianlong Fu and   
                    Jiaying Liu   Prompt-Based Modality Bridging for
                                  Unified Text-to-Face Generation and
                                  Manipulation . . . . . . . . . . . . . . 386:1--386:??
                Peilin Chen and   
                 Shiqi Wang and   
                         Zhu Li   Occupancy Map Guided Attributes
                                  Artifacts Removal for Video-Based Point
                                  Cloud Compression  . . . . . . . . . . . 387:1--387:??
                  Yunda Sun and   
                  Lin Zhang and   
                 Zhong Wang and   
                  Yang Chen and   
              Shengjie Zhao and   
                    Yicong Zhou   I2P Registration by Learning the
                                  Underlying Alignment Feature Space from
                                  Pixel-to-Point Similarities  . . . . . . 388:1--388:??
               Daniel Gebre and   
                Siem Hadish and   
                Aron Sbhatu and   
            Moayad Aloqaily and   
                 Mohsen Guizani   Establishing Trust and Security in
                                  Decentralized Metaverse: a Web 3.0
                                  Approach . . . . . . . . . . . . . . . . 389:1--389:??
                Yangjun Mao and   
                   Jun Xiao and   
                 Dong Zhang and   
                   Meng Cao and   
                  Jian Shao and   
             Yueting Zhuang and   
                      Long Chen   Improving Reference-Based Distinctive
                                  Image Captioning with Contrastive
                                  Rewards  . . . . . . . . . . . . . . . . 390:1--390:??
                Shenglan Li and   
                    Rui Yao and   
                  Yong Zhou and   
               Hancheng Zhu and   
                 Jiaqi Zhao and   
                Zhiwen Shao and   
         Abdulmotaleb El Saddik   Motion-Aware Self-Supervised RGBT
                                  Tracking with Multi-Modality
                                  Hierarchical Transformers  . . . . . . . 391:1--391:??
                   Jun Ling and   
                    Han Xue and   
                  Anni Tang and   
                   Rong Xie and   
                        Li Song   ViCoFace: Learning Disentangled Latent
                                  Motion Representations for
                                  Visual-Consistent Face Reenactment . . . 392:1--392:??
                 Jiachen Li and   
                   Qing Xie and   
              Xiaojun Chang and   
                   Jinyu Xu and   
                   Yongjian Liu   Mutually-Guided Hierarchical Multi-Modal
                                  Feature Learning for Referring Image
                                  Segmentation . . . . . . . . . . . . . . 393:1--393:??
            Fatima Alshehri and   
                Ghulam Muhammad   Ischemic Stroke Segmentation by
                                  Transformer and Convolutional Neural
                                  Network Using Few-Shot Learning  . . . . 394:1--394:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 21, Number 1, January, 2025

             Bogdan Ionescu and   
             Ioannis Patras and   
        Henning Müller and   
              Alberto Del Bimbo   Introduction to the Special Issue on
                                  Realistic Synthetic Data: Generation,
                                  Learning, Evaluation . . . . . . . . . . 1:1--1:??
             Adam Westerski and   
                  Wee Teck Fong   Synthetic Data for Object Detection with
                                  Neural Networks: State-of-the-Art Survey
                                  of Domain Randomisation Techniques . . . 2:1--2:??
                  Bruno Vaz and   
         Álvaro Figueira   GANs in the Panorama of Synthetic Data
                                  Generation Methods . . . . . . . . . . . 3:1--3:??
                Azeez Idris and   
           Mohammed Khaleel and   
        Wallapak Tavanapong and   
               Piet C. De Groen   Synthesized Image Training Techniques:
                                  On Improving Model Performance Using
                                  Confusion  . . . . . . . . . . . . . . . 4:1--4:??
                 Wenmiao Hu and   
                 Yifang Yin and   
              Ying Kiat Tan and   
                    An Tran and   
              Hannes Kruppa and   
               Roger Zimmermann   GAN-Assisted Road Segmentation from
                                  Satellite Imagery  . . . . . . . . . . . 5:1--5:??
             Fabio Hellmann and   
              Silvan Mertes and   
            Mohamed Benouis and   
          Alexander Hustinx and   
          Tzung-Chien Hsieh and   
            Cristina Conati and   
              Peter Krawitz and   
         Elisabeth André   GANonymization: a GAN-Based Face
                                  Anonymization Framework for Preserving
                                  Emotional Expressions  . . . . . . . . . 6:1--6:??
                Kaifeng Zou and   
             Sylvain Faisan and   
                  Boyang Yu and   
          Sebastien Valette and   
                     Hyewon Seo   $4$D Facial Expression Diffusion Model   7:1--7:??
                  Anjali T. and   
                   Masilamani V   Text-Guided Synthesis of Masked Face
                                  Images . . . . . . . . . . . . . . . . . 8:1--8:??
                  Xin Huang and   
                 Dong Liang and   
                Hongrui Cai and   
                Yunfeng Bai and   
               Juyong Zhang and   
                  Feng Tian and   
                    Jinyuan Jia   Double Reference Guided Interactive $2$D
                                  and $3$D Caricature Generation . . . . . 9:1--9:??
              Chaitra Desai and   
                Sujay Benur and   
               Ujwala Patil and   
                 Uma Mudenagudi   RSUIGM: Realistic Synthetic Underwater
                                  Image Generation with Image Formation
                                  Model  . . . . . . . . . . . . . . . . . 10:1--10:??
            Roberto Amoroso and   
             Davide Morelli and   
            Marcella Cornia and   
            Lorenzo Baraldi and   
          Alberto Del Bimbo and   
                 Rita Cucchiara   Parents and Children: Distinguishing
                                  Multimodal Deepfakes from Natural Images 11:1--11:??
               Pedro Celard and   
       Eva Lorenzo Iglesias and   
Jose Manuel Sorribes-Fernández and   
            Lourdes Borrajo and   
     Adrián Seara Vieira   New Metrics and Dataset for Biological
                                  Development Video Generation . . . . . . 12:1--12:??
               Lysa Gramoli and   
               Julien Cumin and   
Jérémy Lacoche and   
         Anthony Foulonneau and   
              Bruno Arnaldi and   
       Valérie Gouranton   Generating and Evaluating Data of Daily
                                  Activities with an Autonomous Agent in a
                                  Virtual Smart Home . . . . . . . . . . . 13:1--13:??
               Louis Airale and   
      Xavier Alameda-Pineda and   
Stéphane Lathuili\`ere and   
           Dominique Vaufreydaz   Autoregressive GAN for Semantic
                                  Unconditional Head Motion Generation . . 14:1--14:??
           Kerim Hod\vzi\'c and   
             Mirsad Cosovic and   
               Sasa Mrdovic and   
           Jason J. Quinlan and   
                    Darijo Raca   DashReStreamer: Framework for Creation
                                  of Impaired Video Clips under Realistic
                                  Network Conditions . . . . . . . . . . . 15:1--15:??
   Mihai Gabriel Constantin and   
       Dan-Cristian Stanciu and   
      Liviu-Daniel \cStefan and   
              Mihai Dogariu and   
           Dan Mih\uailescu and   
             George Ciobanu and   
              Matt Bergeron and   
                Winston Liu and   
           Konstantin Belov and   
              Octavian Radu and   
                 Bogdan Ionescu   Exploring Generative Adversarial
                                  Networks for Augmenting Network
                                  Intrusion Detection Tasks  . . . . . . . 16:1--16:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 21, Number 2, February, 2025

                Yushu Zhang and   
              William Puech and   
             Anderson Rocha and   
                Rongxing Lu and   
             Stefano Cresci and   
              Roberto Di Pietro   Introduction to the Special Issue on
                                  Security and Privacy of Avatar in
                                  Metaverse  . . . . . . . . . . . . . . . 41:1--41:??
                   Fan Wang and   
                Zhangjie Fu and   
                    Xiang Zhang   A Self-Defense Copyright Protection
                                  Scheme for NFT Image Art Based on
                                  Information Embedding  . . . . . . . . . 42:1--42:??
                Jinwei Wang and   
                Haihua Wang and   
               Jiawei Zhang and   
                     Hao Wu and   
              Xiangyang Luo and   
                         Bin Ma   Invisible Adversarial Watermarking: a
                                  Novel Security Mechanism for Enhancing
                                  Copyright Protection . . . . . . . . . . 43:1--43:??
                   Rui Zhai and   
                Rongrong Ni and   
                    Yang Yu and   
                       Yao Zhao   FaceDefend: Copyright Protection to
                                  Prevent Face Embezzle  . . . . . . . . . 44:1--44:??
               Hanqing Zhao and   
                 Wenbo Zhou and   
              Dongdong Chen and   
              Weiming Zhang and   
                   Ying Guo and   
                 Zhen Cheng and   
                Pengfei Yan and   
                     Nenghai Yu   Audio-Visual Contrastive Pre-train for
                                  Face Forgery Detection . . . . . . . . . 45:1--45:??
                  Long Tang and   
                 Dengpan Ye and   
                 Zhenhao Lu and   
              Yunming Zhang and   
                   Chuanxi Chen   Feature Extraction Matters More: an
                                  Effective and Efficient Universal
                                  Deepfake Disruptor . . . . . . . . . . . 46:1--46:??
                 Jian Zhang and   
                Jiangqun Ni and   
                    Fan Nie and   
                     Jiwu Huang   Domain-invariant and
                                  Patch-discriminative Feature Learning
                                  for General Deepfake Detection . . . . . 47:1--47:??
             Dengyong Zhang and   
                 Wenjie Zhu and   
                   Xin Liao and   
                  Feifan Qi and   
                 Gaobo Yang and   
                 Xiangling Ding   Spatiotemporal Inconsistency Learning
                                  and Interactive Fusion for Deepfake
                                  Video Detection  . . . . . . . . . . . . 48:1--48:??
                   Rui Yang and   
                  Rushi Lan and   
              Zhenrong Deng and   
                Xiaonan Luo and   
                      Xiyan Sun   Deepfake Video Detection Using Facial
                                  Feature Points and Ch-Transformer  . . . 49:1--49:??
              Jianheng Tang and   
                  Kejia Fan and   
                 Wenjie Yin and   
                Shihao Yang and   
              Yajiang Huang and   
                 Anfeng Liu and   
               Naixue Xiong and   
             Mianxiong Dong and   
                  Tian Wang and   
                   Shaobo Zhang   A Quality-Aware and Obfuscation-Based
                                  Data Collection Scheme for
                                  Cyber-Physical Metaverse Systems . . . . 50:1--50:??
               Xiaoxuan Han and   
               Songlin Yang and   
                   Wei Wang and   
                   Ziwen He and   
                      Jing Dong   Exploiting Backdoors of Face Synthesis
                                  Detection with Natural Triggers  . . . . 51:1--51:??
               Jiuzhen Zeng and   
           Laurence T. Yang and   
                  Chao Wang and   
                  Junjie Su and   
                   Xianjun Deng   A New Tensor Summary Statistic for
                                  Real-Time Detection of Stealthy Anomaly
                                  in Avatar Interaction  . . . . . . . . . 52:1--52:??
                 Letian Sha and   
                  Xiao Chen and   
                    Fu Xiao and   
                 Zhong Wang and   
               Zhangbo Long and   
                 Qianyu Fan and   
                   Jiankuo Dong   VRVul-Discovery: BiLSTM-based
                                  Vulnerability Discovery for Virtual
                                  Reality Devices in Metaverse . . . . . . 53:1--53:??
                   Gui Xiao and   
                  Zhen Ling and   
                 Qunqun Fan and   
                 Xiangyu Xu and   
                  Wenjia Wu and   
                  Ding Ding and   
                  Chen Chen and   
                      Xinwen Fu   Pivot: Panoramic-Image-Based VR User
                                  Authentication against Side-Channel
                                  Attacks  . . . . . . . . . . . . . . . . 54:1--54:??
                 Yalin Song and   
               Wenbin Jiang and   
                 Xiuli Chai and   
                 Zhihua Gan and   
              Mengyuan Zhou and   
                       Lei Chen   Cross-Attention Based Two-Branch
                                  Networks for Document Image Forgery
                                  Localization in the Metaverse  . . . . . 55:1--55:??
                 Yuanman Li and   
                  Lanhao Ye and   
                 Haokun Cao and   
                   Wei Wang and   
                   Zhongyun Hua   Cascaded Adaptive Graph Representation
                                  Learning for Image Copy--Move Forgery
                                  Detection  . . . . . . . . . . . . . . . 56:1--56:??
                    Cong Hu and   
             Xiao-Zhong Wei and   
                    Xiao-Jun Wu   DIRformer: a Novel Image Restoration
                                  Approach Based on U-shaped Transformer
                                  and Diffusion Models . . . . . . . . . . 57:1--57:??
                    Yuyu Xu and   
             Pingping Zhang and   
               Minghui Chen and   
               Qiudan Zhang and   
                  Wenhui Wu and   
                  Yun Zhang and   
                        Xu Wang   RGB-D Data Compression via
                                  Bi-Directional Cross-Modal Prior
                                  Transfer and Enhanced Entropy Modeling   58:1--58:??
                 Jiayu Yang and   
                Yongqi Zhai and   
                  Wei Jiang and   
               Chunhui Yang and   
                   Feng Gao and   
                  Ronggang Wang   Adaptive Prediction Structure for
                                  Learned Video Compression  . . . . . . . 59:1--59:??
                 Yifan Wang and   
                 Liang Feng and   
                Fenglin Cai and   
                    Lusi Li and   
                     Rui Wu and   
                         Jie Li   TEC-CNN: Toward Efficient Compressing of
                                  Convolutional Neural Nets with Low-rank
                                  Tensor Decomposition . . . . . . . . . . 60:1--60:??
           Chong-Yang Xiang and   
                    Xiao Wu and   
                 Jun-Yan He and   
              Zhaoquan Yuan and   
                    Tingquan He   Person in Uniforms Re-Identification . . 61:1--61:??
                  Xiyao Liu and   
               Cundian Yang and   
                Jianbiao He and   
                   Hui Fang and   
            Gerald Schaefer and   
                 Jian Zhang and   
               Yuesheng Zhu and   
                  Shichao Zhang   Attack-Defending Contrastive Learning
                                  for Volumetric Medical Image
                                  Zero-Watermarking  . . . . . . . . . . . 62:1--62:??
                   Anqi Cao and   
                Zhijing Wan and   
                  Xiao Wang and   
                    Wei Liu and   
                   Wei Wang and   
                 Zheng Wang and   
                         Xin Xu   Diversity-Representativeness Replay and
                                  Knowledge Alignment for Lifelong Vehicle
                                  Re-identification  . . . . . . . . . . . 63:1--63:??
             Xiaonuo Dongye and   
               Haiyan Jiang and   
              Dongdong Weng and   
                Zhenliang Zhang   Demonstrative Learning for Human-Agent
                                  Knowledge Transfer . . . . . . . . . . . 64:1--64:??
              Chengxin Zhao and   
                 Hefei Ling and   
                Jialie Shen and   
                   Han Fang and   
                 Sijing Xie and   
                Yaokun Fang and   
                  Zongyi Li and   
                        Ping Li   GSyncCode: Geometry Synchronous Hidden
                                  Code for One-step Photography Decoding   65:1--65:??
               Xiaolin Chen and   
               Xuemeng Song and   
                Jianhui Zuo and   
                 Yinwei Wei and   
                Liqiang Nie and   
                  Tat-Seng Chua   Domain-aware Multimodal Dialog Systems
                                  with Distribution-based User
                                  Characteristic Modeling  . . . . . . . . 66:1--66:??
                Chenghao Li and   
                     Lei Qi and   
                       Xin Geng   A SAM-guided Two-stream Lightweight
                                  Model for Anomaly Detection  . . . . . . 67:1--67:??
                  Ji-Yan Wu and   
              Kasun Gamlath and   
                   Archan Misra   Pr-Ge-Ne: Efficient Encoding of
                                  Pervasive Video Sensing Streams by
                                  Pruned Generative Networks . . . . . . . 68:1--68:??
                     Wei Ji and   
                      Li Li and   
                   Zheqi Lv and   
              Wenqiao Zhang and   
                  Mengze Li and   
                   Zhen Wan and   
               Wenqiang Lei and   
               Roger Zimmermann   Backpropagation-Free Multi-modal
                                  On-Device Model Adaptation via
                                  Cloud-Device Collaboration . . . . . . . 69:1--69:??
                  Heqi Peng and   
               Yunhong Wang and   
                Ruijie Yang and   
                 Beichen Li and   
                   Rui Wang and   
                   Yuanfang Guo   AED-PADA: Improving Generalizability of
                                  Adversarial Example Detection via
                                  Principal Adversarial Domain Adaptation  70:1--70:??
                    Ning Xu and   
               Xiaowen Wang and   
                   Jing Liu and   
                Lanjun Wang and   
                  Xuanya Li and   
               Mengxiao Zhu and   
             Yongdong Zhang and   
                      An-An Liu   Model Can Be Subtle: Two Important
                                  Mechanisms for Social Media Popularity
                                  Prediction . . . . . . . . . . . . . . . 71:1--71:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 21, Number 3, March, 2025

               Jiapeng Wang and   
                 Zening Lin and   
                 Dayi Huang and   
              Longfei Xiong and   
                    Lianwen Jin   LiLTv2: Language-substitutable
                                  Layout-image Transformer for Visual
                                  Information Extraction . . . . . . . . . 72:1--72:??
                   Yili Jin and   
                  Jiahao Li and   
                     Bin Li and   
                         Yan Lu   Neural Image Compression with Regional
                                  Decoding . . . . . . . . . . . . . . . . 73:1--73:??
                Xiaotian Wu and   
                Xinjie Feng and   
                  Bing Chen and   
            Ching-Nung Yang and   
               Qing-Yu Peng and   
                      Weiqi Yan   EVCS-DAS: Evolving Visual Cryptography
                                  Schemes for Dynamic Access Structures    74:1--74:??
  Mohamed Zakariya Talhaoui and   
               Zhelong Wang and   
       Mohamed Amine Midoun and   
          Abdelkarim Smaili and   
     Djamel Eddine Mekkaoui and   
             Mourad Lablack and   
                       Ke Zhang   Vulnerability Detection and Improvements
                                  of an Image Cryptosystem for Real-Time
                                  Visual Protection  . . . . . . . . . . . 75:1--75:??
                     Kai Xu and   
                Lichun Wang and   
                  Shuang Li and   
                   Tong Gao and   
                     Baocai Yin   Scene Adaptive Context Modeling and
                                  Balanced Relation Prediction for Scene
                                  Graph Generation . . . . . . . . . . . . 76:1--76:??
          Khouloud Samrouth and   
           Pia El Housseini and   
               Olivier Deforges   Siamese Network-Based Detection of
                                  Deepfake Impersonation Attacks with a
                                  Person of Interest Approach  . . . . . . 77:1--77:??
                Yiping Yang and   
                 Baiyun Cui and   
                    Yingming Li   A Multimodal Hierarchical Attentional
                                  Ordering Network . . . . . . . . . . . . 78:1--78:??
               Haoxian Ruan and   
                  Zhihua Xu and   
               Zhijing Yang and   
                  Yongyi Lu and   
                Jinghui Qin and   
                  Tianshui Chen   Learning Semantic-aware Representation
                                  in Visual-Language Models for
                                  Multi-label Recognition with Partial
                                  Labels . . . . . . . . . . . . . . . . . 79:1--79:??
                    Kun Yan and   
              Zied Bouraoui and   
                Fangyun Wei and   
                   Chang Xu and   
                  Ping Wang and   
              Shoaib Jameel and   
              Steven Schockaert   Modeling Multi-modal Cross-interaction
                                  for Multi-label Few-shot Image
                                  Classification Based on Local Feature
                                  Selection  . . . . . . . . . . . . . . . 80:1--80:??
                  Yajie Liu and   
                      Pu Ge and   
               Guodong Wang and   
                Qingjie Liu and   
                       Di Huang   Multi-Grained Contrastive Learning for
                                  Text-Supervised Open-Vocabulary Semantic
                                  Segmentation . . . . . . . . . . . . . . 81:1--81:??
                 Yipei Chen and   
                   Hua Yuan and   
                  Baojun Ma and   
                 Limin Wang and   
                        Yu Qian   Beyond Songs: Analyzing User Sentiment
                                  through Music Playlists and Multimodal
                                  Data . . . . . . . . . . . . . . . . . . 82:1--82:??
                 Yuzhen Niu and   
                  Yeyuan Xu and   
                 Yuezhou Li and   
              Jiabang Zhang and   
                   Yuzhong Chen   Skeleton-Boundary-Guided Network for
                                  Camouflaged Object Detection . . . . . . 83:1--83:??
             Xiaofeng Zhang and   
                  Zishan Xu and   
                   Hao Tang and   
                Chaochen Gu and   
                   Wei Chen and   
         Abdulmotaleb El Saddik   Wakeup-Darkness: When Multimodal Meets
                                  Unsupervised Low-Light Image Enhancement 84:1--84:??
                 Jiahang Tu and   
                     Wei Ji and   
                Hanbin Zhao and   
                 Chao Zhang and   
           Roger Zimmermann and   
                       Hui Qian   DriveDiTFit: Fine-tuning Diffusion
                                  Transformers for Autonomous Driving Data
                                  Generation . . . . . . . . . . . . . . . 85:1--85:??
                 Yifan Jiao and   
              Chenglong Cai and   
                   Bing-Kun Bao   Unified Text-Image Space Alignment with
                                  Cross-Modal Prompting in CLIP for UDA    86:1--86:??
                 Feifei Kou and   
               Bingwei Wang and   
                Haisheng Li and   
             Chuangying Zhu and   
                    Lei Shi and   
                Jiwei Zhang and   
                       Limei Qi   Potential Features Fusion Network for
                                  Multimodal Fake News Detection . . . . . 87:1--87:??
                 Shihao Zou and   
                  Yuanlu Xu and   
        Nikolaos Sarafianos and   
              Federica Bogo and   
                  Tony Tung and   
                  Weixin Si and   
                       Li Cheng   Generating High-Fidelity Clothed Human
                                  Dynamics with Temporal Diffusion . . . . 88:1--88:??
                Jiaxin Chen and   
                   Xin Liao and   
              Zhenxing Qian and   
                      Zheng Qin   PRest-Net: Multi-domain Probability
                                  Estimation Network for Robust Image
                                  Forgery Detection  . . . . . . . . . . . 89:1--89:??
                   Qiang Li and   
                     Di Liu and   
                   Guang Zu and   
                     Sen Li and   
                    Hui Sun and   
                 Jianzhong Wang   Multigranularity Feature Aggregation and
                                  Cross-level Boundary Modeling for
                                  Temporal Action Detection  . . . . . . . 90:1--90:??
                  Lin Huang and   
                  Chuan Qin and   
                Guorui Feng and   
              Xiangyang Luo and   
                  Xinpeng Zhang   New Framework of Robust Image Encryption 91:1--91:??
                Jiayue Chen and   
              Xiaomeng Wang and   
                    Tong Xu and   
                      Shiwei Wu   Towards Scene-Centric Multi-Level
                                  Interest Mining for Video Recommendation 92:1--92:??
                Xiusheng Lu and   
                 Yanbin Hao and   
               Lechao Cheng and   
               Sicheng Zhao and   
                  Yutao Liu and   
                    Mingli Song   Mixed Attention and Channel Shift
                                  Transformer for Efficient Action
                                  Recognition  . . . . . . . . . . . . . . 93:1--93:??
               Haifeng Zhao and   
                  Chi Zhang and   
                  Deyin Liu and   
                         Lin Wu   Deformation Field Fusion for Medical
                                  Image Registration . . . . . . . . . . . 94:1--94:??
                  Lisong Ou and   
                      Zhixin Li   Multi-modal Sarcasm Detection on Social
                                  Media via Multi-Granularity Information
                                  Fusion . . . . . . . . . . . . . . . . . 95:1--95:??
                      Ao Fu and   
                 Jiaqi Zhao and   
                  Yong Zhou and   
                Wenliang Du and   
                    Rui Yao and   
         Abdulmotaleb El Saddik   Similarity Regulation and Calibration
                                  Alignment for Weakly Supervised
                                  Text-Based Person Re-Identification  . . 96:1--96:??
                Shaojun Zhu and   
               Bincheng Zhu and   
                 Kaikai Chi and   
                 Jiefan Qiu and   
                Hailong Shi and   
                     Xingyu Gao   Maximizing Long-Term Task Completion
                                  Ratio of UAV-Enabled Wirelessly Powered
                                  MEC Systems  . . . . . . . . . . . . . . 97:1--97:??
               Xuanqing Cao and   
               Wengang Zhou and   
                     Qi Sun and   
                Weilun Wang and   
                      Li Li and   
                    Houqiang Li   DISA: Disentangled Dual-Branch Framework
                                  for Affordance-Aware Human Insertion . . 98:1--98:??
               Marco Mameli and   
            Marina Paolanti and   
            Adriano Mancini and   
           Primo Zingaretti and   
              Roberto Pierdicca   RenderGAN: Enhancing Real-time Rendering
                                  Efficiency with Deep Learning  . . . . . 99:1--99:??
                    Lv Tang and   
              Xinfeng Zhang and   
                       Li Zhang   UVC: a Unified Deep Video Compression
                                  Framework  . . . . . . . . . . . . . . . 100:1--100:??
                  Shen Wang and   
                    Yu Wang and   
                Renjie Qiao and   
                   Kejun Wu and   
               Chia-Wen Lin and   
                   Chengtao Cai   Multi-Scale Dynamic Fusion for
                                  Visible-Infrared Person
                                  Re-Identification  . . . . . . . . . . . 101:1--101:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 21, Number 4, April, 2025

                    Dan Guo and   
              Troy McDaniel and   
                Shuhui Wang and   
                      Meng Wang   Introduction to the Special Issue on
                                  Deep Learning for Robust Human Body
                                  Language Understanding . . . . . . . . . 103:1--103:??
                 Jian Zhang and   
                  Kaihao He and   
                    Ting Yu and   
                     Jun Yu and   
                  Zhenming Yuan   Semi-Supervised RGB-D Hand Gesture
                                  Recognition via Mutual Learning of
                                  Self-Supervised Models . . . . . . . . . 104:1--104:??
              Shengeng Tang and   
                   Feng Xue and   
                Jingjing Wu and   
                  Shuo Wang and   
                   Richang Hong   Gloss-driven Conditional Diffusion
                                  Models for Sign Language Production  . . 105:1--105:??
                Kaixin Chen and   
                  Lin Zhang and   
                 Zhong Wang and   
              Shengjie Zhao and   
                    Yicong Zhou   Skeleton-Aware Graph-Based Adversarial
                                  Networks for Human Pose Estimation from
                                  Sparse IMUs  . . . . . . . . . . . . . . 106:1--106:??
                  Zhewei Tu and   
                Xiangbo Shu and   
                 Peng Huang and   
                    Rui Yan and   
               Zhenxing Liu and   
                  Jiachao Zhang   Leveraging Frame- and Feature-level
                                  Progressive Augmentation for
                                  Semi-supervised Action Recognition . . . 107:1--107:??
               Linhua Xiang and   
                    Zengfu Wang   Joint Mixing Data Augmentation for
                                  Skeleton-Based Action Recognition  . . . 108:1--108:??
                  Zenan Shi and   
                  Wenyu Liu and   
                   Haipeng Chen   Face Reconstruction-Based Generalized
                                  Deepfake Detection Model with Residual
                                  Outlook Attention  . . . . . . . . . . . 109:1--109:??
                    Peng He and   
                     Jun Yu and   
                Chengjie Ge and   
                      Ye Yu and   
                     Wei Xu and   
                   Lei Wang and   
                 Tianyu Liu and   
                       Zhen Kan   Domain-Separated Bottleneck Attention
                                  Fusion Framework for Multimodal Emotion
                                  Recognition  . . . . . . . . . . . . . . 110:1--110:??
                    Yan Gan and   
               Chenxue Yang and   
                     Mao Ye and   
               Renjie Huang and   
                 Deqiang Ouyang   Generative Adversarial Networks with
                                  Learnable Auxiliary Module for Image
                                  Synthesis  . . . . . . . . . . . . . . . 111:1--111:??
                    Wei Liu and   
                     Xin Xu and   
                  Hua Chang and   
                   Xin Yuan and   
                     Zheng Wang   Mix-Modality Person Re-Identification: a
                                  New and Practical Paradigm . . . . . . . 112:1--112:??
                  Nianzi Li and   
              Guijuan Zhang and   
                    Ping Du and   
                     Dianjie Lu   GP-HSI: Human-Scene Interaction with
                                  Geometric and Physical Constraints . . . 113:1--113:??
                Enyuan Zhao and   
                  Ning Song and   
                   Ze Zhang and   
                    Jie Nie and   
               Xinyue Liang and   
                   Zhiqiang Wei   Language-guided Bias Generation
                                  Contrastive Strategy for Visual Question
                                  Answering  . . . . . . . . . . . . . . . 114:1--114:??
                   Kun Wang and   
                 Jiuxin Cao and   
                  Jiawei Ge and   
                  Chang Liu and   
                         Bo Liu   Dual-Domain Triple Contrast for
                                  Cross-Dataset Skeleton-Based Action
                                  Recognition  . . . . . . . . . . . . . . 115:1--115:??
                  Runing Li and   
               Jiangyan Dai and   
                 Qibing Qin and   
             Chengduan Wang and   
               Huihui Zhang and   
                       Yugen Yi   Texture and Structure-Guided
                                  Dual-Attention Mechanism for Image
                                  Inpainting . . . . . . . . . . . . . . . 116:1--116:??
                 Nana Zhang and   
                  Min Xiong and   
                 Dandan Zhu and   
                    Kun Zhu and   
              Guangtao Zhai and   
                  Xiaokang Yang   Audio-Visual Saliency Prediction Model
                                  with Implicit Neural Representation  . . 117:1--117:??
            Zhenqiang Zhang and   
                     Kun Li and   
              Shengeng Tang and   
                 Yanyan Wei and   
                   Fei Wang and   
               Jinxing Zhou and   
                        Dan Guo   Temporal Boundary Awareness Network for
                                  Repetitive Action Counting . . . . . . . 118:1--118:??
              Zicheng Zhang and   
               Yingjie Zhou and   
                  Chunyi Li and   
                    Wei Sun and   
               Xiongkuo Min and   
               Xiaohong Liu and   
                  Guangtao Zhai   MM-PCQA+: Advancing Multi-Modal Learning
                                  for Point Cloud Quality Assessment . . . 119:1--119:??
                   Xiao Cui and   
                     Qi Sun and   
                   Min Wang and   
                      Li Li and   
               Wengang Zhou and   
                    Houqiang Li   LayoutEnc: Leveraging Enhanced Layout
                                  Representations for Transformer-based
                                  Complex Scene Synthesis  . . . . . . . . 120:1--120:??
     Chintha Sri Pothu Raju and   
       Rabul Hussain Laskar and   
               Zulfiqar Ali and   
                Ghulam Muhammad   Attention-based Fusion for Stroke Lesion
                                  Segmentation on Computed Tomography
                                  Perfusion Data . . . . . . . . . . . . . 121:1--121:??
                Qianxing Li and   
                 Dehui Kong and   
                 Jinghua Li and   
               Dongpan Chen and   
                     Baocai Yin   Multi-Anchor Offset Representation Based
                                  Coarse-to-Fine Diffusion Model for Human
                                  Pose Estimation  . . . . . . . . . . . . 122:1--122:??
                Wasim Ahmad and   
             Yan-Tsung Peng and   
             Yuan-Hao Chang and   
      Gaddisa Olani Ganfure and   
                    Sarwar Khan   CapST: Leveraging Capsule Networks and
                                  Temporal Attention for Accurate Model
                                  Attribution in Deep-fake Videos  . . . . 123:1--123:??
                  Zekun Sun and   
                        Na Ruan   GANK: Dynamic Geometric and Appearance
                                  Features for Efficient and Robust
                                  Detection of Face Forgery  . . . . . . . 124:1--124:??
               Hancheng Zhu and   
                     Li Yan and   
                  Yong Zhou and   
                    Rui Yao and   
                Zhiwen Shao and   
                 Jiaqi Zhao and   
                       Leida Li   Image Cropping with Content and
                                  Composition Attribute-aware Global
                                  Relation Reasoning . . . . . . . . . . . 125:1--125:??
                Wenying Wen and   
                      Yu Ye and   
                  Ziye Yuan and   
                 Baolin Qiu and   
                     Dingli Hua   LFIZW-GRHFMR: Robust Zero-Watermarking
                                  with GRHFMR for Light Field Image  . . . 126:1--126:??
                   Fan Chen and   
                Lingfeng Qu and   
              Hadi Amirpour and   
         Christian Timmerer and   
                     Hongjie He   Counterfeiting Attacks on an RDH-EI
                                  Scheme Based on Block-Permutation and
                                  Co-XOR . . . . . . . . . . . . . . . . . 127:1--127:??
             Shangrong Yang and   
                 Chunyu Lin and   
                  Kang Liao and   
                       Yao Zhao   FishFormer: Annulus Slicing-based
                                  Transformer for Fisheye Rectification    128:1--128:??
                Jiahui Wang and   
                     Qin Xu and   
                   Bo Jiang and   
                        Bin Luo   Transductive Few-shot Learning via Joint
                                  Message Passing and Prototype-based
                                  Soft-label Propagation . . . . . . . . . 129:1--129:??
                   Jie Wang and   
                  Tingfa Xu and   
               Liqiang Song and   
                  Lihe Ding and   
                     Hui Li and   
                 Peng Jiang and   
                   Yuqi Han and   
                      Jianan Li   PAPooling: Graph-based Position Adaptive
                                  Aggregation of Local Geometry in Point
                                  Clouds . . . . . . . . . . . . . . . . . 130:1--130:??
                   Tao Song and   
                Kunlin Yang and   
                   Fan Meng and   
                     Xin Li and   
                 Handan Sun and   
               Chenglizhao Chen   Tropical Cyclone Image Super-Resolution
                                  via Multimodality Fusion . . . . . . . . 131:1--131:??
               Qianjiang Hu and   
                         Wei Hu   Dynamic Point Cloud Denoising via
                                  Gradient Fields  . . . . . . . . . . . . 132:1--132:??
              Jiannan Huang and   
                 Mengxue Qu and   
                 Longfei Li and   
                    Yunchao Wei   AdGPT: Explore Meaningful Advertising
                                  with ChatGPT . . . . . . . . . . . . . . 133:1--133:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 21, Number 5, May, 2025

                   Chao Wen and   
                   Chen Wei and   
                 Yuhua Qian and   
               Xiaodan Song and   
                     Xuemei Xie   Prompt-Based Invertible Mapping
                                  Alignment for Unsupervised Domain
                                  Adaptation . . . . . . . . . . . . . . . 134:1--134:??
              Jiacheng Deng and   
                 Dengpan Ye and   
                   Jizhi Li and   
                   Ziyi Liu and   
                  Long Tang and   
                  Yunming Zhang   The Interpretable and Transferable
                                  Adversarial Attack against Synthetic
                                  Speech Detectors . . . . . . . . . . . . 135:1--135:??
                  Jiawei Ge and   
                 Jiuxin Cao and   
              Xiangmei Chen and   
                 Xuelin Zhu and   
                 Weijia Liu and   
                  Chang Liu and   
                   Kun Wang and   
                         Bo Liu   Beyond Visual Cues: Synchronously
                                  Exploring Target-Centric Semantics for
                                  Vision-Language Tracking . . . . . . . . 136:1--136:??
                 Mengyu Shi and   
                  Miao Wang and   
                    Yujun Zhang   RePC: a Novel Neural Video Quality
                                  Enhancement System Framework for ABR
                                  Streaming of VBR-encoded Videos  . . . . 137:1--137:??
          Rinyoichi Takezoe and   
                   Hao Chen and   
                  Gang Shen and   
                  Xuefei Lv and   
                Yaowei Wang and   
             Shiliang Zhang and   
                    Xiaoyu Wang   Context-Assisted Active Learning for
                                  Weakly Supervised Person Search  . . . . 138:1--138:??
                  Yang Wang and   
               Yixing Zhang and   
                  Xudie Ren and   
                     Yuxin Deng   MoDA: Mixture of Domain Adapters for
                                  Parameter-efficient Generalizable Person
                                  Re-identification  . . . . . . . . . . . 139:1--139:??
                 Jiebin Yan and   
                  Ziwen Tan and   
                  Jiale Rao and   
                     Lei Wu and   
                  Yifan Zuo and   
                    Yuming Fang   Computational Analysis of Degradation
                                  Modeling in Blind Panoramic Image
                                  Quality Assessment . . . . . . . . . . . 140:1--140:??
                Yuchao Feng and   
                Mengjie Qin and   
               Jiawei Jiang and   
                 Jintao Lai and   
                  Jianwei Zheng   Axial-shunted Spatial-temporal
                                  Conversation for Change Detection  . . . 141:1--141:??
                  Wei Jiang and   
                 Jiayu Yang and   
                Yongqi Zhai and   
                   Feng Gao and   
                  Ronggang Wang   MLIC++: Linear Complexity
                                  Multi-Reference Entropy Modeling for
                                  Learned Image Compression  . . . . . . . 142:1--142:??
             Xingjie Zhuang and   
              Fengling Zhou and   
                      Zhixin Li   Multi-Modal Sarcasm Detection via
                                  Knowledge-Aware Focused Graph
                                  Convolutional Networks . . . . . . . . . 143:1--143:??
                     Xu Liu and   
                     Na Xia and   
               Jinxing Zhou and   
                Zhangbin Li and   
                        Dan Guo   Towards Energy-efficient Audio-visual
                                  Classification via Multimodal
                                  Interactive Spiking Neural Network . . . 144:1--144:??
                 Jiebin Yan and   
               Kangcheng Wu and   
                Junjie Chen and   
                  Ziwen Tan and   
                Yuming Fang and   
                      Weide Liu   Viewport-Unaware Blind Omnidirectional
                                  Image Quality Assessment: a Flexible and
                                  Effective Paradigm . . . . . . . . . . . 145:1--145:??
               Xuecheng Hua and   
                   Ke Cheng and   
                   Gege Zhu and   
                      Hu Lu and   
              Yuanquan Wang and   
                   Shitong Wang   Local-Aware Residual Attention Vision
                                  Transformer for Visible-Infrared Person
                                  Re-Identification  . . . . . . . . . . . 146:1--146:??
                Taotao Jing and   
                Haifeng Xia and   
                 Hongfu Liu and   
                 Zhengming Ding   Interpretable Novel Target Discovery
                                  through Open-Set Domain Adaptation . . . 147:1--147:??
             Dengyong Zhang and   
                  Runqi Lou and   
                Jiaxin Chen and   
             Xiangling Ding and   
                   Xin Liao and   
                     Gaobo Yang   Video Frame Interpolation via Fast
                                  Bidirectional $3$D Correlation Volume    148:1--148:??
                   Yan Wang and   
                   Hong Xie and   
                 Jinyang He and   
                 Xiaoyu Shi and   
                Mingsheng Shang   Cross-Domain Semantic Transfer for
                                  Domain Generalization  . . . . . . . . . 149:1--149:??
                   Kang Lin and   
                   Wei Zhou and   
               Zhijie Zheng and   
                  Dihu Chen and   
                         Tao Su   Temporal and Semantic Correlation
                                  Network for Weakly-Supervised Temporal
                                  Action Localization  . . . . . . . . . . 150:1--150:??
                  Zhaoda Ye and   
               Xiangteng He and   
                     Yuxin Peng   RaT2IGen: Relation-aware Text-to-image
                                  Generation via Learnable Prompt  . . . . 151:1--151:??
                 Mohan Zhou and   
                 Yalong Bai and   
                  Qing Yang and   
                    Tiejun Zhao   StyleInject: Parameter Efficient Tuning
                                  of Text-to-Image Diffusion Models  . . . 152:1--152:??
                Dongjian Yu and   
                Weiqing Min and   
                    Xin Jin and   
                 Qian Jiang and   
                   Ying Jin and   
                 Shuqiang Jiang   Diverse and High-Quality Food Image
                                  Generation from Only Food Names  . . . . 153:1--153:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 21, Number 6, June, 2025

                Wei-Yen Hsu and   
                  Yu-Chieh Chen   Multi-Attribute Feature-Aware Network
                                  for Facial Expression Recognition  . . . 154:1--154:??
                 Linlin Fan and   
             Mingliang Zhou and   
                 Xuekai Wei and   
                  Yong Feng and   
                  Tao Xiang and   
                   Bin Fang and   
              Zhaowei Shang and   
                    Fan Jia and   
                  Xu Zhuang and   
                  Huayan Pu and   
                        Jun Luo   Sparse Reduced-Rank Fully Connected
                                  Layers with Its Applications in
                                  Detection and Classification . . . . . . 155:1--155:??
                Davoud Fani and   
 Aliasghar Beheshti-Shirazi and   
          Mohammad Ghanbari and   
              Esmatollah Rezaei   On Temporal Smoothness of Video
                                  Reconstruction Quality in the DCVS via
                                  Non-Uniform Sampling . . . . . . . . . . 156:1--156:??
               I-Chun Huang and   
                  Yuang Shi and   
              Yuan-Chun Sun and   
              Wei Tsang Ooi and   
            Chun-Ying Huang and   
                 Cheng-Hsin Hsu   Composing Error Concealment Pipelines
                                  for Dynamic 3D Point Cloud Streaming . . 157:1--157:??
                     Jie Li and   
                Zhixia Zhao and   
                   Qiyue Li and   
                  Zhixin Li and   
              Pengyuan Zhou and   
                    Zhi Liu and   
                   Hao Zhou and   
                         Zhu Li   VPFormer: Leveraging Transformer with
                                  Voxel Integration for Viewport
                                  Prediction in Volumetric Video . . . . . 158:1--158:??
                Nina Willis and   
          Abraham Bernstein and   
                  Luca Rossetto   Effects of Human Cognition-Inspired Task
                                  Presentation on Interactive Video
                                  Retrieval  . . . . . . . . . . . . . . . 159:1--159:??
              Donglin Zhang and   
              Chang-Xing Li and   
                  Mengke Li and   
                      Zhikai Hu   Discrete Elective Hashing with
                                  Incomplete Labels for Efficient
                                  Cross-Modal Retrieval  . . . . . . . . . 160:1--160:??
                  Bowen Sun and   
                     Guo Lu and   
                   Shibao Zheng   DiFace: Cross-Modal Face Recognition
                                  through Controlled Diffusion . . . . . . 161:1--161:??
                Jiajia Tang and   
                  Binbin Ni and   
                Feiwei Zhou and   
                Dongjun Liu and   
                    Yu Ding and   
                  Yong Peng and   
           Andrzej Cichocki and   
                 Qibin Zhao and   
                   Wanzeng Kong   Fine-grained Semantic Disentanglement
                                  Network for Multimodal Sarcasm Analysis  162:1--162:??
                   Peng Ren and   
                Yunfeng Bai and   
                Xiaoheng Li and   
                    Jinyuan Jia   Semantic-driven Cross-space Graph
                                  Interaction Network for Fine-grained 3D
                                  Point Cloud Understanding  . . . . . . . 163:1--163:??
               Claudio Rota and   
             Marco Buzzelli and   
              Simone Bianco and   
             Raimondo Schettini   Scalable Residual Laplacian Network for
                                  HEVC-compressed Video Restoration  . . . 164:1--164:??
                  Shuo Wang and   
                   Jinda Lu and   
                 Huixia Ben and   
                 Yanbin Hao and   
                 Xingyu Gao and   
                      Meng Wang   Interventional Feature Generation for
                                  Few-shot Learning  . . . . . . . . . . . 165:1--165:??
                   Lisi Wei and   
                  Libo Zhao and   
                   Xiaoli Zhang   MAINet: Modality-Aware Interaction
                                  Network for Medical Image Fusion . . . . 166:1--166:??
                Yuxuan Zhou and   
                Mingyang Li and   
                Jingze Tong and   
                  Linlin Li and   
                    Zhiwei Yang   SD-Meta: The Software-Defined Network of
                                  Human-Centric Metaverse for Multi-Lead
                                  or Multi-Media Data in Spread Spectrum
                                  Communications . . . . . . . . . . . . . 167:1--167:??
                Wazib Ansar and   
           Saptarsi Goswami and   
          Amlan Chakrabarti and   
             Basabi Chakraborty   TexIm FAST: Text-to-Image Encoding for
                                  Semantic Similarity Evaluation of
                                  Disproportionate Sequences . . . . . . . 168:1--168:??
                Qianqian Du and   
                    Hui Yin and   
                   Lang Nie and   
                Yanting Liu and   
                        Jin Wan   EnIter: Enhancing Iterative Multi-View
                                  Depth Estimation with Universal
                                  Contextual Hints . . . . . . . . . . . . 169:1--169:??
                    Tong Wu and   
                 Jinhua Zhu and   
               Wengang Zhou and   
                    Houqiang Li   RESIST: Rationale-Enhanced and Reward
                                  Model-Based End-to-End Social Influence
                                  Dialogue System  . . . . . . . . . . . . 170:1--170:??
               Yongxin Wang and   
                  Feng Dong and   
              Zhen-Duo Chen and   
                    Xin Luo and   
                    Xin-Shun Xu   Domain-Aware Semantic Alignment Hashing
                                  for Large-Scale Zero-Shot Image
                                  Retrieval  . . . . . . . . . . . . . . . 171:1--171:??
                    Jia Cui and   
               Jinchen Shen and   
                 Jialin Wei and   
                  Shiyu Liu and   
                 Zhaojia Ye and   
                Shijian Luo and   
                       Zhen Qin   Community Transferrable Representation
                                  Learning for Image Style Classification  172:1--172:??
                   Qian Yin and   
              Xinfeng Zhang and   
                  Ruoke Yan and   
               Yuhuai Zhang and   
               Shanshe Wang and   
                       Siwei Ma   Joint Structure-Texture Scan-Order for
                                  Point Cloud Attribute Compression Using
                                  Affine Transformation  . . . . . . . . . 173:1--173:??
             Yuanzhou Huang and   
                Songwei Pei and   
                       Rui Zeng   DQFormer: Transformer with Decoupled
                                  Query Augmentations for End-to-End
                                  Multi-Object Tracking  . . . . . . . . . 174:1--174:??
                 Jiahao Lyu and   
                    Jin Wei and   
               Gangyan Zeng and   
                    Zeng Li and   
                   Enze Xie and   
                   Wei Wang and   
                     Can Ma and   
                        Yu Zhou   TextBlockV2: Towards
                                  Precise-Detection-Free Scene Text
                                  Spotting with Pre-trained Language Model 175:1--175:??
              Yang-Hao Zhou and   
                Heyan Huang and   
                 Cunhan Guo and   
              Rong-Cheng Tu and   
                  Zeyu Xiao and   
                    Bo Wang and   
                  Xian-Ling Mao   ALOHA: Adapting Local Spatio-Temporal
                                  Context to Enhance the Audio-Visual
                                  Semantic Segmentation  . . . . . . . . . 176:1--176:??
                  Bing Yang and   
               Xueqin Xiang and   
               Wanzeng Kong and   
              Jianhai Zhang and   
                   Jinliang Yao   Hybrid Feature Integrated Transformer
                                  for 3D Hand Reconstruction from a Single
                                  RGB Image  . . . . . . . . . . . . . . . 177:1--177:??
                Weizhi Xian and   
                 Junyi Wang and   
                 Xuekai Wei and   
                  Jielu Yan and   
              Yueting Huang and   
                 Kunyin Guo and   
                 Weijia Jia and   
                 Mingliang Zhou   DTSD: a Dual Teacher-Student-Based
                                  Discrimination Model for Anomaly
                                  Detection  . . . . . . . . . . . . . . . 178:1--178:??
                  Jili Chen and   
             Qionghao Huang and   
             Changqin Huang and   
                   Xiaodi Huang   Actual Cause-Guided Adaptive Gradient
                                  Scaling for Balanced Multimodal
                                  Sentiment Analysis . . . . . . . . . . . 179:1--179:??
                   Bing Fan and   
                  Feng Ding and   
                  Guopu Zhu and   
                 Jiwu Huang and   
                  Sam Kwong and   
           Pradeep K. Atrey and   
                      Siwei Lyu   Generating Higher-Quality Anti-Forensics
                                  DeepFakes with Adversarial Sharpening
                                  Mask . . . . . . . . . . . . . . . . . . 180:1--180:??
                Zhiyuan Liu and   
                     Qi Zou and   
                   Xixia Xu and   
                    Yanting Pei   Multi-Person Pose Estimation with
                                  Feature Enhancement and Decoupling Based
                                  on Contrastive Learning  . . . . . . . . 181:1--181:??
                Dongjun Liu and   
                Weichen Dai and   
               Honggang Liu and   
                 Hangjie Yi and   
                   Wanzeng Kong   Brain-Machine Cross-Modal Alignment via
                                  Sample Relational Learning for Visual
                                  Classification . . . . . . . . . . . . . 182:1--182:??
               Seongmin Lee and   
                 Jiwoo Kang and   
                   Sanghoon Lee   3D Facial Shape Similarity with Deep
                                  Perceptual Representations . . . . . . . 183:1--183:??

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 21, Number 7, July, 2025

               Kajal Kansal and   
              Yongkang Wong and   
              Mohan Kankanhalli   Implications of Privacy Regulations on
                                  Video Surveillance Systems . . . . . . . 184:1--184:27
                 Yu-Ao Wang and   
                  James She and   
            Troy TianYu Lin and   
                     Kang Zhang   AI Visual Art History: an Art Movement
                                  with Expanded Artistic Horizon . . . . . 185:1--185:16
     Abdulmotaleb El Saddik and   
                Jamil Ahmad and   
             Mustaqeem Khan and   
             Saad Abouzahir and   
                   Wail Gueaieb   Unleashing Creativity in the Metaverse:
                                  Generative AI and Multimodal Content . . 186:1--186:43
          Abdelhak Bentaleb and   
                    May Lim and   
             Sarra Hammoudi and   
                Saad Harous and   
               Roger Zimmermann   Solutions, Challenges, and Opportunities
                                  in Volumetric Video Streaming: an
                                  Architectural Perspective  . . . . . . . 187:1--187:35
               Miaohui Wang and   
               Runnan Huang and   
                 Wuyuan Xie and   
                    Zhan Ma and   
                       Siwei Ma   Compression Approaches for LiDAR Point
                                  Clouds and Beyond: a Survey  . . . . . . 188:1--188:31
              Zicheng Zhang and   
               Yingjie Zhou and   
                  Chunyi Li and   
               Baixuan Zhao and   
               Xiaohong Liu and   
                  Guangtao Zhai   Quality Assessment in the Era of Large
                                  Models: a Survey . . . . . . . . . . . . 189:1--189:31
               Haopeng Wang and   
                Haiwei Dong and   
         Abdulmotaleb El Saddik   Immersive Multimedia Communication:
                                  State-of-the-Art on Extended Reality
                                  Streaming  . . . . . . . . . . . . . . . 190:1--190:33
                     Hao Wu and   
              Maha Abdallah and   
               Yuanfang Chi and   
                  Lehao Lin and   
                        Wei Cai   Web3 Multimedia Applications: Under the
                                  Impact of Decentralization . . . . . . . 191:1--191:38
               Ammar Rashed and   
      Shervin Shirmohammadi and   
                  Ihab Amer and   
                Mohamed Hefeeda   A Review of Player Engagement Estimation
                                  in Video Games: Challenges and
                                  Opportunities  . . . . . . . . . . . . . 192:1--192:33
                   Xin Wang and   
               Ting Yu Tsai and   
                     Li Lin and   
                    Hui Guo and   
                     Shu Hu and   
           Ming-Ching Chang and   
           Pradeep K. Atrey and   
                      Siwei Lyu   Spotting the Fakes: a Deep Dive into
                                  GAN-Generated Face Detection . . . . . . 193:1--193:24
               Xinjie Zhang and   
              Tenggan Zhang and   
                    Lei Sun and   
               Jinming Zhao and   
                        Qin Jin   Exploring Interpretability in Deep
                                  Learning for Affective Computing: a
                                  Comprehensive Review . . . . . . . . . . 194:1--194:28
              Yuanding Zhou and   
                  Xinran Li and   
                Cheng Xiong and   
                   Heng Yao and   
                      Chuan Qin   A Survey of Perceptual Hashing for
                                  Multimedia . . . . . . . . . . . . . . . 195:1--195:28
                Weiqing Min and   
              Xingjian Hong and   
                  Yuxin Liu and   
               Mingyu Huang and   
                   Ying Jin and   
               Pengfei Zhou and   
                    Leyi Xu and   
                 Yilin Wang and   
             Shuqiang Jiang and   
                       Yong Rui   Multimodal Food Learning . . . . . . . . 196:1--196:28
                    Lei Gao and   
                    Kai Liu and   
                  Zheng Guo and   
                      Ling Guan   Mathematics-Inspired Models: a Green and
                                  Interpretable Learning Paradigm for
                                  Multimedia Computing . . . . . . . . . . 197:1--197:22
         Christian Timmerer and   
              Hadi Amirpour and   
          Farzad Tashtarian and   
               Samira Afzal and   
                   Amr Rizk and   
               Michael Zink and   
             Hermann Hellwagner   HTTP Adaptive Streaming: a Review on
                                  Current Advances and Future Challenges   198:1--198:27
Shah Muhammad Imtiyaj Uddin and   
       Rashedul Islam Sumon and   
   Md Ariful Islam Mozumder and   
 Md Kamran Hussin Chowdhury and   
Tagne Poupi Theodore Armand and   
                  Hee Cheol Kim   Innovations and Challenges of AI in
                                  Film: a Methodological Framework for
                                  Future Exploration . . . . . . . . . . . 199:1--199:55
               Ahmed Telili and   
          Wassim Hamidouche and   
              Hadi Amirpour and   
            Sid Ahmed Fezza and   
         Christian Timmerer and   
                     Luce Morin   Convex Hull Prediction Methods for
                                  Bitrate Ladder Construction: Design,
                                  Evaluation, and Comparison . . . . . . . 200:1--200:23
                 Jiaqi Wang and   
           Ricky Y.-K. Kwok and   
               Edith C. H. Ngai   Towards Key Point Identification (KPI)
                                  for Lecture Videos: Approaches and
                                  Performance Evaluation . . . . . . . . . 201:1--201:23
                  Longye Du and   
               Shuaiyu Deng and   
                    Ying Li and   
                     Jun Li and   
                        Qi Tian   A Survey on Composed Image Retrieval . . 202:1--202:27
  Vahdati, Monireh (Monica) and   
             Fedwa Laamarti and   
         Abdulmotaleb El Saddik   Meta-Review of Wearable Devices for
                                  Healthcare in the Metaverse  . . . . . . 203:1--203:36
                  Xuan Shao and   
                  Lin Zhang and   
              Tianjun Zhang and   
                  Shengjie Zhao   Towards a Robust
                                  Visual-Inertial-Surround-View SLAM
                                  System for Autonomous Indoor Parking . . 204:1--204:23
              Zongsheng Cao and   
                Qianqian Xu and   
               Zhiyong Yang and   
                    Yuan He and   
               Xiaochun Cao and   
                 Qingming Huang   GAHE: Geometry-Aware Embedding for
                                  Hyper-Relational Knowledge Graph
                                  Representation . . . . . . . . . . . . . 205:1--205:26
                Jiajie Fang and   
             Mengjuan Jiang and   
                Jiaqing Fan and   
               Bangjun Wang and   
                    Fanzhang Li   Complementarily Learning Decoupled
                                  Category-Region-Aware Prototype for
                                  Few-Shot Classification  . . . . . . . . 206:1--206:22
                  Zheng Liu and   
                 Kunyu Yang and   
                    Yu Weng and   
                   Zheng He and   
                   Xuan Liu and   
                    Honghao Gao   SCAG: Semantic Co-occurring Attention
                                  Guided Alignment for Knowledge-based
                                  Visual Question Answering  . . . . . . . 207:1--207:20
                 Weiyu Wang and   
               Chunmei Qing and   
                Junpeng Tan and   
                    Xiangmin Xu   Multi-view Panoramic Image Style
                                  Transfer with Multi-scale Attention and
                                  Global Sharing . . . . . . . . . . . . . 208:1--208:19
                   Lu Zhang and   
                    Rui Yao and   
               Yuhong Zhang and   
                  Yong Zhou and   
                  Fuyuan Hu and   
                 Jiaqi Zhao and   
                    Zhiwen Shao   Historical Object-Aware Prompt Learning
                                  for Universal Hyperspectral Object
                                  Tracking . . . . . . . . . . . . . . . . 209:1--209:20
                 Alain Aoun and   
            Mahmoud Masadeh and   
                Sofi\`ene Tahar   ML-based Load Value Approximator for
                                  Efficient Multimedia Processing  . . . . 210:1--210:18
                  Fubin Guo and   
                    Qi Wang and   
              Qingshan Wang and   
                     Sheng Chen   Accurate Hand Modeling in Whole-Body
                                  Mesh Reconstruction Using Joint-Level
                                  Features and Kinematic-Aware Topology    211:1--211:23
                 Zhipeng Yu and   
                Zimeng Zhao and   
                   Yanxi Du and   
               Yuzhou Zheng and   
                Binghui Zuo and   
                   Yangang Wang   T2C: Text-guided 4D Cloth Generation . . 212:1--212:19
                     Yue Li and   
                   Junru Li and   
                 Chaoyi Lin and   
                  Kai Zhang and   
                   Li Zhang and   
              Franck Galpin and   
              Thierry Dumas and   
               Hongtao Wang and   
             Muhammed Coban and   
           Jacob Ström and   
                     Du Liu and   
              Kenneth Andersson   Advanced Neural Network-Based Video
                                  Coding Technologies for Intra Prediction
                                  and In-Loop Filtering  . . . . . . . . . 213:1--213:23

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 21, Number 9, September, 2025

            Carsten Griwodz and   
                   Mea Wang and   
               Roger Zimmermann   Introduction to the Special Issue on
                                  MMSys 2023 and NOSSDAV 2023  . . . . . . 244:1--244:4
                      Na Li and   
                 Zichen Zhu and   
                  Sheng Wei and   
                        Yao Liu   EVASR: Edge-Based Salience-Aware
                                  Super-Resolution for Enhanced Video
                                  Quality and Power Efficiency . . . . . . 245:1--245:24
               Bruno Kimura and   
              Simone Ferlin and   
               Thomas Paiva and   
            Toktam Mahmoodi and   
             Anna Brunstrom and   
                      Ozgu Alay   Evaluating Adaptive Video Streaming over
                                  Multipath QUIC with Shared Bottleneck
                                  Detection  . . . . . . . . . . . . . . . 246:1--246:25
                 Ila Gokarn and   
                  Yigong Hu and   
           Tarek Abdelzaher and   
                   Archan Misra   RA-MOSAIC: Resource Adaptive Edge AI
                                  Optimization over Spatially Multiplexed
                                  Video Streams  . . . . . . . . . . . . . 247:1--247:25
                   Jiaxi Li and   
               Jingwei Liao and   
                    Bo Chen and   
                 Anh Nguyen and   
               Aditi Tiwari and   
                  Qian Zhou and   
               Zhisheng Yan and   
                Klara Nahrstedt   ST-360: Spatial-Temporal Filtering-Based
                                  Low-Latency 360-Degree Video Analytics
                                  Framework  . . . . . . . . . . . . . . . 248:1--248:25
Gabriel de Castro Araújo and   
  Henrique Domingues Garcia and   
      Myl\`ene C. Q. Farias and   
               Ravi Prakash and   
            Marcelo M. Carvalho   A 360-degree Video Player for Dynamic
                                  Video Editing Applications . . . . . . . 249:1--249:23
            Michael Rudolph and   
          Stefan Schneegass and   
                       Amr Rizk   Transcoding V-PCC Point Cloud Streams in
                                  Real-time  . . . . . . . . . . . . . . . 250:1--250:22
                   Hao Fang and   
               Haoyuan Zhao and   
                  Feng Wang and   
              Yi Ching Chou and   
                  Long Chen and   
                Jianxin Shi and   
                 Jiangchuan Liu   Streaming Media over LEO Satellite
                                  Networking: a Measurement-Based Analysis
                                  and Optimization . . . . . . . . . . . . 251:1--251:24
              Zoubida AMEUR and   
Claire-Hél\`ene Demarty and   
            Olivier Le Meur and   
           Daniel Ménard   Style-FG: a Style-based Framework for
                                  Film Grain Analysis and Synthesis  . . . 252:1--252:24
              Raphael Abreu and   
            Joel dos Santos and   
          Gheorghita Ghinea and   
Débora C. Muchaluat-Saade   Assessing Usefulness, Ease of Use, and
                                  Recognition Performance of
                                  Semi-Automatic Mulsemedia Authoring  . . 253:1--253:19
               Silvia Rossi and   
                Irene Viola and   
                 Laura Toni and   
                    Pablo Cesar   A Clustering Approach to Unveil User
                                  Similarities in 6 df Extended Reality
                                  Applications . . . . . . . . . . . . . . 254:1--254:27
                 Vijay John and   
             Yasutomo Kawanishi   Multimodal Cascaded Framework with
                                  Multimodal Latent Loss Functions Robust
                                  to Missing Modalities  . . . . . . . . . 255:1--255:21
                Kuan-Yu Lee and   
            Ashutosh Singla and   
                Pablo Cesar and   
                 Cheng-Hsin Hsu   Adaptive Cloud VR Gaming Optimized by
                                  Gamer QoE Models . . . . . . . . . . . . 256:1--256:24
                Yuqing Yang and   
                 Anh Nguyen and   
                   Zhisheng Yan   A Patch Can Disrupt Live Video
                                  Streaming: Physical Adversarial Attacks
                                  on Deep Learning Compression . . . . . . 257:1--257:23
                  Xiaoye Qu and   
                Qiyuan Chen and   
                    Wei Wei and   
                Jiashuo Sun and   
                Daizong Liu and   
                  Jianfeng Dong   Alleviating Hallucination in Large
                                  Vision-Language Models with Active
                                  Retrieval Augmentation . . . . . . . . . 258:1--258:22
                   Bing Liu and   
                Wenjie Yang and   
               Mingming Liu and   
                    Hao Liu and   
                  Yong Zhou and   
                       Peng Liu   Syntactic-Conditional Diffusion Networks
                                  for Controllable Image Captioning  . . . 259:1--259:25
                  Liyong Xu and   
                 Yifan Jiao and   
                   Bing-Kun Bao   Bool Prompt with Decomposition and
                                  Enhancement: Zero-Shot VQA Based on
                                  PVLMs  . . . . . . . . . . . . . . . . . 260:1--260:21
                  Pengyu Li and   
                  Cheolkon Jung   MRFGNet: Multiscale Reference Frame
                                  Generation Network for VVC Inter-Coding  261:1--261:20
                  Guiyu Xia and   
                Zhedong Jin and   
              Dongdong Fang and   
                      Yubao Sun   Source Information-Assisted UV-Space
                                  Transformation Network for Person Image
                                  Generation . . . . . . . . . . . . . . . 262:1--262:16
                  Junle Liu and   
                  Yun Zhang and   
                   Zixi Guo and   
              Xiaoxia Huang and   
                   Gangyi Jiang   Multiscale Feature Importance-Based Bit
                                  Allocation for End-to-End Feature Coding
                                  for Machines . . . . . . . . . . . . . . 263:1--263:19
                  Hefeng Ji and   
                  Jing Xiao and   
                 Jiefan Lin and   
                  Jimin Liu and   
                     Haoyong Yu   Intelligent Tumor Synthesis Based on
                                  Medical Image Knowledge for Liver Tumor
                                  Segmentation . . . . . . . . . . . . . . 264:1--264:23
                   Hao Ding and   
                   Jing Sun and   
                   Rui Long and   
             Xiaoping Jiang and   
               Hongling Shi and   
                 Yuting Qin and   
                  Zongze Li and   
                    Jian-Jin Li   Visible-Infrared Person
                                  Re-Identification Based on Feature
                                  Decoupling and Refinement  . . . . . . . 265:1--265:16
             Sanhita Pathak and   
              Vinay Kaushik and   
                   Brejesh Lall   Garment Recycle Training and Conditional
                                  Garment-Person Outline Attention-Guided
                                  Virtual Tryon  . . . . . . . . . . . . . 266:1--266:26
                  Zishan Xu and   
             Xiaofeng Zhang and   
                Yuqing Yang and   
                   Wei Chen and   
                Jueting Liu and   
                Tingting Xu and   
                 Zehua Wang and   
         Abdulmotaleb El Saddik   MuralAgent: Enhancing Ancient Mural
                                  Outpainting with RAG-Based Texts and
                                  Multimodal Integration . . . . . . . . . 267:1--267:17
              Hanzhang Wang and   
                Haoran Wang and   
                Zhongrui Yu and   
               Mingming Sun and   
               Junjun Jiang and   
               Xianming Liu and   
                    Deming Zhai   FAST: Flexibly Controllable Arbitrary
                                  Style Transfer via Latent Diffusion
                                  Models . . . . . . . . . . . . . . . . . 268:1--268:20
              Zhichao Zhang and   
                    Wei Sun and   
                  Li Xinyue and   
                    Jun Jia and   
               Xiongkuo Min and   
              Zicheng Zhang and   
                  Chunyi Li and   
                Zijian Chen and   
                  Wang Puyi and   
                 Sun Fengyu and   
              Jui Shangling and   
                  Guangtao Zhai   Benchmarking Multi-dimensional AIGC
                                  Video Quality Assessment: a Dataset and
                                  Unified Model  . . . . . . . . . . . . . 269:1--269:24
                Bowen Huang and   
               Yanwei Zheng and   
               Chuanlin Lan and   
               Dongchen Sui and   
               Xinpeng Zhao and   
                 Xiao Zhang and   
               Mengbai Xiao and   
                    Dongxiao Yu   Action-Aware Visual-Textual Alignment
                                  for Long-Instruction Vision-and-Language
                                  Navigation . . . . . . . . . . . . . . . 270:1--270:22
                Chenyang Lu and   
                 Zhikai Wei and   
                 Huapeng Wu and   
                     Le Sun and   
                  Tianming Zhan   KANformer: Dual-Priors-Guided Low-Light
                                  Enhancement via KAN and Transformer  . . 271:1--271:20
                  Xiang Guo and   
                  Ruimin Hu and   
             Dong Liang Zhu and   
                       Mei Wang   Uniform Light Transformer for Person
                                  Re-identification under Complex
                                  Illumination . . . . . . . . . . . . . . 272:1--272:18
                    Xin Liu and   
                  Qiya Song and   
                   Lin Xiao and   
                  Chun Wang and   
                    Xieping Gao   LPIC: Learnable Prompts and ID-guided
                                  Contrastive Learning for Multimodal
                                  Recommendation . . . . . . . . . . . . . 273:1--273:16

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 21, Number 10, October, 2025

                Alex Falcon and   
             Giuseppe Serra and   
            Sergio Escalera and   
                   Michael Wray   Introduction to the Special Issue on
                                  Text-Multimedia Retrieval: Retrieving
                                  Multimedia Data by Means of Natural
                                  Language . . . . . . . . . . . . . . . . 274:1--274:4
                 Shiping Ge and   
               Zhiwei Jiang and   
                 Yafeng Yin and   
                  Cong Wang and   
               Zifeng Cheng and   
                        Qing Gu   Fine-Grained Alignment Network for
                                  Zero-Shot Cross-Modal Retrieval  . . . . 275:1--275:24
                    Suyi Li and   
               Chenyi Jiang and   
               Shidong Wang and   
                  Yang Long and   
                Zheng Zhang and   
                  Haofeng Zhang   Contextual Interaction via
                                  Primitive-based Adversarial Training for
                                  Compositional Zero-shot Learning . . . . 276:1--276:24
                    Ying Li and   
                   Yuxiang Ding   MoHGCN: Momentum Hypergraph Convolution
                                  Network for Cross-modal Retrieval  . . . 277:1--277:21
             Suncheng Xiang and   
              Jingsheng Gao and   
                 Mingye Xie and   
              Mengyuan Guan and   
              Jiacheng Ruan and   
                      Yuzhuo Fu   Learning Visual-Semantic Embedding for
                                  Generalizable Person Re-Identification:
                                  a Unified Perspective  . . . . . . . . . 278:1--278:17
                 Renjie Pan and   
                   Hua Yang and   
                   Xiangyu Zhao   ReAL: Improving Image-Text Retrieval
                                  with Authentic Negative Repository
                                  Learning . . . . . . . . . . . . . . . . 279:1--279:22
            Shunxiang Zhang and   
                 Jiajia Liu and   
                Yixuan Jiao and   
                Yulei Zhang and   
                   Lei Chen and   
                   Kuanching Li   A Multimodal Semantic Fusion Network
                                  with Cross-Modal Alignment for
                                  Multimodal Sentiment Analysis  . . . . . 280:1--280:22
               Alex Ergasti and   
           Tomaso Fontanini and   
            Claudio Ferrari and   
           Massimo Bertozzi and   
                   Andrea Prati   MARS: Paying More Attention to Visual
                                  Attributes for Text-Based Person Search  281:1--281:22
           Taichi Nishimura and   
               Shota Nakada and   
                Masayoshi Kondo   Vision-Language Models Learn Super
                                  Images for Efficient Partially Relevant
                                  Video Retrieval  . . . . . . . . . . . . 282:1--282:22
                  Liming Xu and   
                   Hanqi Li and   
                   Jie Shao and   
               Xianhua Zeng and   
                    Weisheng Li   Multi-scale Consistency Deep Lifelong
                                  Cross-modal Hashing  . . . . . . . . . . 283:1--283:23
                  Liming Xu and   
              Dengping Zhao and   
                   Hanqi Li and   
               Xianhua Zeng and   
                  Bochuan Zheng   Deep Differential Lifelong Cross-modal
                                  Hashing for Stream Medical Data
                                  Retrieval  . . . . . . . . . . . . . . . 284:1--284:23
                  Qun Zhang and   
                  Chao Yang and   
                  Bin Jiang and   
                    Bolin Zhang   Multi-Grained Alignment with Knowledge
                                  Distillation for Partially Relevant
                                  Video Retrieval  . . . . . . . . . . . . 285:1--285:22
                 Hongyi Zhu and   
             Jia-Hong Huang and   
                Yixian Shen and   
             Stevan Rudinac and   
             Evangelos Kanoulas   Interactive Image Retrieval Meets Query
                                  Rewriting with Large Language and Vision
                                  Language Models  . . . . . . . . . . . . 286:1--286:23
                Sina Ehsani and   
                       Jian Liu   Elevating Textual Question Answering
                                  with On-Demand Visual Augmentation . . . 287:1--287:25
         Diego Gragnaniello and   
              Antonio Greco and   
              Carlo Sansone and   
                    Bruno Vento   Video Fire Recognition Using Zero-Shot
                                  Vision-Language Models Guided by a
                                  Task-Aware Object Detector . . . . . . . 288:1--288:24
             Nicola Messina and   
            Jan Sedmidubsky and   
            Fabrizio Falchi and   
             Tomás Rebok   Joint-Dataset Learning and
                                  Cross-Consistent Regularization for
                                  Text-to-Motion Retrieval . . . . . . . . 289:1--289:24
                Jianbo Song and   
                 Hong Zhang and   
                Yachun Feng and   
                Hanyang Liu and   
                     Yifan Yang   Language-guided Visual Tracking:
                                  Comprehensive and Effective Multimodal
                                  Information Fusion . . . . . . . . . . . 290:1--290:23
        Divya Arora Bhayana and   
               Om Prakash Verma   Trans-Convo-Former Net for Hierarchical
                                  Prediction of Household Images . . . . . 291:1--291:21
                  Xiaobo Hu and   
                Youfang Lin and   
                Jinwen Wang and   
                    Yue Liu and   
                  Shuo Wang and   
                   Hehe Fan and   
                         Kai Lv   Learning Robust Representations via
                                  Bidirectional Transition for Visual
                                  Reinforcement Learning . . . . . . . . . 292:1--292:24
             Mingliang Zhou and   
                  Shuqi Han and   
                    Jun Luo and   
                  Xu Zhuang and   
                    Qin Mao and   
                    Zhengguo Li   Transformer-Based and Structure-Aware
                                  Dual-Stream Network for Low-Light Image
                                  Enhancement  . . . . . . . . . . . . . . 293:1--293:24
                   Yuan Cao and   
                      Dong Wang   Dual-Branch Cross-Layer Information Flow
                                  Network for Camouflaged Object Detection
                                  in Complex Scenes  . . . . . . . . . . . 294:1--294:19
                  Haojie Li and   
                   Hao Chen and   
               Yining Huang and   
              Tianshui Chen and   
               Shuangping Huang   Enhancing Lip Dynamic Authenticity:
                                  Learning 3D Temporal Representations for
                                  Talking Head Synthesis . . . . . . . . . 295:1--295:21
                 Zhili Zhou and   
             Wensheng Zhang and   
                Zhengdao Li and   
                  Huilin Ge and   
                    Bin Qiu and   
               Fengjun Xiao and   
                 Yongfeng Huang   Progressive Generative Steganography via
                                  High-Resolution Image Generation for
                                  Covert Communication . . . . . . . . . . 296:1--296:23
              Jingtian Wang and   
                Xiaolong Li and   
                     Bin Ma and   
                       Yao Zhao   Boosting Transferability of Adversarial
                                  Examples with Spatio-Temporal Context    297:1--297:22
                     Xu Guo and   
                 Tong Zhang and   
                 Fuyun Wang and   
                Xudong Wang and   
               Xiaoya Zhang and   
                    Xin Liu and   
                       Zhen Cui   MMHCL: Multi-Modal Hypergraph
                                  Contrastive Learning for Recommendation  298:1--298:23
                   Xiao Pan and   
               Zongxin Yang and   
                  Shuai Bai and   
                        Yi Yang   GD-NeRF: Generative Detail Compensation
                                  for One-shot Generalizable Neural
                                  Radiance Fields  . . . . . . . . . . . . 299:1--299:24
               Jiacheng Yao and   
                 Jing Zhang and   
              Shuying Zhang and   
                        Li Zhuo   Cross-Modal Tri-Semantic
                                  Correlation-CLIP for Short Video
                                  Homogenization Recognition . . . . . . . 300:1--300:23
                Zhiwen Shao and   
                Yifan Cheng and   
                  Fan Zhang and   
                Xuehuai Shi and   
                  Canlin Li and   
                Lizhuang Ma and   
                  Dit-Yan Yeung   Micro-Expression Recognition via
                                  Fine-Grained Dynamic Perception  . . . . 301:1--301:23
                    Yue Liu and   
                Zhangkai Ni and   
                Peilin Chen and   
                 Shiqi Wang and   
              Xinfeng Zhang and   
                 Hanli Wang and   
                      Sam Kwong   EIN: Exposure-Induced Network for
                                  Single-Image HDR Reconstruction  . . . . 302:1--302:23
            Ali Ghorbanpour and   
         Mohammad Amin Arab and   
                Mohamed Hefeeda   RDIAS: Robust and Decentralized Image
                                  Authentication System  . . . . . . . . . 303:1--303:28

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 21, Number 11, November, 2025

                    Hao Fei and   
                     Wei Ji and   
                 Yinwei Wei and   
              Zhedong Zheng and   
                Jialie Shen and   
              Alan Hanjalic and   
               Roger Zimmermann   Introduction to the Special Issue on
                                  Deep Multimodal Generation and Retrieval 304:1--304:13
                  Haoran Li and   
                 Yanbin Hao and   
                  Jiarui Yu and   
                    Bin Zhu and   
                  Shuo Wang and   
                        Tong Xu   CVLP-NaVD: Contrastive Visual-language
                                  Pre-training Models for Non-annotated
                                  Visual Description . . . . . . . . . . . 305:1--305:23
             Jianrong Zhang and   
                   Hehe Fan and   
                        Yi Yang   Protein Captioning: Bridging the Gap
                                  between Protein Sequences and Natural
                                  Languages  . . . . . . . . . . . . . . . 306:1--306:23
                    Bobo Li and   
                    Hao Fei and   
                     Fei Li and   
              Tat-Seng Chua and   
                    Donghong Ji   Multimodal Emotion-Cause Pair Extraction
                                  with Holistic Interaction and Label
                                  Constraint . . . . . . . . . . . . . . . 307:1--307:19
                   Tao Chen and   
                Enwei Zhang and   
                 Yuting Gao and   
                      Ke Li and   
                   Xing Sun and   
                  Yan Zhang and   
                     Hui Li and   
                    Rongrong Ji   MMICT: Boosting Multi-Modal Fine-Tuning
                                  with In-Context Examples . . . . . . . . 308:1--308:17
               Xiaojie Zhou and   
                    Hang Yu and   
              Shengjie Yang and   
                   Jing Huo and   
                   Pinzhuo Tian   Learning from Orthogonal Space with
                                  Multimodal Large Models for Generalized
                                  Few-shot Segmentation  . . . . . . . . . 309:1--309:22
                     Yu Liu and   
               Haipeng Chen and   
                  Guihe Qin and   
                Jincai Song and   
                       Xun Yang   Bias Mitigation and Representation
                                  Optimization for Noise-Robust
                                  Cross-Modal Retrieval  . . . . . . . . . 310:1--310:17
                  Peng Wang and   
             Yongheng Zhang and   
                    Hao Fei and   
               Qiguang Chen and   
                 Yukai Wang and   
                Jiasheng Si and   
                 Wenpeng Lu and   
                     Min Li and   
                       Libo Qin   S 3 Agent: Unlocking the Power of VLLM
                                  for Zero-Shot Multi-Modal Sarcasm
                                  Detection  . . . . . . . . . . . . . . . 311:1--311:16
                   Yuan Gan and   
                Ruijie Quan and   
                      Yawei Luo   ExpAvatar: High-Fidelity Avatar
                                  Generation of Unseen Expressions with 3D
                                  Face Priors  . . . . . . . . . . . . . . 312:1--312:21
                  Haoyu Cai and   
                  Wenqi Lou and   
                  Chao Wang and   
                    Xuehai Zhou   Picasso: Analyzing Prompt Design for
                                  Text-to-Image Generative Diffusion
                                  Models from a Temporal-Spatial
                                  Perspective  . . . . . . . . . . . . . . 313:1--313:24
               Hongchen Wei and   
                 Zhenzhong Chen   Improving Domain Generalization for
                                  Image Captioning with Unsupervised
                                  Prompt Learning  . . . . . . . . . . . . 314:1--314:23
                   Yiwei Ma and   
                  Yijun Fan and   
                   Jiayi Ji and   
                Haowei Wang and   
                Haibing Yin and   
              Xiaoshuai Sun and   
                    Rongrong Ji   Creating High-Quality 3D Content by
                                  Bridging the Gap between Text-to-2D and
                                  Text-to-3D Generation  . . . . . . . . . 315:1--315:23
                   Fei Shen and   
                  Xiaoyu Du and   
                Liyan Zhang and   
                Xiangbo Shu and   
                    Jinhui Tang   Triplet Contrastive Representation
                                  Learning for Unsupervised Vehicle
                                  Re-Identification  . . . . . . . . . . . 316:1--316:23
                  Shilin Qu and   
               Weiqing Wang and   
                   Xin Zhou and   
                Haolan Zhan and   
                  Zhuang Li and   
                  Lizhen Qu and   
                 Linhao Luo and   
               Yuan-Fang Li and   
             Gholamreza Haffari   Scalable Frame-Based Construction of
                                  Sociocultural Norm Bases for Socially
                                  Aware Dialogues  . . . . . . . . . . . . 317:1--317:17
                    Kai Cui and   
               Shenghao Liu and   
                   Wei Feng and   
               Xianjun Deng and   
               Liangbin Gao and   
               Minmin Cheng and   
                 Hongwei Lu and   
               Laurence T. Yang   Correlation-Aware Cross-Modal Attention
                                  Network for Fashion Compatibility
                                  Modeling in UGC Systems  . . . . . . . . 318:1--318:24
                 Zhulin Tao and   
                 Runze Zhao and   
                    Xin Shi and   
                 Xingyu Gao and   
                    Xi Wang and   
                 Xianglin Huang   Multimodal Consistency Suppression
                                  Factor for Fake News Detection . . . . . 319:1--319:19
                  Yue Zhang and   
                  Chao Wang and   
                   Fei Fang and   
               Yunzhi Zhuge and   
                   Hehe Fan and   
              Xiaojun Chang and   
                 Cheng Deng and   
                        Yi Yang   SAMControl: Controlling Pose and Object
                                  for Image Editing with Soft Attention
                                  Mask . . . . . . . . . . . . . . . . . . 320:1--320:28
            Zhengwentai Sun and   
              Yanghong Zhou and   
                      P. Y. Mok   CoDE-GAN: Content Decoupled and Enhanced
                                  GAN for Sketch-guided Flexible Fashion
                                  Editing  . . . . . . . . . . . . . . . . 321:1--321:24
                  Leqi Shen and   
               Sicheng Zhao and   
               Yifeng Zhang and   
              Pengzhang Liu and   
                Yongjun Bao and   
                  Guiguang Ding   Spatio-Temporal Attention for Text-Video
                                  Retrieval  . . . . . . . . . . . . . . . 322:1--322:20
               Weipeng Jing and   
                Peilun Kang and   
                 Donglin Di and   
                  Juntao Gu and   
                  Linhui Li and   
               Mahmoud Emam and   
             Linda Mohaisen and   
                   Xun Yang and   
                        Chao Li   SRF: SpectrumRecombineFormer for
                                  Hyperspectral Image Classification . . . 323:1--323:25
                  Yiming Wu and   
                   Qihe Pan and   
                  Zhen Zhao and   
               Zicheng Wang and   
                 Sifan Long and   
                  Ronghua Liang   SOEDiff: Efficient Distillation for
                                  Small Object Editing . . . . . . . . . . 324:1--324:19
                  Licun Dai and   
                Zhiming Luo and   
               Yongguo Ling and   
               Jiaxing Chai and   
                      Shaozi Li   Dual-Modality-Shared Learning and Label
                                  Refinement for Unsupervised
                                  Visible-Infrared Person ReID . . . . . . 325:1--325:24
                Mohd Faisal and   
Roberto Alejandro Martinez Velazquez and   
             Fedwa Laamarti and   
           Hussein Al Osman and   
         Abdulmotaleb El Saddik   Haptic Network Protocols: a
                                  Comprehensive Review and Directions for
                                  Next-Gen Metaverse Applications  . . . . 326:1--326:33
                 Yahui Wang and   
             Mohsen Guizani and   
              M. Shamim Hossain   Artificial Intelligence for Virtual
                                  Reality: State of the Art, Challenges,
                                  and Future Perspectives  . . . . . . . . 327:1--327:29
             Yuanhong Zhong and   
                     Ge Yan and   
                  Ruyue Zhu and   
                   Ping Gan and   
                    Xuerui Shen   Early Traffic Accident Anticipation via
                                  Feature Consistency Representation and
                                  Soft Label Regression  . . . . . . . . . 328:1--328:21
                     Hao Fu and   
                Fengyu Yang and   
                Boyang Wang and   
                     Wei Ji and   
                Hanbin Zhao and   
                 Chao Zhang and   
           Roger Zimmermann and   
                       Hui Qian   Visuo-Tactile Class-Incremental Learning 329:1--329:19
                 Dikai Fang and   
                   Huahu Xu and   
                Yuzhe Huang and   
                    Honghao Gao   ADTC: Adaptive Dual-Stage Tree
                                  Construction for Point-Supervised Video
                                  Moment Retrieval . . . . . . . . . . . . 330:1--330:27
                 Jiaxu Kang and   
                 Bolei Chen and   
                 Ping Zhong and   
                 Yifei Wang and   
                Haonan Yang and   
                       Yu Sheng   Unbiased Embodied Visual Representation
                                  Learning with Causal Inference and
                                  Cross-Modality Alignment . . . . . . . . 331:1--331:23
               Minglong Xue and   
                 Jinhong He and   
                Wenhai Wang and   
                 Mingliang Zhou   Low-Light Image Enhancement via
                                  CLIP-Fourier Guided Wavelet Diffusion    332:1--332:22
                    Jing Li and   
                    Jun Guo and   
          M. Shamim Hossain and   
                        Ning Yu   Immersive Ink-and-Wash Landscape Design
                                  in Multimedia for Art Therapy  . . . . . 333:1--333:26

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 21, Number 12, December, 2025

Menghao Hu and
Yaguang Song and
Xiaoshan Yang and
Yaowei Wang and
Changsheng Xu Health-oriented Multimodal Food Question
Answering with Implicit and Explicit
Knowledge . . . . . . . . . . . . . . . 334:1--334:25
Peng Ren and
Xiaoheng Li and
Yunfeng Bai and
Jinyuan Jia Correlation-guided Masked Autoencoder
with Multimodal Contrastive Interaction
on Point Clouds . . . . . . . . . . . . 335:1--335:23
Yang Wen and
Shunzhe Shen and
Wuzhen Shi and
Wenming Cao and
Lei Bi and
Xiaokang Yang and
Bin Sheng A Lightweight Depthwise Separable
ConvNet with Frequency-domain
Enhancement for Retinal Vessel
Segmentation . . . . . . . . . . . . . . 336:1--336:23
Yang Wen and
Xiang-Ning Wang and
Jixue Tang and
Ping Li and
Lei Zhu and
Jing Qin and
Xiaokang Yang and
Bin Sheng CCM-Net: Contrastive and Consistent
Multi-Task Network for Artifact
Segmentation and Quality Classification
of OCTA Images . . . . . . . . . . . . . 337:1--337:21
Lin Bie and
Siqi Li and
Xiaopin Zhong and
Zongze Wu and
Yue Gao Multi-space Representation Fusion
Enhanced Monocular Depth Estimation via
Virtual Point Cloud . . . . . . . . . . 338:1--338:22
Tang-Chen Chang and
Duen-Chian Jheng and
Hsuan-Ya Liang and
Bill Louis Harchan and
Pu Ching and
Tsung-Hsun Tsai and
Chih-Yi Chang and
Te-Cheng Wu and
Yung-Hui Li and
Tse-Yu Pan and
Hung-Kuo Chu and
Min-Chun Hu Fine-grained Stroke Recognition in
Broadcast Table Tennis Videos with ATDT 339:1--339:24
Yi Jie Wong and
Mau-Luen Tham and
Ban-Hoe Kwan and
Yoong Choon Chang and
Anissa Mokraoui and
Feng Ke Efficient Client Selection for
Asynchronous Federated Learning for
Adaptive Bitrate Streaming . . . . . . . 340:1--340:22
Xingbo Liu and
Zhijie Zhao and
Xuening Zhang and
Xiao Kang and
Xiushan Nie Online Hashing with Discriminative
Attribute Embedding . . . . . . . . . . 341:1--341:20
Zihan Yuan and
Li Li and
Zichi Wang and
Xinpeng Zhang Integrity Protection of Generative
Adversarial Networks Using Fragile
Watermarking . . . . . . . . . . . . . . 342:1--342:21
Chongwei Liu and
Haojie Li and
Zhihui Wang and
Rui Xu Is a Pure Transformer Effective for
Separated and Online Multi-Object
Tracking? . . . . . . . . . . . . . . . 343:1--343:21
Mingming Zhang and
Qingjie Liu and
Yunhong Wang CtxMIM: Context-Enhanced Masked Image
Modeling for Remote Sensing Image
Understanding . . . . . . . . . . . . . 344:1--344:22
Wenzhuo Ma and
Zhenzhong Chen Diffusion-based Perceptual Neural Video
Compression with Temporal Diffusion
Information Reuse . . . . . . . . . . . 345:1--345:22
Xenios Milidonis and
Alessandro Artusi and
Francesco Banterle Deep Chroma Compression of Tone-Mapped
Images . . . . . . . . . . . . . . . . . 346:1--346:17
Jiesheng Wu and
Fangwei Hao and
Jing Xu Boosting Foreground-Background
Disentanglement for Camouflaged Object
Detection . . . . . . . . . . . . . . . 347:1--347:23
Jinming Cao and
Sicheng Shen and
Qiu Zhou and
Yifang Yin and
Yangyan Li and
Roger Zimmermann ShapeMoiré: Channel-Wise Shape-Guided
Network for Image Demoiréing . . . . . . 348:1--348:20
Xinran Wang and
Zhiqiang Tian and
Lin Bie and
Siqi Li and
Dejian Guo and
Shaoyi Du and
Yue Gao Arbitrary Large-Scale Scene
Reconstruction without Annotated Block
Partitions . . . . . . . . . . . . . . . 349:1--349:19
Zijie Zhou and
Mingliang Zhou and
Jun Luo and
Huayan Pu and
Leong Hou U. and
Xuekai Wei and
Weijia Jia VideoGNN: Video Representation Learning
via Dynamic Graph Modelling . . . . . . 350:1--350:22
Shijuan Huang and
Zongyi Li and
Hefei Ling and
Jianbo Li Cross-Modality Relation and Uncertainty
Exploration for Text-Based Person Search 351:1--351:20
Huiyi Zhou and
Feng Zhao and
Chunhai Li Multi-scale Historical Trajectory
Decomposition for Viewport Prediction in
360-degree Videos . . . . . . . . . . . 352:1--352:22
Chengmei Yang and
Qian Li and
Zhenyang Li and
Chen Ma and
Lianghua He R-HMF: a Relation-enhanced Hierarchical
Multimodal Framework for Few-shot
Knowledge Graph Completion . . . . . . . 353:1--353:24
Xueyan Cao and
Tao Lin and
Liping Zhao and
Wei Han and
Shanshe Wang and
Kailun Zhou and
Yufen Yang Ultra-Fast Intra Screen Content Coding
via Accelerated Re-Visit CU-Coding in
AVS3 . . . . . . . . . . . . . . . . . . 354:1--354:23
Ziyang Song and
Ruijie Zhu and
Jing Wang and
Chuxin Wang and
Jianfeng He and
Jiacheng Deng and
Wenfei Yang and
Tianzhu Zhang ER-Depth: Enhancing the Robustness of
Self-Supervised Monocular Depth
Estimation in Challenging Scenes . . . . 355:1--355:23
Chuanle Song and
Wei Zhou and
Han Jiao and
Wenjin Huang and
Junfeng Li and
Yihua Huang HIN: Hierarchical Interaction Network
for Image Captioning . . . . . . . . . . 356:1--356:22
Jiakai He and
Yiming Yang and
Haifeng Hu and
Ruixing Wu Unsupervised Visible-Infrared Person
ReID via Modality-Camera Balance Label
Refinement . . . . . . . . . . . . . . . 357:1--357:24
Xuelin Zhu and
Jian Liu and
Dongqi Tang and
Jiawei Ge and
Weijia Liu and
Bo Liu and
Jiuxin Cao Query-Based Knowledge Sharing for
Open-Vocabulary Multi-Label
Classification . . . . . . . . . . . . . 358:1--358:22
Weicheng Xie and
Haijian Liang and
Zenghao Niu and
Xianxu Hou and
Siyang Song and
Zitong Yu and
Linlin Shen Frequency Restoration and Modality
Enforcement towards Resisting-corruption
Multimodal Sentiment Analysis . . . . . 359:1--359:24
Victor Kombou and
Qi Xia and
Hu Xia and
Jianbin Gao and
Wei Zhang and
Benjamin Fabien Eyezo'o and
Stephane Richard Befoum and
Jonathan Anto Leoba and
Brinda Leaticia Kuiche Sop PrivaMod: Uncertainty-Aware Multimedia
Fusion with Privacy Guarantees for NFT
Visual and Transaction Analysis . . . . 360:1--360:23
Junfeng Tu and
Xueliang Liu and
Yanbin Hao and
Richang Hong A Unified Generative Hashing for
Cross-Modal Retrieval . . . . . . . . . 361:1--361:15
Meng Jian and
Ruoxi Li and
Xiaoyan Gao and
Liqiang Wei and
Lifang Wu Dual Interest Learning with
Context-Aware Adaptive Interaction for
Social Recommendation . . . . . . . . . 362:1--362:18
Kaiwei Zhang and
Mohan He and
Dandan Zhu and
Kun Zhu and
Xiongkuo Min and
Guangtao Zhai Elevating Mesh Saliency in VR:
Introducing a Novel Prediction Network
and Dataset . . . . . . . . . . . . . . 363:1--363:22

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 22, Number 1, January, 2026

              Mingqiang Wei and   
                   Qian Sun and   
                 Haoran Xie and   
                 Dong Liang and   
                Dingkun Zhu and   
                    Fu Lee Wang   Search by Image: Deeply Exploring
                                  Beneficial Features for Beauty Product
                                  Retrieval  . . . . . . . . . . . . . . . 1:1--1:19
                   Hu Xiong and   
                   Hang Yan and   
        Mohammad S. Obaidat and   
               Jingxue Chen and   
              Mingsheng Cao and   
               Sachin Kumar and   
           Kadambri Agarwal and   
                    Saru Kumari   Efficient and Privacy-Enhanced
                                  Asynchronous Federated Learning for
                                  Multimedia Data in Edge-Based IoT  . . . 2:1--2:23
       Sadia Jabeen Siddiqi and   
    Abdulraheem H. Alobaidi and   
             Mian Ahmad Jan and   
                 Muhammad Tariq   Securing Vehicle-to-Digital Twin
                                  Communications in the Internet of
                                  Vehicles . . . . . . . . . . . . . . . . 3:1--3:19
                Baoping Liu and   
                     Bo Liu and   
                  Ming Ding and   
                   Tianqing Zhu   ForgeFinder: Perceptive Multimodal
                                  Deepfake Detection via Multi-grained
                                  Forgery Localization . . . . . . . . . . 4:1--4:24
              Haolong Xiang and   
                Xuyun Zhang and   
                Xiaolong Xu and   
              Amin Beheshti and   
                Lianyong Qi and   
                 Yujie Hong and   
                    Wanchun Dou   Federated Learning-Based Anomaly
                                  Detection with Isolation Forest in the
                                  IoT-Edge Continuum . . . . . . . . . . . 5:1--5:19
         Trung Thanh Nguyen and   
         Yasutomo Kawanishi and   
          Takahiro Komamizu and   
                     Ichiro Ide   Hierarchical Global-Local Fusion for
                                  One-stage Open-vocabulary Temporal
                                  Action Detection . . . . . . . . . . . . 6:1--6:23
                  Yuan Wang and   
                    Bin Zhu and   
                 Yanbin Hao and   
              Chong-Wah Ngo and   
                     Yi Tan and   
                     Xiang Wang   CookingDiffusion: Cooking Procedural
                                  Image Generation with Stable Diffusion   7:1--7:24
                  Yaning Li and   
                    Hao Zhu and   
                   Bing-Kun Bao   Light Field Reconstruction Using
                                  Multi-orientation Epipolar Plane Images  8:1--8:22
                 Qing Zhang and   
                 Jing Zhang and   
               Xiangdong Su and   
                Feilong Bao and   
                   Guanglai Gao   Hyperbolic-Based Cross-Modal Semantic
                                  Remodeling Network for Zero-Shot
                                  Sketch-Based Image Retrieval . . . . . . 9:1--9:23
                  Anni Tang and   
                Zhiyu Zhang and   
                   Chen Zhu and   
                   Jun Ling and   
                   Rong Xie and   
                        Li Song   A Hybrid Scheme for Face Video
                                  Compression  . . . . . . . . . . . . . . 10:1--10:24
                 Xingyu Liu and   
                  Yan Jiang and   
                   Xu Cheng and   
                     Hao Yu and   
                 Haoyu Chen and   
                   Guoying Zhao   CROMBO: Cross-Modality Bootstrapping for
                                  Unified Sketch-Photo Representation
                                  Learning . . . . . . . . . . . . . . . . 11:1--11:18
                  Yan Zhang and   
                   Rui Song and   
                 Riting Xia and   
                    Zhenwei Shi   QoE Evaluation for VR with Vibrotactile
                                  Feedback Based on Inter-user Brain
                                  Spatial Information  . . . . . . . . . . 12:1--12:20
                Jianjun Lei and   
                  Duohui Tu and   
                    Bo Peng and   
                    Jie Zhu and   
                  Zhe Zhang and   
                   Chong Wu and   
                 Qingming Huang   Depth-Aware Transformer for Aerial
                                  Localization . . . . . . . . . . . . . . 13:1--13:16
              Seung-Lee Lee and   
                Minjae Kang and   
               Bo Seok Shim and   
                    Jong-Uk Hou   Robust 3D Watermarking for NeRF-Induced
                                  Modality Shifts  . . . . . . . . . . . . 14:1--14:23
                    Zan Gao and   
                  Xiaoyi Xu and   
                  Yibo Zhao and   
                 Chunjie Ma and   
                Yanbing Xue and   
                     Riwei Wang   A Collaborative Hierarchical Aggregation
                                  Network for Weakly Supervised Temporal
                                  Action Localization  . . . . . . . . . . 15:1--15:18
               Xianxuan Lin and   
                Bailin Yang and   
                Zhigeng Pan and   
              Chuangxin Cai and   
                Shuang Wang and   
            Aditi Bhattarai and   
                       Fan Meng   MambaWDC: Efficient Weather Data
                                  Compression via Selective State Space
                                  Model  . . . . . . . . . . . . . . . . . 16:1--16:24
               Zeyang Zhang and   
                     Hui Li and   
                Tianyang Xu and   
                 Xiaojun Wu and   
              Congcong Bian and   
                  Josef Kittler   BusReF: Infrared-Visible Images
                                  Registration and Fusion Focus on
                                  Reconstructible Area Using One Set of
                                  Features . . . . . . . . . . . . . . . . 17:1--17:19
               Jooyoung Lee and   
              Se Yoon Jeong and   
                   Munchurl Kim   DeepHQ: Learned Hierarchical Quantizer
                                  for Progressive Deep Image Coding  . . . 18:1--18:24
             Karanvir Singh and   
     Abdulmotaleb El Saddik and   
                   Mukesh Saini   A Step Closer Towards the Digital Twin
                                  of the Plant . . . . . . . . . . . . . . 19:1--19:23
                 Haiyu Deng and   
                    Xu Wang and   
              Guangsheng Yu and   
                     Wei Ni and   
                    Ying He and   
             Tanzeela Altaf and   
                   Ren Ping Liu   NNFMAC: a Neural Network
                                  Fingerprinting-Based Model
                                  Authentication Code Scheme . . . . . . . 20:1--20:25
                   Xin Dong and   
                Lihan Zhang and   
                 Aoyang Liu and   
              Xiaojun Liang and   
                  Yutao Guo and   
                   Yansong Tang   Enhancing Pose-Guided Human Image
                                  Generation with Comprehensive and
                                  Adjustable 3D Control  . . . . . . . . . 21:1--21:24
                   Yibo Xia and   
                 Qihui Zhan and   
                Xiaoyan Luo and   
               Xiaofeng Shi and   
                   Yunhong Wang   SignMask: Structure-aware Masked
                                  Modeling for Holistic 3D Sign Language
                                  Production . . . . . . . . . . . . . . . 22:1--22:28
            Liangcheng Zhao and   
               Yueying Wang and   
                 Yuhao Qing and   
                   Dan Zeng and   
                          Li Xu   MCFINet: a Cost-Efficient Multi-Channel
                                  Feature Integration Network for Surface
                                  Scenarios Image Super-Resolution . . . . 23:1--23:17
                Zhihao Wang and   
               Feifei Zhang and   
                Lingkai Ran and   
                Caixia Song and   
                      Ling Zhou   Enhancing Image Captioning through
                                  Bridging Image-Text Gap and Reducing
                                  Hallucinations . . . . . . . . . . . . . 24:1--24:23
                   Ruiji Xu and   
                Junhao Chen and   
               Runzhe Zhang and   
               Guanglin Dai and   
                       Keji Mao   FaceDepth: a Robust Unimodal Depression
                                  Detection Framework Using Invariant
                                  Facial Landmark Features . . . . . . . . 25:1--25:27
                  Yixuan Li and   
                  Lipeng Ma and   
               Weidong Yang and   
                        Ben Fei   3DMambaComplete: Structured State Space
                                  Model for High-Efficiency Point Cloud
                                  Completion . . . . . . . . . . . . . . . 26:1--26:24
             Shangheng Chen and   
                  Quan Fang and   
            Shengsheng Qian and   
                  Changsheng Xu   Metapath-Enhanced Language Model
                                  Pretraining on Text-Attributed
                                  Heterogeneous Graphs . . . . . . . . . . 27:1--27:23
               Yuanyu Zheng and   
                  Lin Zhang and   
                  Yunda Sun and   
                  Ying Shen and   
                  Shengjie Zhao   CaneSpeaker: an LLM-Assisted Speaker for
                                  Generating Human-Like Navigation
                                  Instructions . . . . . . . . . . . . . . 28:1--28:26
                 Wenjun Xie and   
                 Kejun Chen and   
                  Dong Wang and   
                   Xiaoping Liu   MatPose: a 2D Human Pose Estimation
                                  Model with Hybrid Mamba-Transformer  . . 29:1--29:21
                 Xinyi Chen and   
                 Weimin Lei and   
                  Wei Zhang and   
                  Wenhui Ye and   
                    Yanwen Wang   Portrait Video Compression with
                                  Semantic-guided Animation Model and
                                  Background Incremental Coding  . . . . . 30:1--30:23

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 22, Number 2, February, 2026

                Shiyi Zheng and   
                Peizhi Zhao and   
              Qingbao Huang and   
                     Yi Cai and   
               Haonan Cheng and   
                          Qi Wu   Implement Referring Expression
                                  Comprehension by Extending Auto-focus
                                  Lens to Locked Vision Model  . . . . . . 31:1--31:24
               Dinghao Yang and   
                   Bin Wang and   
                  Weijia Li and   
                   Yiqi Lin and   
                     Conghui He   Exploring the Interactive Guidance for
                                  Unified and Effective Image Matting  . . 32:1--32:24
            Masahiro Yasuda and   
              Noboru Harada and   
            Yasunori Ohishi and   
            Shoichiro Saito and   
             Akira Nakayama and   
                   Nobutaka Ono   Guided Masked Self-Distillation Modeling
                                  for Distributed Multimedia Sensor Event
                                  Analysis . . . . . . . . . . . . . . . . 33:1--33:24
                 Huijie Yao and   
               Wengang Zhou and   
                   Hao Zhou and   
                  Hezhen Hu and   
                    Houqiang Li   Retrieval-Augmented Sign Language
                                  Translation  . . . . . . . . . . . . . . 34:1--34:19
                 Yuanyou Xu and   
               Zongxin Yang and   
                        Yi Yang   Photorealistic Text-to-3D Avatar
                                  Generation with Constraints for
                                  Decoupled Geometry and Appearance  . . . 35:1--35:22
              Jiazhong Chen and   
                     Lu Guo and   
                  Dakai Ren and   
                    Zian Fu and   
                  Furui Liu and   
                    Hao Zhu and   
                     Yuxuan Pan   Geometry-Insensitive RPN Prototypes for
                                  Domain Adaptive 3D Object Detection  . . 36:1--36:22
                 Yulei Yang and   
                Zongju Peng and   
                Huabo Zhang and   
                   Fen Chen and   
                Qianliang Zhang   LF-F$^3$Net: Frequency-Guided Feature
                                  Fusion Network for Light Field Image
                                  Super-Resolution . . . . . . . . . . . . 37:1--37:19
                Junteng Liu and   
                 Zizhe Wang and   
                Yunji Liang and   
              Sagar Samtani and   
                Yangyang Li and   
                   Lei Tang and   
                      Zhiwen Yu   A Hierarchical Hard Negative Sampling
                                  Strategy for Robust Out-of-Distribution
                                  Object Detection . . . . . . . . . . . . 38:1--38:20
                 Pindan Cao and   
                Weiqing Min and   
               Guorui Sheng and   
             Yongqiang Song and   
                    Tao Yao and   
                  Lili Wang and   
                 Shuqiang Jiang   FoodHash: Context-Aware Proxy
                                  Interaction and Fusion for Food Image
                                  Retrieval  . . . . . . . . . . . . . . . 39:1--39:24
                   Yifan Xu and   
                 Sirui Zhao and   
                Shifeng Liu and   
                    Tong Xu and   
                    Enhong Chen   Emotionally Controllable Audio-driven
                                  Talking Face Generation  . . . . . . . . 40:1--40:22
               Shaofan Wang and   
                  Fuhao Wei and   
                    Hong Ma and   
                Yanfeng Sun and   
                     Baocai Yin   Text-Prompted Prompt Generator with
                                  Uncertainty Regularization for
                                  Rehearsal-Free Class-Incremental
                                  Learning . . . . . . . . . . . . . . . . 41:1--41:23
                Harry Cheng and   
               Yangyang Guo and   
                Tianyi Wang and   
                Liqiang Nie and   
              Mohan Kankanhalli   Towards Generalizable Deepfake Detection
                                  by Primary Region Regularization . . . . 42:1--42:25
                Junjie Chen and   
                    Hang Yu and   
                Subin Huang and   
                 Sanmin Liu and   
                  Linfeng Zhang   InterCLIP-MEP: Interactive CLIP and
                                  Memory-Enhanced Predictor for
                                  Multi-Modal Sarcasm Detection  . . . . . 43:1--43:23
                Yefei Sheng and   
                   Jie Wang and   
                   Ming Tao and   
                   Bing-Kun Bao   AdaEdit: Adaptive Diffusion Model for
                                  Invisible Target Oriented
                                  Text-Conditioned Image Editing . . . . . 44:1--44:19
               Yuankang Pan and   
              Zhaoquan Yuan and   
                    Xiao Wu and   
                  Zechao Li and   
                  Changsheng Xu   THMM-CLIP: Task-Guided Hierarchical
                                  Multi-Modal Alignment for Rehearsal-Free
                                  Class Incremental Learning . . . . . . . 45:1--45:18
                   Yu Jiang and   
               Yongji Zhang and   
                    Siqi Li and   
               Yuehang Wang and   
                        Yue Gao   SkiTrack: an Aerial Skiing Benchmark for
                                  Human-Centric Object Tracking  . . . . . 46:1--46:20
             Xiangling Ding and   
                   Jia Tang and   
                   Yunyi Li and   
                 Gaobo Yang and   
                      Yubo Lang   Bi-Level Routing Attention and Enhanced
                                  Spatial-Temporal Inconsistency Learning
                                  for Deep VFI Video Detection . . . . . . 47:1--47:26
                  Zhe Chang and   
                Haodong Jin and   
                   Yan Song and   
                   Ying Sun and   
                         Hui Yu   GAT-NeRF:
                                  Geometry-Aware-Transformer-Enhanced
                                  Neural Radiance Fields for High-Fidelity
                                  4D Facial Avatars  . . . . . . . . . . . 48:1--48:20
             Chong-Yu Zhang and   
                    Xin Luo and   
                Yu-Wei Zhan and   
              Zhen-Duo Chen and   
                    Xin-Shun Xu   Gleaning Wisdom from the Past: Towards
                                  Label Incremental Learning for Online
                                  Hashing with a Plug-and-Play Framework   49:1--49:23
                  Yilin Hou and   
                   Jin Wang and   
                 Jiade Chen and   
                 Yunhui Shi and   
                   Nam Ling and   
                     Baocai Yin   S$^2$PU-Net: Sparse Semantic-Guided
                                  Progressive Point Cloud Upsampling for
                                  Indoor Scenes  . . . . . . . . . . . . . 50:1--50:24
                Jianhui Zou and   
                 Weijia Cao and   
                  Nankun Mu and   
                  Shuang Yi and   
               Yifeng Zheng and   
                Zhaoquan Gu and   
                   Zhongyun Hua   Reversible Data Hiding over Encrypted
                                  Images via Intrinsic Correlation in
                                  Block-Based Secret Sharing . . . . . . . 51:1--51:25
                   Shuo Han and   
                 Qibing Qin and   
              Wenfeng Zhang and   
                      Lei Huang   Deep Uncertainty-aware Probabilistic
                                  Hashing for Cross-modal Retrieval  . . . 52:1--52:23
                Xiaobo Yang and   
                   Xiaojin Gong   Re-purposing SAM into Efficient Visual
                                  Projectors for MLLM-based Referring
                                  Image Segmentation . . . . . . . . . . . 53:1--53:26
                Mingjie Qiu and   
                  Zhiyi Tan and   
                   Bing-Kun Bao   MyGO: Modality-incomplete Fake News
                                  Video Detection via Prompt-assisted
                                  Modality Disentangling Model . . . . . . 54:1--54:23
                Zhiwen Shao and   
                  Hang Yang and   
               Hancheng Zhu and   
                    Rui Yao and   
                  Lixin Zou and   
                Mengtian Li and   
                      Bin Sheng   Spatio-Temporal Disentanglement and
                                  Constrained Self-Attention for
                                  Multi-Modal Deception Detection  . . . . 55:1--55:20
                    Tao Yan and   
              Weilong Huang and   
                Weijiang He and   
             Chenglong Wang and   
                 Cihang Wei and   
                   Yiwei Lu and   
               Xiangjie Zhu and   
               Yinghui Wang and   
               Rynson W. H. Lau   MDeRainNet: an Efficient Macro-pixel
                                  Image Rain Removal Network . . . . . . . 56:1--56:24
               Jikang Cheng and   
                  Jiaxin Ai and   
                   Zhen Han and   
                 Chao Liang and   
                    Qin Zou and   
                 Zhongyuan Wang   IDRetracor: Towards Visual Forensics
                                  against Malicious Face Swapping  . . . . 57:1--57:22
               Yangtao Wang and   
               Weibin Huang and   
                Yanzhao Xie and   
                Siyuan Chen and   
               Weilong Peng and   
                Maobin Tang and   
                  Meie Fang and   
                 Wensheng Zhang   High Feature Distinguishability for
                                  Adaptive Image-text Matching with
                                  Dual-stream Transformers . . . . . . . . 58:1--58:23
            Ghulam Muhammad and   
       Sumayah Almuntasheri and   
              Fadia Alenezi and   
              Nwraan Alhadi and   
             Victor C. M. Leung   EEG-based Multimodal Emotion
                                  Recognition: Recent Progress,
                                  Challenges, and Future Directions  . . . 59:1--59:28
             Syed Umar Amin and   
             Mohsen Guizani and   
              M. Shamim Hossain   Advances, Evaluation, and Explainability
                                  of Large Language Models in Healthcare:
                                  a Systematic Review  . . . . . . . . . . 60:1--60:32

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 22, Number 3, March, 2026

         Christian Timmerer and   
           Maria G. Martini and   
               Ali C. Begen and   
                  Luca De Cicco   Introduction to the Special Issue on ACM
                                  Multimedia Systems 2024 and Co-Located
                                  Workshops  . . . . . . . . . . . . . . . 61:1--61:4
                Ali Zeynali and   
             Mahsa Sahebdel and   
    Mohammad H. Hajiesmaili and   
            Ramesh K. Sitaraman   BOLA360: Near-optimal View and Bitrate
                                  Adaptation for 360-degree Video
                                  Streaming  . . . . . . . . . . . . . . . 62:1--62:30
                Jianxin Shi and   
                 Miao Zhang and   
               Linfeng Shen and   
             Jiangchuan Liu and   
                 Yuan Zhang and   
                 Lingjun Pu and   
                    Jingdong Xu   Implicit Representation-based Volumetric
                                  Video Streaming for Photorealistic
                                  Full-scene Experience  . . . . . . . . . 63:1--63:21
                Darijo Raca and   
             Gregory Provan and   
                   Ahmed Zahran   M$^2$ATURE: Mobile Multistage Throughput
                                  Prediction for Adaptive Video Streaming
                                  in Cellular Networks . . . . . . . . . . 64:1--64:17
     Matthias De Fré and   
       Jeroen van der Hooft and   
                Tim Wauters and   
                 Filip De Turck   Scalable MDC-Based WebRTC Streaming for
                                  One-to-Many Volumetric Video
                                  Conferencing . . . . . . . . . . . . . . 65:1--65:25
               Casper Haems and   
       Jeroen van der Hooft and   
              Hannes Mareen and   
           Peter Steenkiste and   
       Glenn Van Wallendael and   
                Tim Wauters and   
                 Filip De Turck   Hybrid Unicast-Broadcast Video Delivery
                                  for Scalable Low-Latency Live Streaming  66:1--66:24
                Dongbiao He and   
                    Xian Yu and   
                 Canshu Lin and   
            Cedric Westphal and   
             Zhongxing Ming and   
               Laizhong Cui and   
                    Xu Zhou and   
   J. J. Garcia-Luna-Aceves and   
                     Yanbiao Li   Enhancing Video Conference Applications
                                  with VCApather: a Network as a Service
                                  Perspective  . . . . . . . . . . . . . . 67:1--67:24
              Yuankang Zhao and   
                 Qinghua Wu and   
                   Gerui Lv and   
                Furong Yang and   
               Jiuhai Zhang and   
                  Feng Peng and   
                 Yanmei Liu and   
                  Zhenyu Li and   
                 Hongyu Guo and   
                  Ying Chen and   
                    Gaogang Xie   Understanding and Taming the Inflated
                                  Latency in Mobile Cloud Rendering  . . . 68:1--68:23
              Valeri George and   
           Jens Brandenburg and   
               Gabriel Hege and   
                Tobias Hinz and   
            Adam Wieckowski and   
             Benjamin Bross and   
             Thomas Schierl and   
                   Detlev Marpe   Multi-level Inter-frame Parallelization
                                  in an Open Optimized VVC Encoder . . . . 69:1--69:16
     Hamed Alimohammadzadeh and   
                 Shuqin Zhu and   
        Shahram Ghandeharizadeh   Techniques to Conceal Dark Standby
                                  Flying Light Specks  . . . . . . . . . . 70:1--70:26
             Jia-Hong Huang and   
         Chao-Han Huck Yang and   
                Pin-Yu Chen and   
              Min-Hung Chen and   
                 Marcel Worring   Conditional Modeling-Based Automatic
                                  Video Summarization  . . . . . . . . . . 71:1--71:21
                    Bo Peng and   
                   Lin Chen and   
                Jiahui Song and   
               Menglei Zhao and   
             Qingming Huang and   
                    Jianjun Lei   Meta-Learned Zero-Shot Sketch-Based
                                  Point Cloud Retrieval via
                                  Perspective-Predicted Feature Learning   72:1--72:16
                Weizhi Xian and   
                 Yichi Chen and   
                   Bin Chen and   
               Leong Hou U. and   
                 Shiyou Liu and   
                  Yong Feng and   
             Mingliang Zhou and   
                      Sam Kwong   Neighborhood Attention-based Feature
                                  Reconstruction for Image Anomaly
                                  Detection and Localization . . . . . . . 73:1--73:20
                    Keke Xu and   
              Zhenghua Peng and   
           Shuangping Huang and   
                 Gege Zhang and   
                 Yunqing Hu and   
                    Wenjie Peng   Improving Pseudo-Labeling by Dynamic
                                  Confidence Calibration for
                                  Semi-Supervised Sequence Recognition . . 74:1--74:22
                   Yun Zhou and   
                 Hongfu Yin and   
                 Chunyu Tan and   
                 Qiaoyun Wu and   
               Changyin Sun and   
                   Richang Hong   An Efficient Hybrid Cascade Tracker with
                                  Spiking Neural Networks for Event Domain
                                  Tracking . . . . . . . . . . . . . . . . 75:1--75:20
                 Bifa Liang and   
                Yichao Wang and   
                  Ziyang Hu and   
              Zhicong Huang and   
                 Haifeng Hu and   
                Jianming Xu and   
                      Dihu Chen   RCAENet: Residual Convolutional and
                                  Attention-Enhanced Stereo Matching for
                                  Real-Time Depth Estimation on Edge
                                  Devices  . . . . . . . . . . . . . . . . 76:1--76:25
              Guanhua Zheng and   
                 Jitao Sang and   
                  Changsheng Xu   GAROD: Delve into Gradient-Based
                                  Attribution Reliability for
                                  Out-of-Distribution Detection  . . . . . 77:1--77:17
                   Jun Ling and   
                 Yiwen Wang and   
                    Han Xue and   
                   Rong Xie and   
                        Li Song   PoseTalk: Exploring Text- and
                                  Audio-Based Pose Control for One-Shot
                                  Talking Face Generation  . . . . . . . . 78:1--78:24
                 Xinbo Geng and   
                    Fan Shi and   
                   Xu Cheng and   
                   Chen Jia and   
                 Shengyong Chen   Hierarchical Spatial-Angular
                                  Representation Learning for
                                  Point-Supervised Salient Object
                                  Detection in Light Fields  . . . . . . . 79:1--79:18
                 Yifan Zhao and   
               Ziyang Zheng and   
                 Duoduo Xue and   
                    Yong Li and   
                 Wenrui Dai and   
                Chenglin Li and   
                  Junni Zou and   
                  Hongkai Xiong   Unfolding Convolutional Sparse Coding
                                  With Low-rank-Guided Hybrid Priors for
                                  Image Denoising  . . . . . . . . . . . . 80:1--80:29
                 Guoyi Tang and   
                 Chunlin Li and   
                Zihao Zhang and   
                  Kun Jiang and   
               Bingxin Wang and   
                  Wenhao Wu and   
                    Xu Yang and   
                    Shaohua Wan   Low-Latency Multimedia Delivery via
                                  Collaborative Cloud-Edge Caching in Edge
                                  Computing Networks . . . . . . . . . . . 81:1--81:24
                   Yidan Xu and   
                    Suo Gao and   
               Yinghong Cao and   
                        Jun Mou   Multi-Image Encryption Scheme Based on
                                  Chaotic Pseudo-Random Signal Generator
                                  and DWT Compression  . . . . . . . . . . 82:1--82:20
             Jing-Xuan Chen and   
                    Ling Lo and   
                   Si-Yu Lu and   
                   Ling Zou and   
            Wen-Huang Cheng and   
                Jungwoo Huh and   
                   Sanghoon Lee   SeCo: Semantic-Guided Multimodal Color
                                  Splash Effects . . . . . . . . . . . . . 83:1--83:21
                  Zuyi Zhou and   
                 Dizhan Xue and   
                 Baoyuan Qi and   
            Shengsheng Qian and   
                  Changsheng Xu   Code-Driven LLM Agent for One-Shot
                                  Explanatory Visual Question Answering    84:1--84:16
                 Dongxu Mao and   
              Shangzhi Teng and   
                   Xueqiang Lyu   CVAF: a CLIP-Based View-Consistent
                                  Alignment Framework for Aerial-Ground
                                  Person Re-Identification . . . . . . . . 85:1--85:19
               Yuan-Yu Tsai and   
               Wen-Ting Jao and   
                    Yi-Hui Chen   Authentication-enabled Reversible Data
                                  Hiding in Encrypted 3D Meshes via
                                  Effective Vertex Traversal and Secret
                                  Sharing  . . . . . . . . . . . . . . . . 86:1--86:25
                    Jian Li and   
                Quanxing Xu and   
                  Ling Zhou and   
               Feifei Zhang and   
                   Rubing Huang   PLMAS: Adaptive Sample Selection for
                                  Prompting LLMs in Knowledge-Based Visual
                                  Question Answering . . . . . . . . . . . 87:1--87:21
                     Ji Dai and   
                  Quan Fang and   
                     Jun Hu and   
                Desheng Cai and   
                  Yang Yang and   
                       Can Zhao   Cross-Modal Attention Network with Dual
                                  Graph Learning in Multimodal
                                  Recommendation . . . . . . . . . . . . . 88:1--88:23
               Yun-Cong Liu and   
              Zhen-Duo Chen and   
                Qing-Ze Bai and   
              Xiao-Dong Xie and   
                    Hao Liu and   
                    Xin Luo and   
                    Xin-Shun Xu   Fine-Grained Augmentation and
                                  Progressive Feature Integration for
                                  Unsupervised Fine-Grained Hashing  . . . 89:1--89:19
                  Jiaqi Liu and   
               Xian-Ying Xu and   
                    Suo Gao and   
                Junxin Chen and   
                        Jun Mou   Lightweight Video Secondary-Encryption
                                  Scheme Based on YOLOv11 and a Discrete
                                  Model of Bi-Neuron HNN . . . . . . . . . 90:1--90:20

ACM Transactions on Multimedia Computing, Communications, and Applications
Volume 22, Number 4, April, 2026

                  Wei Zhang and   
                   Peng Jin and   
                 Chunyu Zou and   
                   Han Peng and   
                Ziyao Zhang and   
                   Jie Chen and   
                        Wen Gao   Disentangled Concept Matching for
                                  Text-video Retrieval through Perception
                                  Imitation  . . . . . . . . . . . . . . . 91:1--91:23
                Zhenyu Zhou and   
                  Qing Liao and   
                Tianrui Liu and   
                    Lei Luo and   
                Xinwang Liu and   
                         En Zhu   Self-Regressive Prototype Refinement:
                                  Stepping from Local to Global Prototypes
                                  in Few-Shot Image Classification . . . . 92:1--92:23
                 Lejun Gong and   
                   Ziyi Wei and   
                    Li Tian and   
                   Jie Deng and   
                      Guozi Sun   PingTactics: a Multimodal Dataset for
                                  Table Tennis Action Recognition and
                                  Tactical Analysis  . . . . . . . . . . . 93:1--93:22
      Jashanjot Singh Sidhu and   
              Abdelhak Bentaleb   Video Streaming Over QUIC: a
                                  Comprehensive Study  . . . . . . . . . . 94:1--94:29
                 Zhihao Jia and   
                    Hao Liu and   
                 Shuo Zhang and   
              Yonghua Zhang and   
                    Yang Lu and   
               Ziqing Huang and   
                   Shiguang Liu   Adaptive Geometric Attention-Driven
                                  No-Reference Multi-Modal Point Cloud
                                  Quality Assessment . . . . . . . . . . . 95:1--95:24
                Chuyuan Wei and   
                  Anning He and   
               Shengda Zhuo and   
             Changdong Wang and   
                 Shuqiang Huang   ProGraph: Graph Prompt Tuning with
                                  Knowledge-aware Contrastive Learning for
                                  Recommendation . . . . . . . . . . . . . 96:1--96:26
                  Wei Jiang and   
                Yongqi Zhai and   
                 Jiayu Yang and   
                   Feng Gao and   
                  Ronggang Wang   MLICv2: Enhanced Multi-Reference Entropy
                                  Modeling for Learned Image Compression   97:1--97:23
                Tomasz Lyko and   
             Yehia Elkhatib and   
             Rajiv Ramdhany and   
                  Nicholas Race   Beyond Bitrate: Understanding the QoE
                                  Impact of Playback Rate and Seeking in
                                  Adaptive Video Streaming . . . . . . . . 98:1--98:24
                 Zequn Wang and   
               Tianshi Wang and   
                Fengling Li and   
                Jingjing Li and   
                        Lei Zhu   Noise-Robust Generative Hashing for
                                  Cross-Modal Retrieval  . . . . . . . . . 99:1--99:22
           Alberto Baldrati and   
             Davide Morelli and   
            Marcella Cornia and   
              Marco Bertini and   
                 Rita Cucchiara   Multimodal-Conditioned Latent Diffusion
                                  Models for Fashion Image Editing . . . . 100:1--100:27
               Pengyu Huang and   
                Jingjing Wu and   
                Yanrong Guo and   
                   Richang Hong   Infrared Object Tracking via
                                  Complementary Dual-domain Interaction
                                  with Target-guided Frequency
                                  Transformation . . . . . . . . . . . . . 101:1--101:23
                 Yifang Yin and   
                Jinming Cao and   
              Zhenguang Liu and   
              Guanfeng Wang and   
                Shili Xiang and   
               Roger Zimmermann   Improving Test-Time Efficiency in
                                  Source-Free Semantic Segmentation via
                                  Multi-Stage Self-Training  . . . . . . . 102:1--102:20
                  Jican Tan and   
                Jinjia Peng and   
               Songyu Zhang and   
                  Zhen Wang and   
                   Huibing Wang   Unsupervised Lifelong Person
                                  Re-Identification via Affinity
                                  Harmonization  . . . . . . . . . . . . . 103:1--103:22
                Chunyang Fu and   
                      Ge Li and   
                    Wei Gao and   
                 Shiqi Wang and   
                     Zhu Li and   
                       Shan Liu   DALD-PCAC: Density-Adaptive Learning
                                  Descriptor for Point Cloud Lossless
                                  Attribute Compression  . . . . . . . . . 104:1--104:23
                    Wei Yin and   
                     Li Liu and   
                Xiaodong Fu and   
                  Lijun Liu and   
                       Wei Peng   Garment-Aware Neural Radiance Fields for
                                  Generalizable 3D Human Digitization  . . 105:1--105:22
                 Bozhi Luan and   
                   Hao Feng and   
                  Hong Chen and   
               Yonghui Wang and   
               Wengang Zhou and   
                    Houqiang Li   TextCoT: Zoom-In for Enhanced Multimodal
                                  Text-Rich Image Understanding  . . . . . 106:1--106:19
                     Han Yi and   
              Zhedong Zheng and   
                 Xiangyu Xu and   
                  Tat-Seng Chua   Progressive Text-to-3D Generation for
                                  Automatic 3D Prototyping . . . . . . . . 107:1--107:19
                   Gang Liu and   
               Boying Zhang and   
                       Yue Ming   PTHUMAN3D: 3D Gaussian Human Avatar
                                  Modeling with the Poincaré Ball and the
                                  Triplane Representation  . . . . . . . . 108:1--108:23
                   Hui Miao and   
               Yuanfang Guo and   
               Leo Yu Zhang and   
               Jiantao Zhou and   
                   Yunhong Wang   A Multi-Grained Parallel Spatio-Temporal
                                  Learning Architecture for Deepfake Video
                                  Detection  . . . . . . . . . . . . . . . 109:1--109:24
              Donglin Zhang and   
               Weixiang Shi and   
                Xiao-Jun Wu and   
                  Josef Kittler   Resilient Semantic Pseudo-Text Embedding
                                  for Zero-Shot Video Moment Retrieval . . 110:1--110:17
                  Wei Zhang and   
                  Juan Chen and   
                     En Zhu and   
              Wenhong Cheng and   
                 Yunpeng Li and   
                   Yuhan Li and   
                  Yanbo J. Wang   MLlm-DR: Towards Explainable Depression
                                  Recognition with MultiModal Large
                                  Language Models  . . . . . . . . . . . . 111:1--111:23
                Zetian Song and   
                Jiaqi Zhang and   
               Wenhong Duan and   
               Yuhuai Zhang and   
              Xinfeng Zhang and   
                       Siwei Ma   SPC-NeRF: Spatial Predictive Compression
                                  for Voxel-Based Radiance Field . . . . . 112:1--112:22
                   Yong Tao and   
                  Xinming Zhang   Text-Guided Cross-Modal Alignment with
                                  Attribute and Contour Prototypes for
                                  Visible-Infrared Person
                                  Re-Identification  . . . . . . . . . . . 113:1--113:18
                Honghao Gao and   
                 Dubin Feng and   
                    Ye Wang and   
                 Zhihao Pan and   
                 Yueshen Xu and   
          Bader Fahad Alkhamees   MonoLS: Multi-Scale Feature Fusion and
                                  Spatially-Aware Attention for Monocular
                                  3D Object Detection  . . . . . . . . . . 114:1--114:24
                 Ali Abdari and   
                Alex Falcon and   
                 Giuseppe Serra   Retrieving Relevant Metaverses Using
                                  Hierarchical Features  . . . . . . . . . 115:1--115:24
                   Shuo Han and   
                    Rui Yao and   
                  Huili Hao and   
                  Qian Feng and   
               Hancheng Zhu and   
                 Jiaqi Zhao and   
                      Yong Zhou   Dual Sparse Long-Short Term Transformer
                                  for Video Shadow Detection . . . . . . . 116:1--116:24
                     Lei Li and   
                Fuqiang Liu and   
                 Yanni Wang and   
                   Junyuan Wang   Bias-Free Semi-Supervised 3D
                                  Reconstruction via Occlusion
                                  Sensitivity-Guided Semantic
                                  Disentanglement  . . . . . . . . . . . . 117:1--117:24
               Chuanle Song and   
               Wenjin Huang and   
                   Han Jiao and   
                 Junfeng Li and   
                    Yihua Huang   Relationship-Experts Transformer for
                                  Image Captioning . . . . . . . . . . . . 118:1--118:20
                   Yali Cai and   
                  Peng Qiao and   
                   Dongsheng Li   VTalker: Text-Driven Synthesis of
                                  Talking Head with Vision Diffusion
                                  Transformer  . . . . . . . . . . . . . . 119:1--119:23
                Zhenxuan Lu and   
                  Zhihua Xu and   
               Zhijing Yang and   
                   Feng Gao and   
                  Yongyi Lu and   
                  Keze Wang and   
                  Tianshui Chen   Exploring Talking Head Models with
                                  Adjacent Frame Prior for
                                  Speech-Preserving Facial Expression
                                  Manipulation . . . . . . . . . . . . . . 120:1--120:19