Skip to main navigation Skip to search Skip to main content

Agent-Driven Generative Semantic Communication with Cross-Modality and Prediction

  • Wanting Yang
  • , Zehui Xiong
  • , Yanli Yuan
  • , Wenchao Jiang
  • , Tony Q.S. Quek
  • , Merouane Debbah
  • Singapore University of Technology and Design
  • Beijing Institute of Technology

Research output: Contribution to journalArticlepeer-review

20 Scopus citations

Abstract

In the era of 6G, with compelling visions of intelligent transportation systems and digital twins, remote surveillance is poised to become a ubiquitous practice. Substantial data volume and frequent updates present challenges in wireless networks. To address these challenges, we propose a novel agent-driven generative semantic communication (A-GSC) framework based on reinforcement learning. In contrast to the existing research on semantic communication (SemCom), which mainly focuses on either semantic extraction or semantic sampling, we seamlessly integrate both by jointly considering the intrinsic attributes of source information and the contextual information regarding the task. Notably, the introduction of generative artificial intelligence (GAI) enables the independent design of semantic encoders and decoders. In this work, we develop an agent-assisted semantic encoder with cross-modality capability, which can track the semantic changes, channel condition, to perform adaptive semantic extraction and sampling. Accordingly, we design a semantic decoder with both predictive and generative capabilities, consisting of two tailored modules. Moreover, the effectiveness of the designed models has been verified using the UA-DETRAC dataset, demonstrating the performance gains of the overall A-GSC framework in both energy saving and reconstruction accuracy.

Original languageBritish English
Pages (from-to)2233-2248
Number of pages16
JournalIEEE Transactions on Wireless Communications
Volume24
Issue number3
DOIs
StatePublished - 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities

Keywords

  • deep reinforcement learning
  • diffusion model
  • Semantic communication
  • semantic sampling
  • video streaming

Fingerprint

Dive into the research topics of 'Agent-Driven Generative Semantic Communication with Cross-Modality and Prediction'. Together they form a unique fingerprint.

Cite this