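Abstract: High Efficiency Video Coding (HEVC), the successor to H.264/AVC, improves coding efficiency by about a factor of two. However, the increased computational complexity and data dependencies of its coding make hardware implementation of a video encoder more difficult, particularly the processing and access of video data within the encoder and the control of the encoder's internal state. Based on the HEVC macroblock coding flow, this paper proposes a video data access structure that lets the whole encoder run efficiently in real time, together with a top-level control scheme that coordinates the encoder's modules. Functional correctness of the design was verified in a joint VCS and Vivado simulation environment, and on-board verification was completed on a Xilinx VCU118 FPGA. Test results show that the synthesized encoder runs at a clock frequency of 100 MHz, which meets the requirement for encoding 1080p video at 30 fps.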
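A rough cycle budget helps put the 100 MHz clock and the 1080p at 30 fps target in context. The sketch below assumes a 64×64 block size and a single block in flight at a time; both are illustrative assumptions and are not taken from the paper, and `cycle_budget` is a hypothetical helper name.

```python
import math

def cycle_budget(width, height, fps, clock_hz, block=64):
    """Return (blocks per frame, cycles per block, cycles per pixel).

    `block` is an assumed square block size; the paper does not state one.
    """
    # Partial blocks at the frame edge still occupy a full block slot.
    blocks_per_frame = math.ceil(width / block) * math.ceil(height / block)
    cycles_per_block = clock_hz / (blocks_per_frame * fps)
    cycles_per_pixel = clock_hz / (width * height * fps)
    return blocks_per_frame, cycles_per_block, cycles_per_pixel

blocks, per_block, per_pixel = cycle_budget(1920, 1080, 30, 100e6)
print(f"{blocks} blocks/frame, {per_block:.0f} cycles/block, {per_pixel:.2f} cycles/pixel")
```

Under these assumptions the encoder has well under two clock cycles per pixel, which is why parallel data access and tight top-level scheduling matter for real-time operation.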
Funding: Supported in part by the National Key Research and Development Program of China under Grant No. 2024YFE0200600, the Zhejiang Provincial Natural Science Foundation of China under Grant No. LR23F010005, and the Huawei Cooperation Project under Grant No. TC20240829036.
Abstract: Along with the growing research interest in semantic communication (SemCom), joint source channel coding (JSCC) has dominated attention because it is widely assumed to deliver information semantics efficiently. Nevertheless, this paper challenges the conventional JSCC paradigm and advocates adopting separate source channel coding (SSCC) to gain more underlying degrees of freedom for optimization. We demonstrate that SSCC, using a Large Language Model (LLM) for source coding and an Error Correction Code Transformer (ECCT) for channel coding, offers superior performance over JSCC. Our proposed framework also highlights the compatibility challenges between SemCom approaches and digital communication systems, particularly the resource cost of transmitting high-precision floating-point numbers. Through comprehensive evaluations, we establish that, assisted by LLM-based compression and ECCT-enhanced error correction, SSCC remains a viable and effective solution for modern communication systems. In other words, separate source channel coding is still what we need.
Funding: Project supported by the Open Fund of Wuhan National Research Center for Optoelectronics (Grant No. 2022WNLOKF012) and the National College Students Innovation and Entrepreneurship Training Program (Grant No. 2023102930147).
Abstract: A broadband polarization-independent terahertz multifunctional coding metasurface based on topological optimization using liquid crystal (LC) is proposed. The metasurface achieves reconfigurable beam steering and vortex beam generation within a frequency range of 0.68 THz–0.72 THz. First, the metasurface unit is topologically optimized using the non-dominated sorting genetic algorithm II (NSGA-II), a multi-objective optimization algorithm. By exploiting the electrically tunable refractive index of the LC, the metasurface unit enables polarization-independent 2-bit coding within 0.68 THz–0.72 THz. Then, based on the designed unit, the array arrangement of the metasurface is reverse-designed to achieve beam steering and vortex beam generation. The results show that, for beam steering, polarization-independent steering of both single and multiple beams is achieved within a 35° elevation angle range, and the target angle of each beam in multi-beam steering can be controlled independently. For vortex beam generation, the metasurface generates single and multiple vortex beams with topological charges l = ±1, ±2 within the 35° elevation angle range, and the generation angle of each vortex beam in the multi-vortex case can be controlled independently. This provides flexibility and diversity in the generation of vortex beams. The proposed terahertz LC metasurface can therefore realize flexible control of reconfigurable functions and has application prospects in terahertz communication, phased array radar, and vortex radar.
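To make the idea of 2-bit coding for a vortex beam concrete, the sketch below quantizes the textbook spiral phase profile, l·atan2(y, x), into four phase states. It is an illustration only: the grid size, the floor quantization, and the helper name `vortex_code` are assumptions, and it does not reproduce the paper's reverse-designed array arrangement.

```python
import math

def vortex_code(n, charge, bits=2):
    """Quantize the spiral phase charge*atan2(y, x) on an n x n grid.

    Returns a list of rows of integer codes in [0, 2**bits). With bits=2,
    codes 0..3 stand for phase states of 0, 90, 180, and 270 degrees.
    """
    levels = 2 ** bits
    step = 2 * math.pi / levels
    grid = []
    for row in range(n):
        y = row - (n - 1) / 2              # element centers, symmetric about the origin
        codes = []
        for col in range(n):
            x = col - (n - 1) / 2
            phase = (charge * math.atan2(y, x)) % (2 * math.pi)
            codes.append(int(phase // step) % levels)   # floor-quantize the phase
        grid.append(codes)
    return grid

for line in vortex_code(4, 1):
    print(line)
```

Each code then selects one of the four LC bias states of a unit cell; changing the sign of `charge` reverses the handedness of the spiral.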
Funding: Supported in part by the National Natural Science Foundation of China under Grants 62122069, 62071431, and 62201507.
Received: Jan. 17, 2024. Revised: Jun. 12, 2024. Editor: Niu Kai.
Abstract: To address the contradiction between the explosive growth of wireless data and limited spectrum resources, semantic communication has been emerging as a promising communication paradigm. In this paper, we design a speech semantic coded communication system, referred to as Deep-STS (Deep-learning based Speech To Speech), for low-bandwidth speech communication. First, at the transmitter side of the Deep-STS system, we deeply compress the speech data by extracting its textual information with a conformer encoder and a connectionist temporal classification decoder. To facilitate the final recovery of speech timbre, we also extract the short-term timbre feature of the speech signal, using only its first 2 s, with a long short-term memory network. Second, Reed-Solomon coding and a hybrid automatic repeat request protocol are applied to improve the reliability of transmitting the extracted text and timbre feature over the wireless channel. Third, when the extracted text and the timbre feature are received at the receiver of the Deep-STS system, we reconstruct the speech signal with a mel spectrogram prediction network and a vocoder. Finally, we develop a demo system based on the USRP and GNU Radio to evaluate the performance of Deep-STS. Numerical results show that the accuracy of text extraction approaches 95%, and the mel cepstral distortion between the recovered speech signal and the original one in the spectrum domain is less than 10. Furthermore, the experimental results show that the proposed Deep-STS system reduces the total delay of speech communication by 85% on average compared with G.723 coding at a transmission rate of 5.4 kbps. More importantly, the coding rate of the proposed Deep-STS system is extremely low, only 0.2 kbps for continuous speech communication. It is worth noting that Deep-STS, with its lower coding rate, can support low-zero-power speech communication, unveiling a new era in ultra-efficient coded communications.
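The gap between the two coding rates quoted above is easier to appreciate as a payload size. The sketch below is simple arithmetic on the reported rates; the 10-second utterance length and the helper name `payload_bits` are assumptions chosen for illustration.

```python
def payload_bits(rate_kbps, seconds):
    """Bits needed to carry `seconds` of speech at a constant coding rate."""
    return rate_kbps * 1000 * seconds

duration_s = 10                              # assumed utterance length
deep_sts = payload_bits(0.2, duration_s)     # coding rate reported for Deep-STS
g723 = payload_bits(5.4, duration_s)         # G.723 rate quoted in the abstract
ratio = g723 / deep_sts
print(f"Deep-STS: {deep_sts:.0f} bits, G.723: {g723:.0f} bits, ratio: {ratio:.0f}x")
```

This counts source-coded bits only; Reed-Solomon parity and retransmissions would add overhead on top of both figures.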
Funding: Project supported by the National Natural Science Foundation of China (Grant Nos. 62071381 and 62301430), the Shaanxi Fundamental Science Research Project for Mathematics and Physics (Grant No. 23JSY014), the Scientific Research Plan Project of Shaanxi Education Department (Natural Science Special Project) (Grant No. 23JK0680), and the Young Talent Fund of Xi’an Association for Science and Technology (Grant No. 959202313011).
Abstract: Quantum secure direct communication (QSDC) is a communication method based on quantum mechanics that is used to transmit secret messages. Unlike quantum key distribution, QSDC transmits secret messages directly over a quantum channel. Higher channel capacity and noise suppression capability are key to achieving long-distance quantum communication. Here, we report a continuous-variable QSDC scheme based on mask-coding and orbital angular momentum, in which mask-coding is employed to protect the security of the transmitted messages and to suppress the influence of excess noise. The combination of orbital angular momentum and information block transmission effectively improves the secrecy capacity. In the 10-km experiment with 800 information blocks of 1310 bits each, the results show a statistical average bit error rate of 0.38%, a system excess noise of 0.0184 SNU, and a final secrecy capacity of 6.319×10^6 bps. This scheme therefore reduces error bits while increasing secrecy capacity, providing a solution for long-distance, large-scale quantum communication that is capable of transmitting text, images, and other information of reasonable size.
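The reported experiment size and bit error rate translate directly into a total bit count and an expected number of erroneous bits. The sketch below is plain arithmetic on the figures quoted in the abstract; the variable names are illustrative.

```python
blocks = 800                 # information blocks in the experiment
bits_per_block = 1310        # bits per block
ber = 0.0038                 # statistical average bit error rate (0.38%)

total_bits = blocks * bits_per_block
expected_errors = total_bits * ber

print(f"total bits: {total_bits}")
print(f"expected erroneous bits: {expected_errors:.1f}")
```

The error count is an average implied by the stated rate, not a measured per-block distribution.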
Funding: Supported by the National Natural Science Foundation of China (NSFC) under project ID 62071498 and the Guangdong National Science Foundation (GDNSF) under project ID 2024A1515010213.
Abstract: Constructed from BCH component codes and their ordered statistics decoding (OSD), successive cancellation list (SCL) decoding of U-UV structural codes can provide competent error-correction performance in the short-to-medium length regime. However, the complexity of this list decoding becomes formidable as the decoding output list size increases, primarily because of the OSD. To address this challenge, this paper proposes low-complexity SCL decoding that reduces the complexity of component code decoding and prunes redundant SCL decoding paths. For the former, an efficient skipping rule is introduced for the OSD so that higher-order decoding can be skipped when it cannot provide a more likely codeword candidate. The rule is further extended to an OSD variant, the box-and-match algorithm (BMA), to facilitate component code decoding. Moreover, by estimating the correlation distance lower bounds (CDLBs) of the component code decoding outputs, a path pruning (PP)-SCL decoding is proposed to further facilitate the decoding of U-UV codes. In particular, its integration with the improved OSD and BMA is discussed. Simulation results show that significant complexity reduction can be achieved. Consequently, the U-UV codes can outperform cyclic redundancy check (CRC)-polar codes at a similar decoding complexity.
Abstract: In the article “Silencing of the long non-coding RNA LINC00265 triggers autophagy and apoptosis in lung cancer by reducing protein stability of SIN3A oncogene” (Oncology Research. 2024, Vol. 32, No. 7, pp. 1185–1195. doi:10.32604/or.2023.030771, https://www.techscience.com/or/v32n7/57163), an inadvertent error occurred during the compilation of Fig. 3H. This required correction to ensure the accuracy and integrity of the data presented.
Funding: Supported by the Zhejiang Provincial Natural Science Foundation of China (Nos. LQ15F010001 and LY16F020029) and the General Research Project of Zhejiang Provincial Education Department (No. Y201430479).
Abstract: Fractional motion estimation (FME) improves video encoding efficiency significantly. However, its high computational complexity limits real-time processing capability. Reducing the implementation complexity of FME is therefore a key problem, especially in hardware design. This paper presents a novel deeply pipelined interpolation architecture of FME for the real-time realization of an H.265/HEVC full Ultra-HD video encoder. First, a pipelined interpolation architecture together with an elegant processing order is proposed to handle different search positions in parallel without pipeline stalls or data conflicts. Second, strategies for sharing interpolation results among search positions are exploited to reduce the memory cost. Finally, the structure of the interpolation filter is further optimized for an area-efficient implementation. As a result, the proposed design costs 41 917 slice LUTs on the Xilinx Kintex-7 FPGA platform at a 308 MHz working frequency. The measured throughput reaches a record 1.238 Gpixels/s, which is sufficient for the real-time encoding of 8192×4320 video at 30 fps.
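The throughput claim can be checked with a few lines of arithmetic. The sketch below compares the measured 1.238 Gpixels/s with the pixel rate of 8192×4320 video at 30 fps and derives the average pixels produced per clock cycle at 308 MHz. It assumes "pixels" means one sample per picture position; the function name is illustrative.

```python
def required_pixel_rate(width, height, fps):
    """Pixels per second that must be processed for real-time encoding."""
    return width * height * fps

need = required_pixel_rate(8192, 4320, 30)   # pixels per second for the target format
measured = 1.238e9                           # reported throughput, pixels per second
clock_hz = 308e6                             # reported working frequency

headroom = measured / need - 1               # fractional margin above the requirement
pixels_per_cycle = measured / clock_hz

print(f"required: {need / 1e9:.3f} Gpixels/s")
print(f"headroom: {headroom:.1%}, pixels per cycle: {pixels_per_cycle:.2f}")
```

The margin of roughly one sixth above the requirement is consistent with the abstract's statement that the throughput is sufficient for this format.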
Funding: Supported in part by the National Natural Science Foundation of China under Grant Nos. 61201226 and 61271096 and the Specialized Research Fund for the Doctoral Program under Grant No. 20130072110054.
Abstract: 1 Introduction. The screen content coding (SCC) standard [1] for High Efficiency Video Coding (HEVC) is an international standard specially developed for screen content.
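Abstract: Existing cost assignment methods for adaptive video steganography mainly target specific transform coefficients, which results in low capacity. In addition, distortion drift is a major challenge for HEVC (High Efficiency Video Coding) video steganography. A cost assignment method that combines the intra-frame and inter-frame processes of HEVC video coding is therefore proposed to achieve high-performance adaptive video steganography with high capacity and low distortion propagation. First, the method studies the characteristics of the discrete sine transform in HEVC video coding and analyzes how errors propagate after these coefficients are perturbed. For the embedding process, the intra-block, inter-block, and inter-frame distortion caused by modifying transform coefficients is analyzed in detail, and blocks are classified according to the differences in inter-block distortion produced by embedding in different blocks. The algorithm makes full use of all nonzero transform coefficients, assigns different distortion costs to different cover coefficients, and embeds the secret information into frames that have less impact on video quality. Experimental results show that, compared with existing coefficient-domain steganography methods for HEVC video, the algorithm has certain advantages in video bitrate, video quality, and embedding capacity.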
Abstract: Multi-layer extension is based on the single-layer design of the High Efficiency Video Coding (HEVC) standard and is employed as the common structure for the scalability and multi-view video coding extensions of HEVC. In this paper, an overview of multi-layer extension is presented. The concepts and advantages of multi-layer extension are briefly described. High level syntax (HLS) for multi-layer extension and several new designs are also detailed.