Linjun Li
We prove a polynomial upper bound for the localization length of the Lorentz mirror model and the Manhattan model on the even cylinder. The main input is a conditional cylinder-localization theorem in the winding regime: if short-direction…
The rapid evolution of multimodal large models has revolutionized the simulation of diverse characters in speech dialogue systems, enabling a novel interactive paradigm. Character attributes are manifested not only in textual responses but…
3D texture generation is receiving increasing attention, as it enables the creation of realistic and aesthetic texture materials for untextured 3D meshes. However, existing 3D texture generation methods are limited to producing only a few…
We present Meissner-like photon currents in a quantum Rabi zigzag chain under staggered synthetic magnetic fields. The ground state of the Meissner superradiant phase hosts persistent chiral edge currents in a sequence of cancellation of…
Enhancement and active control of light-matter interactions at the atomic scale is important for developing next-generation nanophotonic and quantum optical devices. Here, we demonstrate electric control of both excitonic strong coupling…
With the rapid development of large language models, researchers have created increasingly advanced spoken dialogue systems that can naturally converse with humans. However, these systems still struggle to handle the full complexity of…
Generative retrieval has recently emerged as a promising approach to sequential recommendation, framing candidate item retrieval as an autoregressive sequence generation problem. However, existing generative methods typically focus solely…
Two-dimensional single-crystal metals are highly sought after for next-generation technologies. Here, we report large-area (>10^4 {\mu}m2), single-crystal two-dimensional gold with thicknesses down to a single-nanometer level, employing an…
Direct speech-to-speech translation achieves high-quality results through the introduction of discrete units obtained from self-supervised learning. This approach circumvents delays and cascading errors associated with model cascading.…
The topology between Bloch states in reciprocal space has attracted tremendous attention in recent years. The quantum geometry of the band structure is composed of quantum metric as real part and berry curvature as imaginary part. While the…
Multi-modal Contrastive Representation learning aims to encode different modalities into a semantically aligned shared space. This paradigm shows remarkable generalization ability on numerous downstream tasks across various modalities.…
Two-dimensional (2D) material heterostructures have attracted considerable attention owing to their interesting and novel physical properties, which expand the possibilities for future optoelectronic, photovoltaic, and nanoelectronic…
Two-dimensional charge density wave (CDW) materials received much attention for high responsivity and broadband photodetection in recent years, due to their collective electron transport and narrow bandgap. However, the high dark current…
Artificial visual systems (AVS) have gained tremendous momentum because of its huge potential in areas such as autonomous vehicles and robotics as part of artificial intelligence (AI) in recent years. However, current machine visual systems…
3D visual grounding aims to localize the target object in a 3D point cloud by a free-form language description. Typically, the sentences describing the target object tend to provide information about its relative relation between other…
3D visual grounding involves finding a target object in a 3D scene that corresponds to a given sentence query. Although many approaches have been proposed and achieved impressive performance, they all require dense object-sentence pair…
Speech Recognition builds a bridge between the multimedia streaming (audio-only, visual-only or audio-visual) and the corresponding text transcription. However, when training the specific model of new domain, it often gets stuck in the lack…
Direct speech-to-speech translation (S2ST) aims to convert speech from one language into another, and has demonstrated significant progress to date. Despite the recent success, current S2ST models still suffer from distinct degradation in…
Photo-Thermoelectric (PTE) response is usually one of the main working mechanisms for photodetectors. However, as another fast and easier way to measure thermoelectric characteristics of materials, it can also reveal important physics such…
2D ferroelectric \{beta}-InSe/graphene heterostructure was fabricated by mechanical exfoliation, and the carrier dynamics crossing the heterostructure interface has been systematically investigated by Raman, photoluminescence and transient…