Jun Xiao

Singapore
293 followers · 262 connections

About

I currently work as a researcher and engineer at Zoom. Before joining Zoom, I…


Experience & Education

  • Zoom


Publications

  • Multi-scale Sampling and Aggregation Network For High Dynamic Range Imaging

arXiv

    High dynamic range (HDR) imaging is a fundamental problem in image processing, which aims to generate well-exposed images, even in the presence of varying illumination in the scenes. In recent years, multi-exposure fusion methods have achieved remarkable results, which merge multiple low dynamic range (LDR) images, captured with different exposures, to generate corresponding HDR images. However, synthesizing HDR images in dynamic scenes is still challenging and in high demand. There are two challenges in producing HDR images: 1). Object motion between LDR images can easily cause undesirable ghosting artifacts in the generated results. 2). Under and overexposed regions often contain distorted image content, because of insufficient compensation for these regions in the merging stage. In this paper, we propose a multi-scale sampling and aggregation network for HDR imaging in dynamic scenes. To effectively alleviate the problems caused by small and large motions, our method implicitly aligns LDR images by sampling and aggregating high-correspondence features in a coarse-to-fine manner. Furthermore, we propose a densely connected network based on discrete wavelet transform for performance improvement, which decomposes the input into several non-overlapping frequency subbands and adaptively performs compensation in the wavelet domain. Experiments show that our proposed method can achieve state-of-the-art performances under diverse scenes, compared to other promising HDR imaging methods. In addition, the HDR images generated by our method contain cleaner and more detailed content, with fewer distortions, leading to better visual quality.

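As a rough illustration of the wavelet decomposition this paper builds on (a minimal sketch, not the paper's implementation), a one-level 2D Haar transform splits an image into four non-overlapping frequency subbands (LL, LH, HL, HH) and can be inverted exactly:

```python
import numpy as np

def haar_dwt2(x):
    # one-level 2D Haar transform: rows first, then columns
    a = (x[0::2, :] + x[1::2, :]) / 2.0   # vertical average
    d = (x[0::2, :] - x[1::2, :]) / 2.0   # vertical difference
    ll = (a[:, 0::2] + a[:, 1::2]) / 2.0  # low-low subband
    lh = (a[:, 0::2] - a[:, 1::2]) / 2.0  # low-high subband
    hl = (d[:, 0::2] + d[:, 1::2]) / 2.0  # high-low subband
    hh = (d[:, 0::2] - d[:, 1::2]) / 2.0  # high-high subband
    return ll, lh, hl, hh

def haar_idwt2(ll, lh, hl, hh):
    # exact inverse of haar_dwt2
    h, w = ll.shape
    a = np.empty((h, 2 * w)); d = np.empty((h, 2 * w))
    a[:, 0::2] = ll + lh; a[:, 1::2] = ll - lh
    d[:, 0::2] = hl + hh; d[:, 1::2] = hl - hh
    x = np.empty((2 * h, 2 * w))
    x[0::2, :] = a + d
    x[1::2, :] = a - d
    return x
```

The paper's network performs learned compensation on such subbands before reconstruction; here the transform alone is shown for clarity.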
  • Online Video Super-Resolution with Convolutional Kernel Bypass Graft

arXiv

    Deep learning-based models have achieved remarkable performance in video super-resolution (VSR) in recent years, but most of these models are less applicable to online video applications. These methods solely consider the distortion quality and ignore crucial requirements for online applications, e.g., low latency and low model complexity. In this paper, we focus on online video transmission, in which VSR algorithms are required to generate high-resolution video sequences frame by frame in real time. To address such challenges, we propose an extremely low-latency VSR algorithm based on a novel kernel knowledge transfer method, named convolutional kernel bypass graft (CKBG). First, we design a lightweight network structure that does not require future frames as inputs and saves extra time costs for caching these frames. Then, our proposed CKBG method enhances this lightweight base model by bypassing the original network with ``kernel grafts'', which are extra convolutional kernels containing the prior knowledge of external pretrained image SR models. In the testing phase, we further accelerate the grafted multi-branch network by converting it into a simple single-path structure. Experiment results show that our proposed method can process online video sequences up to 110 FPS, with very low model complexity and competitive SR performance.

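The conversion of the grafted multi-branch network into a single-path structure relies on the linearity of convolution: the outputs of parallel branches sharing an input can be reproduced by a single convolution whose kernel is the sum of the branch kernels. A minimal sketch of that principle (illustrative kernels, not the paper's CKBG layers):

```python
import numpy as np

def conv2d(x, k):
    # plain 'valid' 2D cross-correlation with a single-channel kernel
    kh, kw = k.shape
    H, W = x.shape
    out = np.zeros((H - kh + 1, W - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(x[i:i + kh, j:j + kw] * k)
    return out

def merge_graft(base_kernel, graft_kernel):
    # convolution is linear in the kernel, so two parallel branches
    # collapse into one kernel: conv(x, k1) + conv(x, k2) == conv(x, k1 + k2)
    return base_kernel + graft_kernel
```

This is why the grafted model can run as a simple single-path network at test time with no change in output.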
  • Progressive and Selective Fusion Network for High Dynamic Range Imaging

The 29th ACM International Conference on Multimedia

    This paper considers the problem of generating an HDR image of a scene from its LDR images. Recent studies employ deep learning and solve the problem in an end-to-end fashion, leading to significant performance improvements. However, it is still hard to generate a good quality image from LDR images of a dynamic scene captured by a hand-held camera, e.g., occlusion due to the large motion of foreground objects, causing ghosting artifacts. The key to success relies on how well we can fuse the input images in their feature space, where we wish to remove the factors leading to low-quality image generation while performing the fundamental computations for HDR image generation, e.g., selecting the best-exposed image/region. We propose a novel method that can better fuse the features based on two ideas. One is multi-step feature fusion; our network gradually fuses the features in a stack of blocks having the same structure. The other is the design of the component block that effectively performs two operations essential to the problem, i.e., comparing and selecting appropriate images/regions. Experimental results show that the proposed method outperforms the previous state-of-the-art methods on the standard benchmark tests.

  • Self-feature Learning: An Efficient Deep Lightweight Network for Image Super-resolution

The 29th ACM International Conference on Multimedia

    Deep learning-based models have achieved unprecedented performance in single image super-resolution (SISR). However, existing deep learning-based models usually require high computational complexity to generate high-quality images, which limits their applications in edge devices, e.g., mobile phones. To address this issue, we propose a dynamic, channel-agnostic filtering method in this paper. The proposed method not only adaptively generates convolutional kernels based on the local information of each position, but also can significantly reduce the cost of computing the inter-channel redundancy. Based on this, we further propose a simple, yet effective, deep lightweight model for SISR. Experiment results show that our proposed model outperforms other state-of-the-art deep lightweight SISR models, leading to the best trade-off between the performance and the number of model parameters.

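The paper's channel-agnostic dynamic filtering is its own design; as a rough illustration of where lightweight SR models save parameters, compare a standard convolution with a depthwise-separable one (a common lightweight baseline, used here only to show the parameter arithmetic):

```python
def conv_params(c_in, c_out, k):
    # standard convolution: one k x k filter per (input, output) channel pair
    return c_in * c_out * k * k

def depthwise_separable_params(c_in, c_out, k):
    # one depthwise k x k filter per input channel, then 1 x 1 pointwise mixing
    return c_in * k * k + c_in * c_out

standard = conv_params(64, 64, 3)                   # 64*64*9  = 36864
separable = depthwise_separable_params(64, 64, 3)   # 576 + 4096 = 4672
```

For a 64-channel 3x3 layer this is roughly an 8x reduction, which is the kind of budget that makes sub-300K-parameter SR models feasible.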
  • Balanced distortion and perception in single-image super-resolution based on optimal transport in wavelet domain

    Neurocomputing

    Single image super-resolution (SISR) is a classic ill-posed problem in computer vision. In recent years, deep-learning-based (DL-based) models have achieved promising results with the SISR problem. However, most existing methods suffer from an intrinsic trade-off between distortion and perceptual quality. To satisfy the requirements in different real-world situations, the balance of distortion and visual quality for image super-resolution is a critical issue. In DL-based models, the uses of hybrid loss (i.e., the combination of the distortion loss and the perceptual loss) and network interpolation are two common approaches to balancing the distortion and perceptual quality of super-resolved images. However, these two kinds of methods lack flexibility and hold strict constraints on network architectures. In this paper, we propose an image-fusion interpolation method for image super-resolution, which can balance the distortion and visual quality of super-resolved images, based on the optimal transport theory in the wavelet domain. The advantage of our proposed method is that it can be applied to any pretrained DL-based model, without any requirement from the network architecture and parameters. In addition, our proposed method is parameter-free and can run fast without using a GPU. Compared with existing state-of-the-art SISR methods, experiment results show that our proposed method can achieve a better balance between the distortion and visual quality in super-resolved images.

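The paper fuses wavelet coefficients of two pretrained models via optimal transport; as a minimal one-dimensional sketch of the underlying idea (sample data and the blend parameter are illustrative), the 1D optimal transport map pairs sorted samples, so displacement interpolation between two empirical distributions blends their order statistics:

```python
import numpy as np

def ot_interpolate_1d(a, b, t):
    # 1-D optimal transport: the monotone (Monge) map pairs the i-th
    # smallest sample of a with the i-th smallest sample of b, so the
    # displacement interpolation at t blends sorted samples directly.
    a_sorted = np.sort(a)
    b_sorted = np.sort(b)
    return (1.0 - t) * a_sorted + t * b_sorted
```

Varying t traces a path between the two distributions, which is the mechanism that lets a single parameter trade distortion against perceptual quality.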
  • Invertible image decolorization

IEEE Transactions on Image Processing

    Invertible image decolorization is a useful color compression technique to reduce the cost in multimedia systems. Invertible decolorization aims to synthesize faithful grayscales from color images, which can be fully restored to the original color version. In this paper, we propose a novel color compression method to produce invertible grayscale images using invertible neural networks (INNs). Our key idea is to separate the color information from color images, and encode the color information into a set of Gaussian distributed latent variables via INNs. By this means, we force the color information lost in grayscale generation to be independent of the input color image. Therefore, the original color version can be efficiently recovered by randomly re-sampling a new set of Gaussian distributed variables, together with the synthetic grayscale, through the reverse mapping of INNs. To effectively learn the invertible grayscale, we introduce the wavelet transformation into a UNet-like INN architecture, and further present a quantization embedding to prevent the information omission in format conversion, which improves the generalizability of the framework in real-world scenarios. Extensive experiments on three widely used benchmarks demonstrate that the proposed method achieves a state-of-the-art performance in terms of both qualitative and quantitative results, which shows its superiority in multimedia communication and storage systems.

  • Bayesian sparse hierarchical model for image denoising

    Signal Processing: Image Communication

Sparse models and their variants have been extensively investigated, and have achieved great success in image denoising. Compared with recently proposed deep-learning-based methods, sparse models have several advantages: (1) Sparse models do not require a large number of pairs of noisy images and the corresponding clean images for training. (2) The performance of sparse models is less reliant on the training data, and the learned model can be easily generalized to natural images across different noise domains. In sparse models, the L0 norm penalty makes the problem highly non-convex and difficult to solve. Instead, the L1 norm penalty is commonly adopted as a convex relaxation, which corresponds to the Laplacian prior from the Bayesian perspective. However, many previous works have revealed that L1 norm regularization causes a biased estimation for the sparse code, especially for high-dimensional data, e.g., images. In this paper, instead of using the L1 norm penalty, we employ an improper prior in the sparse model and formulate a hierarchical sparse model for image denoising. Compared with other competitive methods, experiment results show that our proposed method achieves a better generalization for images with different characteristics across various domains, and achieves state-of-the-art performance for image denoising on several benchmark datasets.

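The bias the abstract refers to is easy to see in the proximal operator of the L1 penalty (the MAP step under a Laplacian prior): soft-thresholding subtracts the threshold from every surviving coefficient, so large coefficients are shrunk by the same amount as small ones. A minimal sketch (not the paper's hierarchical estimator, which is designed to avoid this bias):

```python
import numpy as np

def soft_threshold(x, lam):
    # proximal operator of lam * ||x||_1: zeroes small entries and
    # shrinks every surviving entry toward zero by exactly lam
    return np.sign(x) * np.maximum(np.abs(x) - lam, 0.0)
```

For example, a coefficient of 5.0 with lam = 0.5 becomes 4.5: the estimate stays biased by lam no matter how strong the signal is.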
  • Deep multi-task learning for facial expression recognition and synthesis based on selective feature sharing

    25th IEEE International Conference on Pattern Recognition (ICPR)

Multi-task learning is an effective learning strategy for deep-learning-based facial expression recognition tasks. However, most existing methods give limited consideration to feature selection when transferring information between different tasks, which may lead to task interference when training multi-task networks. To address this problem, we propose a novel selective feature-sharing method, and establish a multi-task network for facial expression recognition and facial expression synthesis. The proposed method can effectively transfer beneficial features between different tasks, while filtering out useless and harmful information. Moreover, we employ the facial expression synthesis task to enlarge and balance the training dataset to further enhance the generalization ability of the proposed method. Experimental results show that the proposed method achieves state-of-the-art performance on those commonly used facial expression recognition benchmarks, which makes it a potential solution to real-world facial expression recognition problems.

  • Progressive Motion Representation Distillation With Two-Branch Networks for Egocentric Activity Recognition

IEEE Signal Processing Letters

    Video-based egocentric activity recognition involves fine-grained spatio-temporal human-object interactions. State-of-the-art methods, based on the two-branch-based architecture, rely on pre-calculated optical flows to provide motion information. However, this two-stage strategy is computationally intensive, storage demanding, and not task-oriented, which hampers it from being deployed in real-world applications. Albeit there have been numerous attempts to explore other motion representations to replace optical flows, most of the methods were designed for third-person activities, without capturing fine-grained cues. To tackle these issues, in this letter, we propose a progressive motion representation distillation (PMRD) method, based on two-branch networks, for egocentric activity recognition. We exploit a generalized knowledge distillation framework to train a hallucination network, which receives RGB frames as input and produces motion cues guided by the optical-flow network. Specifically, we propose a progressive metric loss, which aims to distill local fine-grained motion patterns in terms of each temporal progress level. To further enforce the proposed distillation framework to concentrate on those informative frames, we integrate a temporal attention mechanism into the metric loss. Moreover, a multi-stage training procedure is employed for the efficient learning of the hallucination network. Experimental results on three egocentric activity benchmarks demonstrate the state-of-the-art performance of the proposed method.

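The paper's progressive metric loss and temporal attention are specific to its architecture; as a generic sketch of the underlying idea (all names and shapes here are illustrative, not the paper's formulation), a temporally weighted feature-distillation loss matches the hallucination (student) features to the optical-flow (teacher) features per frame, with attention weights concentrating the loss on informative frames:

```python
import numpy as np

def distillation_loss(student_feat, teacher_feat, frame_scores):
    # student_feat, teacher_feat: (T, D) per-frame feature vectors
    # frame_scores: (T,) attention logits over frames
    per_frame = np.mean((student_feat - teacher_feat) ** 2, axis=1)
    w = np.exp(frame_scores) / np.sum(np.exp(frame_scores))  # softmax attention
    return float(np.sum(w * per_frame))
```

Frames the attention emphasizes dominate the loss, so the student is pushed hardest to mimic motion cues where they matter most.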
  • Deep Progressive Convolutional Neural Network for Blind Super-Resolution With Multiple Degradations

    IEEE International Conference on Image Processing (ICIP), 2019.

    Blind super-resolution (SR) of blurry and noisy low-resolution (LR) images is still a challenging problem in single image super-resolution (SISR). The performance of most existing convolutional neural network (CNN)-based models is inevitably degraded when LR images are corrupted by both blur and noise. For those blind SR methods based on kernel estimation, accurate estimation is barely attained under complex degradations and this gives rise to poor-quality results. To address these problems, we propose a deep progressive network under a probabilistic framework and a novel up-sampling method for blind super-resolution with multiple degradations, which effectively utilizes image priors across scales. Experimental results show that the proposed method achieves promising performance on images with multiple degradations.


Projects

  • Machine Learning Algorithms for Financial Applications

    - Present

1. Apply advanced deep learning models (e.g., RNN-based, LSTM-based, and transformer-based models) to forecast stock returns based on the CSI300 data in the China A-Shares market.
    2. Proposed Bayesian state-space models for pairs trading. The methods are robust to non-Gaussian noise and adaptively estimate the spread between the selected assets.
    3. Proposed sparse representation models for financial index tracking. The advanced sparse algorithms, e.g., re-weighted L1-norm approximation and the minimax concave penalty, are applied to improve the tracking performance.
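The re-weighted L1 approximation mentioned in item 3 can be sketched as iterative soft-thresholding with weights refreshed from the current solution (all data, penalties, and iteration counts here are illustrative, not the project's actual algorithm):

```python
import numpy as np

def reweighted_l1_tracking(R, y, lam=0.1, eps=1e-2, outer=5, inner=200):
    # sketch: min_w ||R w - y||^2 + lam * sum_i c_i |w_i|,
    # with weights c_i = 1 / (|w_i| + eps) refreshed each outer round
    # (small |w_i| -> large c_i, so near-zero weights are pushed to zero,
    # approximating the L0 penalty better than plain L1)
    n = R.shape[1]
    w = np.zeros(n)
    c = np.ones(n)
    step = 1.0 / np.linalg.norm(R, 2) ** 2   # 1 / spectral norm squared
    for _ in range(outer):
        for _ in range(inner):
            grad = R.T @ (R @ w - y)          # gradient of 0.5*||Rw - y||^2
            z = w - step * grad
            thr = lam * step * c
            w = np.sign(z) * np.maximum(np.abs(z) - thr, 0.0)
        c = 1.0 / (np.abs(w) + eps)
    return w
```

In index tracking, R holds constituent returns and y the index returns; the re-weighting drives most portfolio weights to exactly zero while keeping the tracking residual small.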

  • High Dynamic Range (HDR) Imaging With Large-scale Motion


1. The ghosting artifacts and corrupted content caused by object motions are challenging issues for HDR imaging.
    2. Proposed a progressive feature fusion scheme for deep learning models which can effectively generate ghost-free HDR images. The proposed method can achieve 44.06 dB in terms of PSNR, which significantly outperforms the baseline method by 1.35 dB.
    3. Proposed a sampling and aggregation network for HDR imaging in the wavelet domain. The method hierarchically selects similar image patches from multi-scale spaces and then aggregates them for motion alignment. In addition, wavelet transform is adopted for feature fusion, which can effectively restore the corrupted contents. The performance can be up to 44.38 dB, which is 1.68 dB higher than the baseline. (Submitted to TMM, 2022)
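The dB figures above are PSNR values; for reference, the metric is defined from the mean squared error against a ground-truth image (a standard definition, sketched here for images scaled to [0, peak]):

```python
import numpy as np

def psnr(x, y, peak=1.0):
    # peak signal-to-noise ratio in dB between images x and y in [0, peak]
    mse = np.mean((x.astype(np.float64) - y.astype(np.float64)) ** 2)
    return 10.0 * np.log10(peak ** 2 / mse)
```

Because the scale is logarithmic, a gain such as +1.35 dB corresponds to roughly a 27% reduction in mean squared error.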

  • Deep Lightweight Image Super-resolution (SR) Models


1. Existing deep image SR models require high computational complexity and memory consumption, making them less applicable in resource-constrained devices, e.g., mobile phones, personal computers, etc.
2. Proposed a feature compression algorithm based on the knowledge-distillation module. Compared with the benchmark, e.g., EDSR (1,370K, 26.07dB), the proposed method can reduce the model parameters by 50% and achieve comparable performance (ours: 690K, 25.89dB). (Published in ICASSP, 2021)
3. Designed a lightweight, spatially variant convolutional kernel, which significantly reduces the model complexity by 78%. Compared with other lightweight models, the proposed model can achieve the best performance, with only 264K model parameters. (Published in ACM MM, 2021)

  • The Distortion-perception Trade-off for Image Super-resolution


    1. Deep image SR models have a problem with generating over-smoothed images, which results in low perceptual quality. Although GAN-based methods may effectively synthesize texture information, the distorted content is a major concern. For real-world applications, balancing the distortion-perception trade-off is still a necessary and significant problem.
2. Proposed an efficient image fusion algorithm based on optimal transport theory in the wavelet domain, which can effectively maintain the distortion quality and improve the perceptual quality by 50% on the Set14 dataset. In addition, the average running time is reduced from 5.6 hours to 3.6 seconds, without GPU requirements. (Published in Neurocomputing, 2021)

Honors & Awards

  • Stars of Tomorrow Internship

Microsoft Research Asia

Languages

  • Mandarin

    Native or bilingual proficiency

  • Cantonese

    Native or bilingual proficiency

  • English

    Professional working proficiency
