I am now Senior Staff Researcher and Media Analytics Reseach Group Lead at the Core Networks Research, FutureWei (Huawei) Technology , Bridgewater, New Jersey, USA, conducting research in video analytics, web scale video fingerprint and identification, Compact Descriptors for Visual Search (CDVS) for MPEG, as well as wireless video communication, network measurement, tomography and cache/bandwidth management optimization in Content Delivery Networks (CDN).
My research interests are in:
1) Signal/Image Analysis and Machine Learning : image/video analytics, audio processing, machine leanring and their applications in image/video/music search and mining, video action and event recognition,
2) Large Scale Multimedia Search and Mining : subspace indexing on Grassmann manifold, Grassmann hashing, large scale video repository duplicate detection, search by sing-along over large scale music respository (over 1.5 million songs), location and product search via image repository mining.
Please check out our mobile product search-by-capture demo
3) Next Gen Video Networking : space-time coding for providing embedded channels for scalable video broadcasting, joint fountain and layered video coding, spatio-temporal video quality metric for scalable video modeling, network optimization and distributed resource coordinating in video over mutli-access, broadcasting and mesh networks. Recently edited Springer-Verlag book (with Changwen Chen and Shiguo Lian) Intelligent Video Communication: Techniques and Applications
Exciting recent results in large repository video duplicate detection, achieving 98% precision on 100% recall for a wide range of query corruptions, having duplicate localization within 120 frames in sequence, while having a response time of 0.003 sec for an 120 hour repository. We will demonstrate that we can achieve the same performance for 10,000 hour video within 0.1 sec. see my recent talk (slides) at MSRA for more detail.
My research projects are currently supported by grants from Microsoft Research Asia, Hong Kong RGC/GRF, and PolyU, which are graciously acknowledged here.
Yin Yuan, PhD Student, MS/MPhil, HKUST, Video Networking, Source-Channel Coding and Optimization
Bo Liu, Research Assistant, MS, USTC, Machine Learning, Multimedia Search. Now pursuing PhD at Rutgers University.
Hao Xu, Research Assistant, BS/MS, USTC, Intern at MSRA, Large Scale Image Repository Search and Mining.
Zhi Ye and Xin Chen, Parttime Research Assistant, MS, HK PolyU, Mobile Location and Product Search, SIFT indexing and fast search.
Xinchao Wang, FYP student, valedictorian, COMP 2010 class, Machine Learning, Point Set Topology Indexing. Now PhD Student at EPFL.
Post-Doc/PhD/Research Assistant Recruiting: support is available for motivated
students in areas of video analysis, video search and mining, video communication and networking. Please see my
related research projects for more detail. See the current opening for more detail.
Also there is a Hong Kong PhD Fellowship Program for outstanding candidates.
Some highlights of recent projects and selected publications (book chapters/journal papers and submissions in purple):
Multimedia Computing:
Currently I am interested in spatio-temporal appearance modeling, piece-wise linear approximation of non-linear
appearance manifolds, with query driven and/or global structures, and their applications in multimedia computing problems.
[1] Visual Pattern Recognition and Biometrics:
appearance manifold modeling, local embeddings, diffusions, model localization with piece-wise linear model
approximation of a global non-linear manifold, with application in face recognition, head-pose estimation and
video search metrics.
X. Wang, Z. Li, and D. Tao, "Subspace Indexing on Grassmann Manifold for Large Scale Multimedia Retrieval", in press, IEEE Trans. on Image Processing. , ppt
H. Zheng, Z. Li, Yun Fu,
"Efficient Human Action Recognition by Luminance Field Trajectory and Geometry Information",
IEEE Int'l Conf on Multimedia & Expo, New York, USA, 2009.
Z. Li, Yun Fu, Shuicheng Yan, and Thomas S. Huang,
"Real-Time Human Action Recognition by Luminance Field Trajectory Analysis",
ACM Multimedia, Vancouver, Canada, 2008.
Yun Fu, Z. Li, J. Yuan, Ying Wu, and Thomas S. Huang,
"Locality vs. Globality: Query-Driven Localized Linear Models for Facial Image Computing,"
IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), vol. 18(12), pp. 1741-1752, December, 2008.
Y. Fu, Z. Li, T. S. Huang, and A. K. Katsaggelos,
"Locally adaptive subspace and similarity metric learning for visual data clustering and retrieval",
Computer Vision and Image Understanding (CVIU), vol. 110(3), pp. 390-402, June, 2008.
Z. Li, Yun Fu, Junsong Yuan, T. S. Huang , and Ying Wu, "Query Driven Local Linear Discriminant Models for Head
Pose Estimation", Best Paper Candidate from HCI track,
Proc. of IEEE Intl. Conf on Multimedia & Expo (ICME), Beijing, China, 2007.
Y. Fu, J. Yuan, Z. Li, T. S. Huang, and Y. Wu,
"Query-Driven Locally Adaptive Fisher Faces and Expert-Model for Face Recognition", oral paper,
IEEE Intl. Conference on Image Processing (ICIP), San Antonio, USA, 2007.
Y. Fu, Z. Li, X. Zhou, and T. S. Huang, "Laplacian Affinity Propagation For Semi-Supervised Object
Classification", Best Paper Award (DoCoMo Innovation Paper),
IEEE Intl. Conference on Image Processing (ICIP) , San Antonio, USA, 2007.
[2] Image/Video Search and Mining : repeated clip search and mining with the LUminance Field Trajectory (LUFT)
modeling, scalability in searching large video repositories, SIFT based image similarity search. Our LUFT based repated
video clip searching achieves very high performance in speed (0.012sec to search an 5-hour collection) and precision-recall
(100% vs 96%), see a report,
B. Liu, Z. Li, L. Lin, and M. Wang, "Real-Time Video Copy Location Detection in Large Scale Repository ", accepted, IEEE Multimedia., ppt
Z. Li, Y. Fu, J. Yuan, Y. Wu, A. K. Katsaggelos and T. S. Huang,
"Multimedia Data Indexing", book chapter in Semantic Mining Technologies for Multimedia Databases, Ed. D. Tao, D. Xu, and X. Li, IGI Publishing, to appear, 2008.
L. Gao, Z. Li, and A. K. Katsaggelos,
"Luminance Filed Trajectory Based Video Indexing and Searching", accepted, IEEE Trans. On Circuits & Sys. For Video Tech.
J. Yuan, Z. Li, Y. Fu, Y. Wu, and T. S. Huang,
"Common Spatial Pattern Discovery by Efficient Candidate Pruning", oral paper,
IEEE Intl. Conference on Image Processing (ICIP) , San Antonio, USA, 2007.
Z. Li, L. Gao, and A. K. Katsaggelos, "Locally Embedded Linear Spaces for Efficient Video Shot
Indexing and Retrieval", Best Poster Paper Award, Proceedings of IEEE Intl. Confernece
on Multimedia & Expo (ICME), Toronto, Canada, 2006.
L. Gao, Z. Li, and A. K. Katsaggelos, "Fast Video Shot Retrieval with Luminance Field Trace Indexing and Geometry Matching",
Proc of IEEE Int'l Conf on Image Processing (ICIP), 2006.
Z. Li, A. K. Katsaggelos, and B. Bandhi,
"Fast Video Shot Segmentation and Retrieval Based on Trace Geometry in Principal Component Space",
IEE Proceedings on Vision, Image and Signal Processing, pp. 367-373, vol. 152(3), May, 2005.
Multimedia Communication:
Adances in video signal processing and coding give us a rich set of video coding and adaptation tools with associated quality metrics,
how to utilize these tools and metrics, and integrate with underlying network engineering elements like channel and network coding, routing and
resource allocation and optimization solutions, developing a distributed, scalable and adaptive content delivery network solution, are my interests.
[1] Internet Video Delivery: utility gradient driven scheduling, P2P, content-aware, source-channel coding, elasticity and R-D optimization
Ying Li, Z. Li, Mung Chiang and A. Robert Calderbank,
"Content-Aware Distortion Fair Video Streaming in Congested Networks", in press,
IEEE Trans. on Multimedia, 2009.
Ying Li, Z. Li, Mung Chiang and A. Robert Calderbank,
"Video Transmission Scheduling for Peer-to-Peer Live Streaming System", oral paper,
Proceedings of IEEE International Conference on Multimedia & Expo (ICME), Hanover, Germany, 2008.
Ying Li, Z. Li, Mung Chiang, and A. Robert Calderbank,
"Content Aware Distortion-Fair Video Streaming in Networks",
Proc of IEEE GLOBECOM, New Orleans, USA, 2008.
Z. Li, J. Huang, and A. K. Katsaggelos,
"Utility Driven Video Segment Scheduling for Peer-to-Peer Live Video Streaming System",
Proc of 45th Allerton Conference on Communication, Control and Computing, Monticello, IL, USA, 2007.
[2] Multi-Access Multimedia Networking: pricing model on resource allocation, distributed coordination for outer loop
control, while source coding/adaptation R-D optimization in the inner loop:
Y. Yang, Z. Li, W. Shi, Y. Chen, and H. Xu,
"Cross-Layer Optimization for State Update in Mobile Gaming",
IEEE Trans. on Multimedia, vol. 10(5), pp. 701-710, August, 2008.
J. Huang, Z. Li, M. Chiang, and A. K. Katsaggelos,
"Joint Source Adaptation and Resource Allocation for Multi-User Wireless Video Streaming",
IEEE Trans. on Circuits & System for Video Tech, vol. 18 (5), pp. 582-595, May, 2008.
F. Zhai, Z. Li and A. K. Katsaggelos,
"Joint Source-Channel Coding for Multi-User Wireless Video Communication",
Proc of IEEE Intl. Conf on Multimedia & Expo (ICME), Beijing, China, 2007.
Z. Li, J. Huang, and A. K. Katsaggelos,
"Pricing based collaborative multi-user video streaming over power constrained wireless down link", oral paper,
Proceedings of IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP) , Toulouse, France, 2006.
Z. Li, J. Huang, M. Chiang, and A. K. Katsaggelos,
"Intelligent Wireless Video Communication: Source Adaptation and Multi-User Collaboration", invited paper,
special issue on Multimedia Communication, Ed. Changwen Chen, China Journal of Communication, December, 2006.
Z. Li; Alan Q. Cheng; Aggelos K. Katsaggelos; Faisal Ishtiaq,
"Video Summarization and Transmission Power Adaptation for Very Low Bit Rate Multiuser Wireless Uplink Video Communication",
Proc of IEEE Int'l Workshop on Multimedia Signal Processing (MMSP), 2005.
[3] Wireless Video:
video source adaptation, cross-layer optimization, source-channel coding, resource allocation and collaboration,
local relays, wireless P2P, energy efficiency.
W. Ji, Z. Li, and Y.-Q. Chen, "Joint Source-Channel Coding and Optimization for Layered Video Broadcasting to Heterogeneous Devices", in press, IEEE Trans on Multimedia, 2011.
Z. Li, Ying Li, Mung Chiang and A. Robert Calderbank,
"Optimal Transmission Scheduling For Scalable Wireless Video Broadcast with Rateless Erasure Correction Code",
Proc of IEEE Consumer Communication and Networking Conference (CCNC), Las Vegas, USA, 2009.
Ying Li, Z. Li, Mung Chiang and A. Robert Calderbank,
"Energy-Efficient Video Transmission Scheduling for Wireless Peer-to-Peer Live Streaming",
Proc of IEEE Consumer Communication and Networking Conference (CCNC), Las Vegas, USA, 2009.
Z. Li, F. Zhai, and A. K. Katsaggelos,
"Joint Video Summarization and Transmission Adaptation for Energy Efficient Wireless Streaming",
EURASIP Journal on Advances in Signal Processing, special issue on Wireless Video, vol. 2008, May, 2008.
[4] Video Summarization & VLBR Video Streaming: Frame drop distortion metrics, Viterbi algorithm based frame drop optimization to minimize frame drop distortions,
with applications in VLBR video streaming (e.g. QCIF "foreman" sequence streaming at 18kbps, demo available on request).
Z. Li, A. K. Katsaggelos, G. Schuster and B. Gandhi,
"Rate-Distortion Optimal Video Summary Generation",
IEEE Trans. on Image Processing, pp. 1550-1560, vol. 14, no. 10, October, 2005.
Z. Li, G. Schuster, A. K. Katsaggelos,
"MINMAX Optimal Video Summarization and Coding", special issue on Analysis & Understanding for Media Adaptation,
IEEE Trans. on Circuits and System for Video Technology, pp. 1245-1256, vol. 15, no. 10, October, 2005.
Z. Li, G. M. Schuster, and A. K. Katsaggelos,
"Video summarization for multiple path communication",
Proceedings of IEEE Intl. Conference on Image Processing (ICIP), Geona, Italy, 2005.
Z. Li, A. K. Katsaggelos, and G. M. Schuster,
"Rate-Distortion Optimal Video Summarization", book chapter in
Intelligent Multimedia Processing with Soft Computing, pp. 171-204, editors: Y.P. Tan, K. H. Yap, and L. Wang,, Springer-Verlag, Heidelberg, 2004.