Publication


The best is yet to come

I am interested in computer vision related research. My most updated publication list is in google scholar.



2021

Poster: Video Question Answering with Phrases via Semantic Roles. Arka Sadhu, Kan Chen, Ram Nevatia, North American Chapter of the Association for Computational Linguistics (NAACL), Annual Conference of, 2021 [PDF][Code]

Poster: FBNetV3: Joint Architecture-Recipe Search using Neural Acquisition Function. Xiaoliang Dai*, Alvin Wan*, Peizhao Zhang*, Bichen Wu, Zijian He, Zhen Wei, Kan Chen, Yuandong Tian, Matthew Yu, Peter Vajda, Joseph E. Gonzalez, Computer Vision and Pattern Recognition (CVPR), IEEE/CVF Conference on, 2021 [PDF]

Poster: Unbiased Teacher for Semi-Supervised Object Detection. Yen-Cheng Liu, Chih-Yao Ma, Zijian He, Chia-Wen Kuo, Kan Chen, Peizhao Zhang, Bichen Wu, Zsolt Kira, Peter Vajda, Learning Representations (ICLR), International Conference on, 2021 [PDF]


2020

Poster: Video Object Grounding using Semantic Roles in Language Description. Arka Sadhu, Kan Chen, Ram Nevatia, Computer Vision and Pattern Recognition (CVPR), IEEE/CVF Conference on, 2020 [PDF][Code]

Poster: FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions. Alvin Wan*, Xiaoliang Dai*, Peizhao Zhang, Zijian He, Yuandong Tian, Saining Xie, Bichen Wu, Matthew Yu, Tao Xu, Kan Chen, Peter Vajda, Joseph E. Gonzalez, Computer Vision and Pattern Recognition (CVPR), IEEE/CVF Conference on, 2020 [PDF][Code]

Poster: CPARR: Category-based Proposal Analysis for Referring Relationships. Chuanzi He, Haidong Zhu, Jiyang Gao, Kan Chen, Ram Nevatia, Computer Vision and Pattern Recognition Workshop (CVPRW), IEEE/CVF Conference on, 2020 [PDF]


2019

Oral: Zero-Shot Grounding of Objects from Natural Language Queries. Arka Sadhu, Kan Chen, Ram Nevatia, Computer Vision (ICCV), IEEE/CVF International Conference on, 2019 [PDF][Code]

ArXiv: Billion-scale Semi-supervised Learning for Image Classification. Zeki Yalniz, Hervé Jégou, Kan Chen, Manohar Paluri, Dhruv Mahajan, ArXiv, 2019 [PDF][Code]

Thesis: Multimodal Reasoning of Visual Information and Natural Language. Kan Chen, USC Digital Library, 2019 [PDF]

Oral: MAC: Mining Activity Concepts for Language-based Temporal Localization. Runzhou Ge, Jiyang Gao, Kan Chen, Ram Nevatia, Applications of Computer Vision (WACV), IEEE Winter Conference on, 2019 [PDF][Code]


2018

Poster: CTAP: Complementary Temporal Action Proposal Generation. Kan Chen*, Jiyang Gao*, Ram Nevatia, Computer Vision (ECCV), European Conference on, 2018 [PDF][Code]

Best paper: Visually Indicated Sound Generation by Perceptually Optimized Classification. Kan Chen*, Chuanxi Zhang*, Chen Fang, Zhaowen Wang, Trung Bui, Ram Nevatia, Computer Vision Workshop (ECCVW), European Conference on, 2018 [PDF][Code]

Poster: Knowledge Aided Consistency for Weakly Supervised Phrase Grounding. Kan Chen, Jiyang Gao, Ram Nevatia, Computer Vision and Pattern Recognition (CVPR), IEEE/CVF Conference on, 2018 [PDF][Supplementary][Code]

Poster: Motion-Appearance Co-Memory Networks for Video Question Answering. Jiyang Gao*, Runzhou Ge*, Kan Chen, Ram Nevatia, Computer Vision and Pattern Recognition (CVPR), IEEE/CVF Conference on, 2018 [PDF]


2017

Journal: MSRC: Multimodal Spatial Regression with Semantic Context for Phrase Grounding. Kan Chen, Rama Kovvuri, Jiyang Gao, Ram Nevatia, International Journal of Multimedia Information Retrieval, 2017 [PDF]

Spotlight: Query-guided Regression Network with Context Policy for Phrase Grounding. Kan Chen*, Rama Kovvuri*, Ram Nevatia, Computer Vision (ICCV), IEEE/CVF International Conference on, 2017 [PDF][Code]

Poster: TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals. Jiyang Gao*, Zhenheng Yang*, Kan Chen, Chen Sun, Ram Nevatia, Computer Vision (ICCV), IEEE/CVF International Conference on, 2017 [PDF][Code]

Oral: MSRC: Multimodal Spatial Regression with Semantic Context for Phrase Grounding. Kan Chen, Rama Kovvuri, Jiyang Gao, Ram Nevatia, Multimedia Retrieval (ICMR), ACM International Conference on, 2017 [PDF]

Poster: AMC: Attention guided Multi-modal Correlation Learning for Image Search. Kan Chen, Trung Bui, Fang Chen, Zhaowen Wang, Ram Nevatia, Computer Vision and Pattern Recognition (CVPR), IEEE/CVF Conference on, 2017 [PDF][Code]


2016

Poster: Activity Recognition and Prediction with Pose based Discriminative Patch Model. Song Cao, Kan Chen, Ram Nevatia, Applications of Computer Vision (WACV), IEEE Winter Conference on, 2016 [PDF]

Poster: Abstraction Hierarchy and Self Annotation Update for Fine Grained Activity Recognition. Song Cao, Kan Chen, Ram Nevatia, Applications of Computer Vision (WACV), IEEE Winter Conference on, 2016 [PDF]

Poster: ABC-CNN: An attention based convolutional neural network for visual question answering. Kan Chen, Jiang Wang, Liang-Chieh Chen, Haoyuan Gao, Wei Xu, Ram Nevatia, Computer Vision and Pattern Recognition Workshop (CVPRW), IEEE/CVF Conference on, 2016 [PDF]


Earlier

Poster: Estimating the 3D Layout of Indoor Scenes and its Clutter from Depth Sensors. Jian Zhang, Kan Chen, Alexander G. Schwing, Raquel Urtasun, Computer Vision (ICCV), IEEE/CVF International Conference on, 2013 [PDF]

Journal: Image super resolution via analysis sparse prior. Qiang Ning, Kan Chen, Li Yi, Chuchu Fan, Jiangtao Wen, IEEE Transactions on Signal Process, 2013 [PDF]