CVPR 2021 论文和开源项目合集
CVPR 2021 论文和开源项目合集(papers with code)!
CVPR 2021 收录列表:http://cvpr2021.thecvf.com/sites/default/files/2021-03/acceptedpaperids.txt
注1:欢迎各位大佬提交issue,分享CVPR 2021论文和开源项目!
注2:关于往年CV顶会论文以及其他优质CV论文和大盘点,详见: https://github.com/amusi/daily-paper-computer-vision
CVPR 2021 中奖群已成立!已经收录的同学,可以添加微信:CVer9999,请备注:CVPR2021已收录+姓名+学校/公司名称!一定要根据格式申请,可以拉你进群沟通开会等事宜。
Diverse Branch Block: Building a Convolution as an Inception-like Unit
Paper: https://arxiv.org/abs/2103.13425
Code: https://github.com/DingXiaoH/DiverseBranchBlock
Scaling Local Self-Attention For Parameter Efficient Visual Backbones
Paper(Oral): https://arxiv.org/abs/2103.12731
Code: None
ReXNet: Diminishing Representational Bottleneck on Convolutional Neural Network
Involution: Inverting the Inherence of Convolution for Visual Recognition
Coordinate Attention for Efficient Mobile Network Design
Inception Convolution with Efficient Dilation Search
RepVGG: Making VGG-style ConvNets Great Again
DiNTS: Differentiable Neural Network Topology Search for 3D Medical Image Segmentation
HR-NAS: Searching Efficient High-Resolution Neural Architectures with Transformers
Neural Architecture Search with Random Labels
Towards Improving the Consistency, Efficiency, and Flexibility of Differentiable Neural Architecture Search
Joint-DetNAS: Upgrade Your Detector with NAS, Pruning and Dynamic Distillation
Prioritized Architecture Sampling with Monto-Carlo Tree Search
Contrastive Neural Architecture Search with Neural Architecture Comparators
AttentiveNAS: Improving Neural Architecture Search via Attentive
ReNAS: Relativistic Evaluation of Neural Architecture Search
HourNAS: Extremely Fast Neural Architecture
Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator
OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection
Inception Convolution with Efficient Dilation Search
Regularizing Generative Adversarial Networks under Limited Data
Towards Real-World Blind Face Restoration with Generative Facial Prior
TediGAN: Text-Guided Diverse Image Generation and Manipulation
Homepage: https://xiaweihao.com/projects/tedigan/
Paper: https://arxiv.org/abs/2012.03308
Code: https://github.com/weihaox/TediGAN
Generative Hierarchical Features from Synthesizing Image
Homepage: https://genforce.github.io/ghfeat/
Paper(Oral): https://arxiv.org/abs/2007.10379
Code: https://github.com/genforce/ghfeat
Teachers Do More Than Teach: Compressing Image-to-Image Models
HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms
pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis
Homepage: https://marcoamonteiro.github.io/pi-GAN-website/
Paper(Oral): https://arxiv.org/abs/2012.00926
Code: None
DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network
Diverse Semantic Image Synthesis via Probability Distribution Modeling
LOHO: Latent Optimization of Hairstyles via Orthogonalization
PISE: Person Image Synthesis and Editing with Decoupled GAN
DeFLOCNet: Deep Image Editing via Flexible Low-level Controls
PD-GAN: Probabilistic Diverse GAN for Image Inpainting
Efficient Conditional GAN Transfer with Knowledge Propagation across Classes
Exploiting Spatial Dimensions of Latent in GAN for Real-time Image Editing
Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs
Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation
A 3D GAN for Improved Large-pose Facial Recognition
HumanGAN: A Generative Model of Humans Images
ID-Unet: Iterative Soft and Hard Deformation for View Synthesis
CoMoGAN: continuous model-guided image-to-image translation
Training Generative Adversarial Networks in One Stage
Closed-Form Factorization of Latent Semantics in GANs
Anycost GANs for Interactive Image Synthesis and Editing
Image-to-image Translation via Hierarchical Style Disentanglement
Soft-IntroVAE: Analyzing and Improving Introspective Variational Autoencoders
Homepage: https://taldatech.github.io/soft-intro-vae-web/
Paper: https://arxiv.org/abs/2012.13253
Code: https://github.com/taldatech/soft-intro-vae-pytorch
Variational Transformer Networks for Layout Generation
LoFTR: Detector-Free Local Feature Matching with Transformers
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers
Transformer Tracking
HR-NAS: Searching Efficient High-Resolution Neural Architectures with Transformers
MIST: Multiple Instance Spatial Transformer Network
Multimodal Motion Prediction with Stacked Transformers
Revamping cross-modal recipe retrieval with hierarchical Transformers and self-supervised learning
Paper: https://www.amazon.science/publications/revamping-cross-modal-recipe-retrieval-with-hierarchical-transformers-and-self-supervised-learning
Code: https://github.com/amzn/image-to-recipe-transformers
Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking
Paper(Oral): https://arxiv.org/abs/2103.11681
Code: https://github.com/594422814/TransformerTrack
Pre-Trained Image Processing Transformer
End-to-End Video Instance Segmentation with Transformers
UP-DETR: Unsupervised Pre-training for Object Detection with Transformers
End-to-End Human Object Interaction Detection with HOI Transformer
Transformer Interpretability Beyond Attention Visualization
Regularizing Neural Networks via Adversarial Model Perturbation
Generalizing to the Open World: Deep Visual Odometry with Online Adaptation
Adaptive Class Suppression Loss for Long-Tail Object Detection
Contrastive Learning based Hybrid Networks for Long-Tailed Image Classification
Scale-aware Automatic Augmentation for Object Detection
Paper: https://arxiv.org/abs/2103.17220
Code: https://github.com/Jia-Research-Lab/SA-AutoAug
Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning
Spatially Consistent Representation Learning
VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples
Exploring Simple Siamese Representation Learning
Dense Contrastive Learning for Self-Supervised Visual Pre-Training
Instant-Teaching: An End-to-End Semi-Supervised Object Detection Framework
Adaptive Consistency Regularization for Semi-Supervised Transfer Learning
Capsule Network is Not More Robust than Convolutional Network
Adaptive Class Suppression Loss for Long-Tail Object Detection
VarifocalNet: An IoU-aware Dense Object Detector
Paper(Oral): https://arxiv.org/abs/2008.13367
Code: https://github.com/hyz-xmaster/VarifocalNet
Scale-aware Automatic Augmentation for Object Detection
Paper: https://arxiv.org/abs/2103.17220
Code: https://github.com/Jia-Research-Lab/SA-AutoAug
OTA: Optimal Transport Assignment for Object Detection
Distilling Object Detectors via Decoupled Features
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
There is More than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking with Sound by Distilling Multimodal Knowledge
Positive-Unlabeled Data Purification in the Wild for Object Detection
Instance Localization for Self-supervised Detection Pretraining
MeGA-CDA: Memory Guided Attention for Category-Aware Unsupervised Domain Adaptive Object Detection
End-to-End Object Detection with Fully Convolutional Network
Robust and Accurate Object Detection via Adversarial Learning
Paper: https://arxiv.org/abs/2103.13886
Code: None
I^3Net: Implicit Instance-Invariant Network for Adapting One-Stage Object Detectors
Instant-Teaching: An End-to-End Semi-Supervised Object Detection Framework
OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection
YOLOF:You Only Look One-level Feature
UP-DETR: Unsupervised Pre-training for Object Detection with Transformers
General Instance Distillation for Object Detection
There is More than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking with Sound by Distilling Multimodal Knowledge
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection
Multiple Instance Active Learning for Object Detection
Towards Open World Object Detection
Dense Relation Distillation with Context-aware Aggregation for Few-Shot Object Detection
Semantic Relation Reasoning for Shot-Stable Few-Shot Object Detection
Few-Shot Object Detection via Contrastive Proposal Encoding
ReDet: A Rotation-equivariant Detector for Aerial Object Detection
Paper: https://arxiv.org/abs/2103.07733
Code: https://github.com/csuhan/ReDet
Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark
Homepage: https://sites.google.com/view/langtrackbenchmark/
Paper: https://arxiv.org/abs/2103.16746
Evaluation Toolkit: https://github.com/wangxiao5791509/TNL2Kevaluationtoolkit
Demo video: https://www.youtube.com/watch?v=7lvVDlkkff0&ab_channel=XiaoWang
IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking
Graph Attention Tracking
Rotation Equivariant Siamese Networks for Tracking
Track to Detect and Segment: An Online Multi-Object Tracker
Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking
Paper(Oral): https://arxiv.org/abs/2103.11681
Code: https://github.com/594422814/TransformerTrack
Transformer Tracking
Multiple Object Tracking with Correlation Learning
Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking
Learning a Proposal Classifier for Multiple Object Tracking
Track to Detect and Segment: An Online Multi-Object Tracker
Progressive Semantic Segmentation
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Bidirectional Projection Network for Cross Dimension Scene Understanding
Cross-Dataset Collaborative Learning for Semantic Segmentation
Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations
Capturing Omni-Range Context for Omnidirectional Segmentation
Learning Statistical Texture for Semantic Segmentation
PLOP: Learning without Forgetting for Continual Semantic Segmentation
Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation
BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation
Semi-supervised Domain Adaptation based on Dual-level Domain Mixing for Semantic Segmentation
RobustNet: Improving Domain Generalization in Urban-Scene Segmentation via Instance Selective Whitening
Coarse-to-Fine Domain Adaptive Semantic Segmentation with Photometric Alignment and Category-Center Regularization
MetaCorrection: Domain-aware Meta Loss Correction for Unsupervised Domain Adaptation in Semantic Segmentation
Multi-Source Domain Adaptation with Collaborative Learning for Semantic Segmentation
Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation
VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild
Boundary IoU: Improving Object-Centric Image Segmentation Evaluation
Paper: https://arxiv.org/abs/2103.16562
Code: https://github.com/bowenc0221/boundary-iou-api
Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers
Paper: https://arxiv.org/abs/2103.12340
Code: https://github.com/lkeab/BCNet
End-to-End Video Instance Segmentation with Transformers
Zero-shot instance segmentation(Not Sure)
Panoptic Segmentation Forecasting
Fully Convolutional Networks for Panoptic Segmentation
Paper: https://arxiv.org/abs/2012.00720
Code: https://github.com/yanwei-li/PanopticFCN
Cross-View Regularization for Domain Adaptive Panoptic Segmentation
FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space
DiNTS: Differentiable Neural Network Topology Search for 3D Medical Image Segmentation
Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild
Paper: https://arxiv.org/abs/2103.10391
Code: https://github.com/svip-lab/IVOS-W
Uncertainty-aware Joint Salient Object and Camouflaged Object Detection
Paper: https://arxiv.org/abs/2104.02628
Code: https://github.com/JingZhang617/JointCODSOD
Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion
Uncertainty-aware Joint Salient Object and Camouflaged Object Detection
Paper: https://arxiv.org/abs/2104.02628
Code: https://github.com/JingZhang617/JointCODSOD
Anchor-Free Person Search
No frame left behind: Full Video Action Recognition
Learning Salient Boundary Feature for Anchor-free Temporal Action Localization
Temporal Context Aggregation Network for Temporal Action Proposal Refinement
ACTION-Net: Multipath Excitation for Action Recognition
Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning
TDN: Temporal Difference Networks for Efficient Action Recognition
A 3D GAN for Improved Large-pose Facial Recognition
MagFace: A Universal Representation for Face Recognition and Quality Assessment
WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition
When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework
HLA-Face: Joint High-Low Adaptation for Low Light Face Detection
CRFace: Confidence Ranker for Model-Agnostic Face Detection Refinement
Cross Modal Focal Loss for RGBD Face Anti-Spoofing
Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in Frequency Domain
Multi-attentional Deepfake Detection
PML: Progressive Margin Loss for Long-tailed Age Classification
Affective Processes: stochastic modelling of temporal context for emotion and facial expression recognition
MagDR: Mask-guided Detection and Reconstruction for Defending Deepfakes
Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing
DCPose: Deep Dual Consecutive Network for Human Pose Estimation
HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation
From Synthetic to Real: Unsupervised Domain Adaptation for Animal Pose Estimation
POSEFusion: Pose-guided Selective Fusion for Single-view Human Volumetric Capture
Homepage: http://www.liuyebin.com/posefusion/posefusion.html
Paper(Oral): https://arxiv.org/abs/2103.15331
Code: None
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
Checkerboard Context Model for Efficient Learned Image Compression
Slimmable Compressive Autoencoders for Practical Neural Image Compression
Attention-guided Image Compression by Deep Reconstruction of Compressive Sensed Saliency Skeleton
Teachers Do More Than Teach: Compressing Image-to-Image Models
Dynamic Slimmable Network
Zero-shot Adversarial Quantization
Learnable Companding Quantization for Accurate Low-bit Neural Networks
Distilling Object Detectors via Decoupled Features
ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic
AdderSR: Towards Energy Efficient Image Super-Resolution
Temporal Modulation Network for Controllable Space-Time Video Super-Resolution
Multi-Stage Progressive Image Restoration
TransFill: Reference-guided Image Inpainting by Merging Multiple Color and Spatial Transformations
PD-GAN: Probabilistic Diverse GAN for Image Inpainting
High-Fidelity and Arbitrary Face Editing
Anycost GANs for Interactive Image Synthesis and Editing
PISE: Person Image Synthesis and Editing with Decoupled GAN
DeFLOCNet: Deep Image Editing via Flexible Low-level Controls
Exploiting Spatial Dimensions of Latent in GAN for Real-time Image Editing
LoFTR: Detector-Free Local Feature Matching with Transformers
Convolutional Hough Matching Networks
Robust Reflection Removal with Reflection-free Flash-only Cues
Equivariant Point Network for 3D Point Cloud Analysis
PAConv: Position Adaptive Convolution with Dynamic Kernel Assembling on Point Clouds
LiDAR R-CNN: An Efficient and Universal 3D Object Detector
M3DSSD: Monocular 3D Single Stage Object Detector
Paper: https://arxiv.org/abs/2103.13164
Code: https://github.com/mumianyuxin/M3DSSD
SE-SSD: Self-Ensembling Single-Stage Object Detector From Point Cloud
Center-based 3D Object Detection and Tracking
Categorical Depth Distribution Network for Monocular 3D Object Detection
Bidirectional Projection Network for Cross Dimension Scene Understanding
Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation
Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges
Panoptic-PolarNet: Proposal-free LiDAR Point Cloud Panoptic Segmentation
Center-based 3D Object Detection and Tracking
ReAgent: Point Cloud Registration using Imitation and Reinforcement Learning
PointDSC: Robust Point Cloud Registration using Deep Spatial Consistency
PREDATOR: Registration of 3D Point Clouds with Low Overlap
Style-based Point Generator with Adversarial Rendering for Point Cloud Completion
NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video
Homepage: https://zju3dv.github.io/neuralrecon/
Paper(Oral): https://arxiv.org/abs/2104.00681
Code: https://github.com/zju3dv/NeuralRecon
FS-Net: Fast Shape-based Network for Category-Level 6D Object Pose Estimation with Decoupled Rotation Mechanism
GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation
FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation
Back to the Feature: Learning Robust Camera Localization from Pixels to Pose
Beyond Image to Depth: Improving Depth Prediction using Echoes
S3: Learnable Sparse Signal Superdensity for Guided Depth Estimation
Depth from Camera Motion and Object Detection
LiBRe: A Practical Bayesian Approach to Adversarial Detection
Natural Adversarial Examples
StyleMeUp: Towards Style-Agnostic Sketch-Based Image Retrieval
QAIR: Practical Query-efficient Black-Box Attacks for Image Retrieval
On Semantic Similarity in Video Retrieval
Paper: https://arxiv.org/abs/2103.10095
Homepage: https://mwray.github.io/SSVR/
Code: https://github.com/mwray/Semantic-Video-Retrieval
Cross-Modal Center Loss for 3D Cross-Modal Retrieval
Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers
Revamping cross-modal recipe retrieval with hierarchical Transformers and self-supervised learning
Paper: https://www.amazon.science/publications/revamping-cross-modal-recipe-retrieval-with-hierarchical-transformers-and-self-supervised-learning
Code: https://github.com/amzn/image-to-recipe-transformers
Counterfactual Zero-Shot and Open-Set Visual Recognition
FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space
CDFI: Compression-Driven Network Design for Frame Interpolation
FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation
Homepage: https://tarun005.github.io/FLAVR/
Paper: https://arxiv.org/abs/2012.08512
Code: https://github.com/tarun005/FLAVR
Transformation Driven Visual Reasoning
Self-Supervised Visibility Learning for Novel View Synthesis
NeX: Real-time View Synthesis with Neural Basis Expansion
Variational Transformer Networks for Layout Generation
RobustNet: Improving Domain Generalization in Urban-Scene Segmentation via Instance Selective Whitening
Adaptive Methods for Real-World Domain Generalization
FSDR: Frequency Space Domain Randomization for Domain Generalization
Learning Placeholders for Open-Set Recognition
IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking
Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information
Reformulating HOI Detection as Adaptive Set Prediction
Detecting Human-Object Interaction via Fabricated Compositional Learning
End-to-End Human Object Interaction Detection with HOI Transformer
Auto-Exposure Fusion for Single-Image Shadow Removal
Parser-Free Virtual Try-on via Distilling Appearance Flows
基于外观流蒸馏的无需人体解析的虚拟换装
VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild
Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark
Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark
Homepage: https://vap.aau.dk/sewer-ml/
Paper: https://arxiv.org/abs/2103.10895
Nutrition5k: Towards Automatic Nutritional Understanding of Generic Food
Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges
When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework
Depth from Camera Motion and Object Detection
There is More than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking with Sound by Distilling Multimodal Knowledge
Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Code: https://github.com/daveredrum/Scan2Cap
Dataset: https://github.com/daveredrum/ScanRefer
There is More than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking with Sound by Distilling Multimodal Knowledge
Visually Informed Binaural Audio Generation without Binaural Audios
Paper: None
GitHub: https://github.com/SheldonTsui/PseudoBinaural_CVPR2021
Demo: https://www.youtube.com/watch?v=r-uC2MyAWQc
Domain Consensus Clustering for Universal Domain Adaptation
Exploring intermediate representation for monocular vehicle pose estimation
Tuning IR-cut Filter for Illumination-aware Spectral Reconstruction from RGB
Invertible Image Signal Processing
Video Rescaling Networks with Joint Optimization Strategies for Downscaling and Upscaling
SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences
Embedding Transfer with Label Relaxation for Improved Metric Learning
Picasso: A CUDA-based Library for Deep Learning over 3D Meshes
Meta-Mining Discriminative Samples for Kinship Verification
Cloud2Curve: Generation and Vectorization of Parametric Sketches
TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution
Homepage: http://wellyzhang.github.io/project/prae.html
Paper: https://arxiv.org/abs/2103.14230
Code: None
ACRE: Abstract Causal REasoning Beyond Covariation
Homepage: http://wellyzhang.github.io/project/acre.html
Paper: https://arxiv.org/abs/2103.14232
Code: None
Confluent Vessel Trees with Accurate Bifurcations
Few-Shot Human Motion Transfer by Personalized Geometry and Texture Modeling
Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks
Knowledge Evolution in Neural Networks
Multi-institutional Collaborations for Improving Deep Learning-based Magnetic Resonance Image Reconstruction Using Federated Learning
SGP: Self-supervised Geometric Perception
Oral
Paper: https://arxiv.org/abs/2103.03114
Code: https://github.com/theNded/SGP
Multi-institutional Collaborations for Improving Deep Learning-based Magnetic Resonance Image Reconstruction Using Federated Learning
Diffusion Probabilistic Models for 3D Point Cloud Generation
Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Code: https://github.com/daveredrum/Scan2Cap
Dataset: https://github.com/daveredrum/ScanRefer
There is More than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking with Sound by Distilling Multimodal Knowledge
Code: http://rl.uni-freiburg.de/research/multimodal-distill
Dataset: http://rl.uni-freiburg.de/research/multimodal-distill
CT Film Recovery via Disentangling Geometric Deformation and Photometric Degradation: Simulated Datasets and Deep Models
Toward Explainable Reflection Removal with Distilling and Model Uncertainty
DeepOIS: Gyroscope-Guided Deep Optical Image Stabilizer Compensation
Exploring Adversarial Fake Images on Face Manifold
Uncertainty-Aware Semi-Supervised Crowd Counting via Consistency-Regularized Surrogate Task
Temporal Contrastive Graph for Self-supervised Video Representation Learning
Boosting Monocular Depth Estimation Models to High-Resolution via Context-Aware Patching
Fast and Memory-Efficient Compact Bilinear Pooling
Identification of Empty Shelves in Supermarkets using Domain-inspired Features with Structural Support Vector Machine
Estimating A Child's Growth Potential From Cephalometric X-Ray Image via Morphology-Aware Interactive Keypoint Estimation
https://github.com/ShaoQiangShen/CVPR2021
https://github.com/gillesflash/CVPR2021
https://github.com/anonymous-submission1991/BaLeNAS
https://github.com/cvpr2021dcb/cvpr2021dcb
https://github.com/anonymousauthorCV/CVPR2021PaperID8578
https://github.com/AldrichZeng/FreqPrune
https://github.com/Anonymous-AdvCAM/Anonymous-AdvCAM
https://github.com/ddfss/datadrive-fss