Papers about deep learning ordered by task, date. Current state-of-the-art papers and papers useful for getting started are labelled.
- YOLO9000: Better, Faster, Stronger, dec 2016, arxiv
- Feature Pyramid Networks for Object Detection, dec 2016, arxiv
- Densely Connected Convolutional Networks, dec 2016, arxiv
- PVANet: Lightweight Deep Neural Networks for Real-time Object Detection, dec 2016, arxiv
- Aggregated Residual Transformations for Deep Neural Networks, nov 2016, arxiv
- Hierarchical Object Detection with Deep Reinforcement Learning, nov 2016, arxiv
- Learning to Make Better Mistakes: Semantics-aware Visual Food Recognition, okt 2016, IBM, paper
- T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos, aug 2016, github, arxiv
- Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning, aug 2016, Google, arxiv
- Residual Networks of Residual Networks: Multilevel Residual Networks, aug 2016, arxiv
- Context Matters: Refining Object Detection in Video with Recurrent Neural Networks, jul 2016, arxiv
- R-FCN: Object Detection via Region-based Fully Convolutional Networks, jun 2016, arxiv
- Training Region-based Object Detectors with Online Hard Example Mining, apr 2016, Facebook, arxiv
- Deep Residual Learning for Image Recognition, dec 2015, arxiv
- SSD: Single Shot MultiBox Detector, dec 2015, Google, github, arxiv
- ParseNet: Looking Wider to See Better, jun 2015, arxiv
- You Only Look Once: Unified, Real-Time Object Detection, jun 2015, Facebook, arxiv
- Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, jun 2015, Microsoft/Facebook arxiv
- Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification, feb 2015, arxiv
- Deep Image: Scaling up Image Recognition, jul 2015, arxiv
- Rich feature hierarchies for accurate object detection and semantic segmentation, 2014, paper
- Selective Search for Object Recognition, 2012, paper
- ImageNet Classification with Deep Convolutional Neural Networks, 2012, paper
- Mask R-CNN, mar 2017, arxiv
- Learning Features by Watching Objects Move, dec 2016, Facebook, arxiv
- Fully Convolutional Networks for Semantic Segmentation, may 2016, arxiv
- Instance-aware Semantic Segmentation via Multi-task Network Cascades, dec 2015, arxiv
- SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation, dec 2015, arxiv
- Learning Rich Features from RGB-D Images for Object Detection and Segmentation, jul 2014, arxiv
- Fast Single Shot Detection and Pose Estimation, sep 2016, arxiv
- Accessorize to a Crime: Real and Stealthy Attacks on State-of-the-Art Face Recognition, paper
- OpenFace: A general-purpose face recognition library with mobile applications, June 2016, paper
- Deep Face Recognition, 2015, paper
- Compact Convolutional Neural Network Cascade for Face Detection, aug 2015, arxiv
- Learning Robust Deep Face Representation, Jul 2015, arxiv
- FaceNet: A Unified Embedding for Face Recognition and Clustering, jun 2015, paper
- Multi-view Face Detection Using Deep Convolutional Neural Networks, yahoo, feb 2015, arxiv
- Deep Photo Style Transfer, mar 2017, arxiv
- A learned representation for artistic style, okt 2016, Google, arxiv, demo
- Fast Style Transfer in TensorFlow, github
- Instance Normalization: The Missing Ingredient for Fast Stylization, sept 2016, arxiv
- A Neural Algorithm of Artistic Style, sept 2015, arxiv
- Perceptual Losses for Real-Time Style Transfer and Super-Resolution, arxiv, github
- Deep Learning Logo Detection with Data Expansion by Synthesising Context, jan 2016, arxiv
- Automatic Graphic Logo Detection via Fast Region-based Convolutional Networks, apr 2016, arxiv
- Logo Localization and Recognition in Natural Images Using Homographic Class Graphs, 2016, paper
- LOGO-Net: Large-scale Deep Logo Detection and Brand Recognition with Deep Region-based Convolutional Networks, nov 2015, arxiv
- DeepLogo: Hitting Logo Recognition with the Deep Neural Network Hammer, okt 2015, Berkely, arxiv
- Automatic detection of logos in video and their removal using inpainting, jul 2015, paper
- On the Benefit of Synthetic Data for Company Logo Detection, 2015, paper
- Fast and Robust Realtime Storefront Logo Recognition, paper
- Scalable Logo Recognition in Real-World Images, 2011, paper
- https://arxiv.org/pdf/1609.01414v1.pdf
note: also includes some papers that use SIFT
- COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images, jun 2016, arxiv
- Recursive Recurrent Nets with Attention Modeling for OCR in the Wild, mar 2016, arxiv
- OCR Error Correction Using Character Correction and Feature-Based Word Classification, 2016, arxiv
- Efficient Scene Text Localization and Recognition with Local Character Refinement, apr 2015, arxiv
- Reading Text in the Wild with Convolutional Neural Networks, dec 2014, arxiv
- Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition, jun 2014, arxiv
- Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning, 2011, paper
- Generation and Comprehension of Unambiguous Object Descriptions, apr 2016, arxiv
- Long-term Recurrent Convolutional Networks for Visual Recognition and Description, may 2016, arxiv
- Detecting events and key actors in multi-person videos, mar 2015, arxiv
- End to End Learning for Self-Driving Cars, apr 2016, NVIDIA, arxiv
- Visualizing and Understanding Convolutional Networks, nov 2013, arxiv
- Discovering the physical parts of an articulated object class from multiple videos, 2016, paper
- Joint Deep Learning for Pedestrian Detection, 2013, paper
- Lip Reading in the Wild, 2016, Oxford, paper
- RAISR: Rapid and Accurate Image Super Resolution, okt 2016, Google, arxiv
- Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network, sep 2016, Twitter, arxiv
- Full Resolution Image Compression with Recurrent Neural Networks, aug 2016, Google arxiv
- DeepMath - Deep Sequence Models for Premise Selection, jun 2016, Google arxiv
- Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data, okt 2016, arxiv
- Stealing Machine Learning Models via Prediction APIs, aug 2016, paper
- Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation, nov 2016, Google, arxiv
- Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation, okt 2016, Google arxiv
- Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker, dec 2016, arxiv