卷积神经网络及其在计算机视觉方面的应用整理

exiaohu 于 2019-08-01 发布

Convolution

Discrete convolutions

Pooling

Transposed convolution

Dilated/Atrous convolutions

Classification

LeNet-5 (Gradient-Based Learning Applied to Document Recognition)

AlexNet (ImageNet Classification with Deep Convolutional Neural Networks)

ZFNet (Visualizing and Understanding Convolutional Networks)

NIN (Network In Network)

VGG (Very Deep Convolutional Networks for Large-Scale Image Recognition)

GoogLeNet, Inceptionv2, Inceptionv3

GoogLeNet (Going deeper with convolutions)

ResNet (Deep Residual Learning for Image Recognition)

DenseNet (Densely Connected Convolutional Networks)

Object Detection

Selective Search (Selective Search for Object Recognition)

SPP-net (Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition)

R-CNN, Fast R-CNN, Faster R-CNN, Mask R-CNN

R-CNN (Rich feature hierarchies for accurate object detection and semantic segmentation)

Fast R-CNN (Fast R-CNN)

Faster R-CNN (Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks)

Mask R-CNN (Mask R-CNN)

SSD (SSD: Single Shot MultiBox Detector)

YOLO, YOLO9000, YOLOv3

YOLO (You Only Look Once: Unified, Real-Time Object Detection)

YOLOv2, YOLO9000 (YOLO9000: Better, Faster, Stronger)

YOLOv3 (YOLOv3: An Incremental Improvement)

Image Segmentation

FCN (Fully Convolutional Networks for Semantic Segmentation)

U-Net (U-Net: Convolutional Networks for Biomedical Image Segmentation)

image.png

DeepLab, DeepLabv2, DeepLabv3, DeepLabv3+, Auto-DeepLab

DeepLab (Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs)

DeepLabv3 (Rethinking Atrous Convolution for Semantic Image Segmentation)

image.png

DeepLabv3+ (Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation)

image.png

Auto-DeepLab (Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation)

Multi-Scale Context Aggregation by Dilated Convolutions (dilated convolution)

RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation

image.png image.png

Large Kernel Matters – Improve Semantic Segmentation by Global Convolutional Network

image.png

Pyramid Scene Parsing Network (PSPNet)

image.png

Deep Extreme Cut: From Extreme Points to Object Segmentation

image.png

Learning to Segment Every Thing

image.png

Multi-Scale Context Intertwining for Semantic Segmentation

image.png

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

image.png

MaskLab: Instance Segmentation by Refining Object Detection With Semantic and Direction Features

image.png

Concurrent Spatial and Channel ‘Squeeze & Excitation’ in Fully Convolutional Networks

image.png

Fully Convolutional Adaptation Networks for Semantic Segmentation

image.png

CCNet: Criss-Cross Attention for Semantic Segmentation

image.png

Reference: