Deep Dive into the Computer Vision World

A repo for implementing neural networks from scratch

1. Paper Reproduction

Studying neural networks can be divided into three parts: the applications, the implementations, and the intuitions behind the architectures. Thanks to user-friendly frameworks such as Keras, the application side is open to everyone. But the real intuition behind a model is sometimes overlooked. What was the researchers' intention in building a model with such a structure? What motivated them to take such an approach? And what can we infer from the outcome?

This repository is an ongoing project for studying state-of-the-art networks. Starting from VGG, it covers the intuitions behind each network and its implementation. The networks are mostly milestones in computer vision tasks such as image classification, object detection, image segmentation, and face detection.

Project Date: Jul 2019 ~
Applied skills: TensorFlow, Keras, PyTorch

2. Paper Reviews

Part 1. VGG, ResNet, Inception Network, Xception, and MobileNet
Part 2. R-CNN, Fast R-CNN, and Faster R-CNN
Part 3. YOLO, SSD, and Mask R-CNN

3. Implementation From Scratch
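
As a flavour of what "from scratch" can mean for the first network on the list, here is a minimal sketch in plain Python. It is not the code in this repository: the names `VGG16_CFG` and `count_vgg_params` are hypothetical, and it assumes the 16-layer configuration from Simonyan and Zisserman with a 224x224 RGB input and 1000 output classes. It only walks the layer list and counts trainable parameters, which is a quick sanity check to run before writing the real model.

```python
# Illustrative sketch only; names are hypothetical and not taken from this repository.
# Integers are the output channels of a 3x3 convolution; "M" is a 2x2 max-pool with stride 2.
VGG16_CFG = [64, 64, "M", 128, 128, "M", 256, 256, 256, "M",
             512, 512, 512, "M", 512, 512, 512, "M"]


def count_vgg_params(cfg, in_channels=3, image_size=224,
                     fc_sizes=(4096, 4096), num_classes=1000):
    """Return (conv_params, fc_params) for a VGG-style network."""
    conv_params = 0
    channels, size = in_channels, image_size
    for layer in cfg:
        if layer == "M":
            size //= 2  # pooling halves the spatial resolution
        else:
            # 3x3 kernel weights plus one bias per output channel;
            # padding of 1 keeps the spatial size unchanged
            conv_params += 3 * 3 * channels * layer + layer
            channels = layer

    fc_params = 0
    features = channels * size * size  # flattened final feature map
    for width in (*fc_sizes, num_classes):
        fc_params += features * width + width  # weights plus biases
        features = width
    return conv_params, fc_params


conv, fc = count_vgg_params(VGG16_CFG)
print(f"conv: {conv:,}  fc: {fc:,}  total: {conv + fc:,}")
# prints: conv: 14,714,688  fc: 123,642,856  total: 138,357,544
```

Under these assumptions, roughly 89% of the parameters sit in the three fully connected layers rather than in the convolutions.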

4. Reference

Karen Simonyan and Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition, 2015
Kaiming He et al., Deep Residual Learning for Image Recognition, 2015
Christian Szegedy et al., Going Deeper with Convolutions, 2014
Christian Szegedy et al., Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning, 2016
François Chollet, Xception: Deep Learning with Depthwise Separable Convolutions, 2017
Andrew G. Howard et al., MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications, 2017
Ross Girshick et al., Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014
Kaiming He et al., Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition, 2015
Ross Girshick, Fast R-CNN, 2015
Shaoqing Ren et al., Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, 2015
Joseph Redmon et al., You Only Look Once: Unified, Real-Time Object Detection, 2016
Joseph Redmon et al., YOLO9000: Better, Faster, Stronger, 2016
Joseph Redmon et al., YOLOv3: An Incremental Improvement, 2018

jjone36/dl_scratch
