Monday, May 8, 2017

Convolutional neural network (CNN)

A CNN is an MLP that typically uses three kinds of hidden layer: a convolution layer (similar to a sliding window, here called a filter, whose output is called a feature map), a pooling layer (for non-linear down-sampling, e.g. max pooling, which keeps the maximum-value pixel in each region), and a fully connected layer.
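The following is a minimal NumPy sketch of the two operations just described: a filter sliding over an image to produce a feature map, and 2x2 max pooling for down-sampling. The image size, filter values, and function names are illustrative assumptions, not part of any particular library.

```python
import numpy as np

def convolve2d(image, kernel):
    """Slide `kernel` over `image` (valid padding, stride 1) to get a feature map."""
    kh, kw = kernel.shape
    oh = image.shape[0] - kh + 1
    ow = image.shape[1] - kw + 1
    feature_map = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            # one "window" position: elementwise multiply and sum
            feature_map[i, j] = np.sum(image[i:i+kh, j:j+kw] * kernel)
    return feature_map

def max_pool2d(feature_map, size=2):
    """Non-linear down-sampling: keep the max value in each size x size block."""
    h, w = feature_map.shape
    h, w = h - h % size, w - w % size          # drop any ragged border
    blocks = feature_map[:h, :w].reshape(h // size, size, w // size, size)
    return blocks.max(axis=(1, 3))

image = np.random.rand(8, 8)                   # a single-channel 8x8 "image" (assumed)
kernel = np.array([[1., 0., -1.],
                   [1., 0., -1.],
                   [1., 0., -1.]])             # a simple vertical-edge filter (assumed)
fmap = convolve2d(image, kernel)               # 6x6 feature map
pooled = max_pool2d(fmap)                      # 3x3 after 2x2 max pooling
print(fmap.shape, pooled.shape)                # (6, 6) (3, 3)
```

In a real CNN the pooled feature maps are flattened and fed into the fully connected layer mentioned above.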
Traditional MLPs do not scale well to higher-resolution images. For example, in CIFAR-10, images are only of size 32x32x3 (32 wide, 32 high, 3 color channels), so a single fully connected neuron in the first hidden layer of a regular neural network would have 32*32*3 = 3,072 weights. A 200x200 image, however, would lead to neurons that each have 200*200*3 = 120,000 weights.
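A quick check of the arithmetic quoted above (each fully connected neuron sees every input value, so its weight count is width * height * channels):

```python
# Weights per fully connected neuron, as in the two examples above.
cifar10_weights = 32 * 32 * 3      # CIFAR-10 input: 3,072 weights per neuron
larger_weights = 200 * 200 * 3     # 200x200 RGB input: 120,000 weights per neuron
print(cifar10_weights, larger_weights)   # 3072 120000
```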
Also, such a network architecture does not take the spatial structure of the data into account, treating input pixels that are far apart the same as pixels that are close together. Thus, full connectivity of neurons is wasteful for the purpose of image recognition.
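To make the "wasteful" point concrete, here is an illustrative parameter comparison under assumed sizes (100 dense hidden neurons; 32 filters of size 5x5): a fully connected layer grows with the image, while a convolution layer reuses the same small filter at every position.

```python
# Assumed sizes for illustration only.
h, w, c = 200, 200, 3                       # 200x200 RGB image
hidden_neurons = 100                        # assumed dense hidden layer size
dense_params = (h * w * c) * hidden_neurons + hidden_neurons   # 12,000,100

num_filters, k = 32, 5                      # assumed: 32 filters of size 5x5
conv_params = num_filters * (k * k * c + 1)                    # 2,432
# The convolution layer's parameter count does not depend on the image size,
# because the same filter weights are shared across all spatial positions.
print(dense_params, conv_params)
```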