Generate pictures pixel by pixel
Related Works
- PixelRNN: better performance
- PixelCNN: faster to train (easier to parallelize)
Gated PixelCNN
Conditional variant of the Gated PixelCNN

Posted 2019-08-16Updated 2021-03-31Paper Notes

Attention is All You Need

NIPS 2017
@google.com

pdf

Introduction

Recurrent Models
- sequence operations
- hard to do parallelization
the Transformer
- no recurrence
- relying entirely on an attention mechanism
- global dependencies
- parallelization with 8 GPUs

Posted 2019-08-16Updated 2021-03-31Paper Notes

Effective Approaches to Attention-based Neural Machine Translation

EMNLP 2015
@stanford.edu

pdf

Introduction

Attentional mechanism jointly translate and align words
2 types of attention-based models
- global
  - all source words
- local
  - subset of source words

Posted 2019-07-31Updated 2021-03-31Paper Notes

Auto-Encoding Variational Bayes

ICLR 2014
@google.com

pdf

Introduction

Variational Bayesian (VB)
Approximate posterior using MLP
Stochatic Gradient Variational Bayes (SGVB)
Auto-Encoding VB (AEVB) algorithm
Variational auto-encoder (VAE)

Method

problem

the integral of the marginal likelihood $p_\theta(x) = \int p_\theta(z)p_\theta(x|z) dz$ is intractable
a large dataset: the need of updating using small minibatches

Posted 2019-07-25Updated 2021-03-31Paper Notes

Sequence to Sequence Learning with Neural Networks

NIPS 2014
@google.com

pdf

Abstract

Task: an English to French translation task from the WMT-14 dataset
Method:
- a Deep LSTM: maps theinput sequence to a vector of a fixed dimensionality
- another Deep LSTM: decodes the target sequence from the vector
Result: BLEU score 34.8
Additional Founds: reversing the order of the words in all source sentences (but not target sentences) improved the LSTM’s performance markedly

Posted 2019-07-17Updated 2021-03-31Paper Notes

Residual Learning for Image Recognition

CVPR 2016
@microsoft.com

pdf

Abstract

residual networks
VGG nets
COCO object detection dataset

Datasets

MNIST Series

MNIST

Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization

Contents

Introduction

Learning Deep Features for Discriminative Localization

Contents

Introduction

Problem

Goal

Conditional Image Generation with PixelCNN Decoders

Contents

Introduction

Attention is All You Need

Contents

Introduction

Effective Approaches to Attention-based Neural Machine Translation

Contents

Introduction

Auto-Encoding Variational Bayes

Contents

Introduction

Method

problem

Sequence to Sequence Learning with Neural Networks

Contents

Abstract

Residual Learning for Image Recognition

Contents

Abstract

Recents

Categories

Tags

Archives

Links