Transformer
Sequence-to-sequence (Seq2seq)

Hokkien (Min Nan, Taiwanese)

Text to Speech (TTS) Synthesis

Seq2seq for Chatbot

Most Natural Language Processing applications…


Seq2seq for Syntactic Parsing


Seq2seq for Multi-label Classification

Seq2seq for Object Detection

Seq2seq

Encoder


Batch Norm: compute the mean $m$ and standard deviation $\sigma$ over the same dimension, across different features and different examples.
Layer Norm: compute the mean $m$ and standard deviation $\sigma$ over the different dimensions of the same feature in the same example.
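
The difference between the two normalizations can be sketched in a few lines of plain Python. This is a minimal illustration, not a reference implementation. It assumes that "feature" means one feature vector and that the batch is simply a list of such vectors. The helper names (`_normalize`, `batch_norm`, `layer_norm`) and the small constant `eps` are hypothetical choices made for this sketch, and the learned scale and shift parameters that real layers usually add are left out.

```python
import math

def _normalize(values, eps=1e-5):
    """Subtract the mean m and divide by the standard deviation sigma.

    eps is an assumed small constant that guards against division by zero.
    """
    m = sum(values) / len(values)
    var = sum((v - m) ** 2 for v in values) / len(values)
    return [(v - m) / math.sqrt(var + eps) for v in values]

def batch_norm(batch, eps=1e-5):
    """Same dimension, different vectors: statistics are taken per column."""
    columns = [_normalize(list(col), eps) for col in zip(*batch)]
    # Transpose back so the result has the same layout as the input.
    return [list(row) for row in zip(*columns)]

def layer_norm(batch, eps=1e-5):
    """Same vector, different dimensions: statistics are taken per row."""
    return [_normalize(vec, eps) for vec in batch]

# Two feature vectors with three dimensions each (made-up numbers).
batch = [[1.0, 2.0, 3.0],
         [4.0, 6.0, 8.0]]

bn = batch_norm(batch)  # each column now has mean close to 0
ln = layer_norm(batch)  # each row now has mean close to 0
```

With Batch Norm the result for one vector depends on the other vectors in the batch. With Layer Norm each vector is normalized on its own, so the batch size does not matter.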


To learn more……

Autoregressive



Self-attention → Masked Self-attention




AT vs NAT

Transformer


Cross Attention


Training


Copy Mechanism


Guided Attention

Beam Search


Optimizing Evaluation Metrics?

Exposure Bias

