Alex的部落格-FINTECH專案管理與行動支付研究: 20200906 AI論文研討筆記

20200906 AI論文研討筆記

課程一 The Core Concept of NLP

何謂QKV?

在NLP領域，Q代表一整句話的EMBEDDING，K=V代表著每個單詞的EMBEDDING，K和V相等是因為，KEY只是一個索引而已，最終的值都是VALUE

LSTM=>transformer(Seq2seq)=>bert=>nlp

LSTM(因為要一個字一個字訓練太慢)

Transformer(一次丟20個字)

研討素材

1.論文 Frustratingly Short Attention Spans in Neural Language Modeling

https://arxiv.org/pdf/1702.04521.pdf

2.Learning Word Embedding-QKV

https://bit.ly/2GwxnJH

Word2vec Parameter Learning Explained

https://bit.ly/3btIH4S

3.論文解説 Convolutional Sequence to Sequence Learning (ConvS2S)

https://bit.ly/2GsY028

4.Attention

https://bit.ly/2QYXVFt

延伸閱讀

課程三 word2vect

重點

Word2vec 就是NNLM的簡化版

Word2vec模型

Recent Trends in Deep Learning Based Natural Language Processing

Word2Vec基本概念

King – Man + Woman = Queen

霍夫曼編碼

論文

Recent Trends in Deep Learning Based Natural Language Processing

Word Embeddings, Analogies, and Machine Learning: Beyond King - M an + W oman = Queen

課程四 06C_Word2vec

重點

Hierarchical Softmax

霍夫曼樹中有 V-1 個中間節點，V 個葉節點

On word embeddings - Part 2: Approximating the Softmax

Hierarchical softmax computations (Hugo Lachorelle's Youtube lectures)
適用於object detection、類似Yolov2
分層Softmax計算
GloVe
fastText v1
WordRank
NNLMs

結論

Word2vec v1: CBOW and Skip-gram
Word2vec v2: Hierarchical Softmax and Negative Sampling
Word2vec v3: Simplfied Word2vec v1 and v2
LSA: Co-occurrence Matrix+SVD
GloVe: Word2vec+LSA
fastText v1: CBOW and w(t) to Label
fastText v2: Skip-gram and Word to Subword
WordRank: Word Embedding to Word Ranking

論文

GloVe: Global Vectors for Word Representation

課程五: 2020 LeNet Tutorial - 1、LeNet、精華篇

觀察

好的文章會引用好的文章，也會被好的文章引用

讀論文要讀他以前引用的論文

產業應用

卓騰語言科技AI -> NLP

課程六: 如何讀懂一篇論文以LeNet為例

閱讀順序

LeNet: LeNet1、LeNet2、LeNet3、LeNet4、LeNetLab

AlexNet: AlexNet AlexNet2 AlexNet3 AlexNet4
ZFNet: ZFNet1、ZFNet2、ZFNet3、ZFNet4

總結: 建議翻譯AlexNet論文

作法: 一天一小段

Alex的部落格-FINTECH專案管理與行動支付研究

2020年9月6日星期日

20200906 AI論文研討筆記

沒有留言:

張貼留言

2020年9月6日 星期日

20200906 AI論文研討筆記

沒有留言:

張貼留言

2020年9月6日星期日