Transformer models: an introduction and catalog

02/12/2023
by   Xavier Amatriain, et al.
0

In the past few years we have seen the meteoric appearance of dozens of models of the Transformer family, all of which have funny, but not self-explanatory, names. The goal of this paper is to offer a somewhat comprehensive but simple catalog and classification of the most popular Transformer models. The paper also includes an introduction to the most important aspects and innovation in Transformer models.

READ FULL TEXT

page 4

page 7

page 11

research
11/25/2021

Transformer-based Korean Pretrained Language Models: A Survey on Three Years of Progress

With the advent of Transformer, which was used in translation models in ...
research
01/20/2021

PGT: Pseudo Relevance Feedback Using a Graph-Based Transformer

Most research on pseudo relevance feedback (PRF) has been done in vector...
research
12/21/2020

RealFormer: Transformer Likes Residual Attention

Transformer is the backbone of modern NLP models. In this paper, we prop...
research
03/24/2022

Transformers Meet Visual Learning Understanding: A Comprehensive Review

Dynamic attention mechanism and global modeling ability make Transformer...
research
09/16/2022

Transformer-based Detection of Multiword Expressions in Flower and Plant Names

Multiword expression (MWE) is a sequence of words which collectively pre...
research
11/04/2022

BERT for Long Documents: A Case Study of Automated ICD Coding

Transformer models have achieved great success across many NLP problems....
research
02/23/2021

Do Transformer Modifications Transfer Across Implementations and Applications?

The research community has proposed copious modifications to the Transfo...

Please sign up or login with your details

Forgot password? Click here to reset