research
∙
06/12/2022
ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths
Sequential data naturally have different lengths in many domains, with s...
research
∙
04/22/2022
Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention
Self-Attention is a widely used building block in neural modeling to mix...
research
∙
01/06/2022
Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks
Classification of long sequential data is an important Machine Learning ...
research
∙
09/16/2021