Aston Zhang
Applied Scientist
Autonomous user interface (UI) agents aim to facilitate task automation ...
A particularly successful class of approaches for few-shot learning comb...
Transformer models are foundational to natural language processing (NLP)...
Diffusion models that are based on iterative denoising have been recentl...
This work proposes POMP, a prompt pre-training method for vision-languag...
Spurred by advancements in scale, large language models (LLMs) have demo...
Large language models (LLMs) have shown impressive performance on comple...
Parameter-efficient fine-tuning aims to achieve performance comparable t...
The ability to jointly learn from multiple modalities, such as text, aud...
Pre-trained large language models can efficiently interpolate human-writ...
Mixture of Experts (MoE) parallelism is a recent advancement that sca...
Deep neural networks (DNNs) are vulnerable to backdoor attacks. Previous...
Large language models (LLMs) can perform complex reasoning by generating...
Existing out-of-distribution (OOD) detection methods are typically bench...
Adversarial training (AT) defends deep neural networks against adversari...
Data augmentation is a necessity to enhance data efficiency in deep lear...
Hypercomplex neural networks have proved to reduce the overall number of...
This open-source book represents our attempt to make deep learning appro...
In E-commerce, a key challenge in text generation is to find a good trad...
Recent works have demonstrated reasonable success of representation lear...
With graphs rapidly growing in size and deeper graph neural networks (GN...
Modeling user interests is crucial in real-world recommender systems. In...
This paper reviews the novel concept of controllable variational autoenc...
This paper demonstrates a fatal vulnerability in natural language infere...
Pretrained Transformer-based language models (LMs) display remarkable na...
GitHub has become a popular social application platform, where a large n...
Variational Autoencoders (VAE) and their variants have been widely used ...
Transformer has been widely used thanks to its ability to capture sequen...
Traditionally, many text-mining tasks treat individual word-tokens as th...
We present GluonCV and GluonNLP, the deep learning toolkits for computer...
Many state-of-the-art neural models for NLP are heavily parameterized an...
This paper proposes Quaternion Collaborative Filtering (QCF), a novel re...
This paper tackles the problem of reading comprehension over long narrat...
Expert finding is an important task in both industry and academia. It is...
Recent advances in deep learning motivate the use of deep neural networ...
Mobile sensing applications usually require time-series inputs from sens...