Nikos Karampatziakis

research

∙ 03/13/2023

Meet in the Middle: A New Pre-training Paradigm

Most language models (LMs) are trained and applied in an autoregressive ...

0 Anh Nguyen, et al. ∙

research

∙ 10/19/2022

Anytime-valid off-policy inference for contextual bandits

Contextual bandit algorithms are ubiquitous tools for active sequential ...

0 Ian Waudby-Smith, et al. ∙

research

∙ 12/06/2021

Contextual Bandit Applications in Customer Support Bot

Virtual support agents have grown in popularity as a way for businesses ...

0 Sandra Sajeev, et al. ∙

research

∙ 02/18/2021

Off-policy Confidence Sequences

We develop confidence bounds that hold uniformly over time for off-polic...

0 Nikos Karampatziakis, et al. ∙

research

∙ 06/07/2019

Empirical Likelihood for Contextual Bandits

We apply empirical likelihood techniques to contextual bandit policy val...

5 Nikos Karampatziakis, et al. ∙

research

∙ 05/06/2019

Lessons from Real-World Reinforcement Learning in a Customer Support Bot

In this work, we describe practical lessons we have learned from success...

0 Nikos Karampatziakis, et al. ∙

research

∙ 06/15/2016

Logarithmic Time One-Against-Some

We create a new online reduction of multiclass classification to binary ...

0 Hal Daumé III, et al. ∙

research

∙ 02/05/2016

Active Information Acquisition

We propose a general framework for sequential and dynamic acquisition of...

0 He He, et al. ∙

research

∙ 11/10/2015

A Hierarchical Spectral Method for Extreme Classification

Extreme classification problems are multiclass and multilabel classifica...

0 Paul Mineiro, et al. ∙

research

∙ 11/13/2014

A Randomized Algorithm for CCA

We present RandomizedCCA, a randomized algorithm for computing canonical...

0 Paul Mineiro, et al. ∙

research

∙ 10/07/2013

Least Squares Revisited: Scalable Approaches for Multi-class Prediction

This work provides simple algorithms for multi-class (and multi-label) p...

0 Alekh Agarwal, et al. ∙

research

∙ 10/07/2013

Discriminative Features via Generalized Eigenvectors

Representing examples in a way that is compatible with the underlying cl...

0 Nikos Karampatziakis, et al. ∙

research

∙ 06/07/2013

Loss-Proportional Subsampling for Subsequent ERM

We propose a sampling scheme suitable for reducing a data set prior to s...

0 Paul Mineiro, et al. ∙

research

∙ 06/13/2011

Efficient Optimal Learning for Contextual Bandits

We address the problem of learning in an online setting where the learne...

0 Miroslav Dudík, et al. ∙

Nikos Karampatziakis

Featured Co-authors

Sign in with Google

Consider DeepAI Pro