Kyle Min

research

∙ 06/18/2023

STHG: Spatial-Temporal Heterogeneous Graph Learning for Advanced Audio-Visual Diarization

This report introduces our novel method named STHG for the Audio-Visual ...

0 Kyle Min, et al. ∙

research

∙ 06/07/2023

WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models

The rapid advancement of generative models, facilitating the creation of...

0 Changhoon Kim, et al. ∙

research

∙ 04/18/2023

SViTT: Temporal Learning of Sparse Video-Text Transformers

Do video-text transformers learn to model temporal relationships across ...

0 Yi Li, et al. ∙

research

∙ 04/03/2023

Unbiased Scene Graph Generation in Videos

The task of dynamic scene graph generation (SGG) from videos is complica...

0 Sayak Nag, et al. ∙

research

∙ 10/14/2022

Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization

This report describes our approach for the Audio-Visual Diarization (AVD...

0 Kyle Min, et al. ∙

research

∙ 07/15/2022

Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection

Active speaker detection (ASD) in videos with multiple speakers is a cha...

0 Kyle Min, et al. ∙

research

∙ 12/02/2021

Learning Spatial-Temporal Graphs for Active Speaker Detection

We address the problem of active speaker detection through a new framewo...

0 Sourya Roy, et al. ∙

research

∙ 11/08/2020

Integrating Human Gaze into Attention for Egocentric Activity Recognition

It is well known that human gaze carries significant information about v...

0 Kyle Min, et al. ∙

research

∙ 07/13/2020

Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization

Temporally localizing activities within untrimmed videos has been extens...

0 Kyle Min, et al. ∙

research

∙ 08/15/2019

TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection

TASED-Net is a 3D fully-convolutional network architecture for video sal...

5 Kyle Min, et al. ∙

research

∙ 04/02/2018

Hierarchical Novelty Detection for Visual Object Recognition

Deep neural networks have achieved impressive success in large-scale vis...

1 Kibok Lee, et al. ∙
