Many popular machine learning models scale poorly when deployed on CPUs....
Creating and destroying threads on modern Linux systems incurs high late...
Multi-thread applications commonly utilize condition variables for
commu...
The Transformer architecture revolutionized the field of natural languag...
We present Hemlock, a novel mutual exclusion locking algorithm that is
e...
For scope and context, the idea we'll describe below, Compact Java Monit...
Atomic lock-free multi-word compare-and-swap (MCAS) is a powerful tool f...
Range locks are a synchronization construct designed to provide concurre...
Classic test-and-test (TS) mutual exclusion locks are simple, and enjoy ...
Saturated locks often degrade the performance of a multithreaded applica...
Modern multi-socket architectures exhibit non-uniform memory access (NUM...
The classic ticket lock consists of ticket and grant fields. Arriving th...
Designers of modern reader-writer locks confront a difficult trade-off
r...
The hardware transactional memory (HTM) implementation in Intel's i7-477...