Papers
401 papers found
FAERY: An FPGA-accelerated Embedding-based Retrieval System
Chaoliang Zeng, Layong Luo, Qingsong Ning et al.
From Dynamic Loading to Extensible Transformation: An Infrastructure for Dynamic Library Transformation
Yuxin Ren, Kang Zhou, Jianhai Luan et al.
Groove: Flexible Metadata-Private Messaging
Ludovic Barman, Moshe Kol, David Lazar et al.
Hubble: Performance Debugging with In-Production, Just-In-Time Method Tracing on Android
Yu Luo, Kirk Rodrigues, Cuiqin Li et al.
Immortal Threads: Multithreaded Event-driven Intermittent Computing on Ultra-Low-Power Microcontrollers
Eren Yıldız, Lijun Chen, Kasim Sinan Yıldırım
Jawa: Web Archival in the Era of JavaScript
Ayush Goel, Jingyuan Zhu, Ravi Netravali et al.
KSplit: Automating Device Driver Isolation
Yongzhe Huang, Vikram Narayanan, David Detweiler et al.
ListDB: Union of Write-Ahead Logs and Persistent SkipLists for Incremental Checkpointing on Persistent Memory
Wonbae Kim, Chanyeol Park, Dongui Kim et al.
Looking Beyond GPUs for DNN Scheduling on Multi-Tenant Clusters
Jayashree Mohan, Amar Phanishayee, Janardhan Kulkarni et al.
MemLiner: Lining up Tracing and Application for a Far-Memory-Friendly Runtime
Chenxi Wang, Haoran Ma, Shi Liu et al.
Metastable Failures in the Wild
Lexiang Huang, Matthew Magnusson, Abishek Bangalore Muralikrishna et al.
Microsecond-scale Preemption for Concurrent GPU-accelerated DNN Inferences
Mingcong Han, Hanze Zhang, Rong Chen et al.
Occualizer: Optimistic Concurrent Search Trees From Sequential Code
Tomer Shanny, Adam Morrison
ODINFS: Scaling PM Performance with Opportunistic Delegation
Diyu Zhou, Yuchen Qian, Vishal Gupta et al.
Operating System Support for Safe and Efficient Auxiliary Execution
Yuzhuo Jing, Peng Huang
Orca: A Distributed Serving System for Transformer-Based Generative Models
Gyeong-In Yu, Joo Seong Jeong, Geon-Woo Kim et al.
ORION and the Three Rights: Sizing, Bundling, and Prewarming for Serverless DAGs
Ashraf Mahgoub, Edgardo Barsallo Yi, Karthick Shankar et al.
Owl: Scale and Flexibility in Distribution of Hot Content
Jason Flinn, Xianzheng Dou, Arushi Aggarwal et al.
Practically Correct, Just-in-Time Shell Script Parallelization
Konstantinos Kallas, Tammam Mustafa, Jan Bielak et al.
RESIN: A Holistic Service for Dealing with Memory Leaks in Production Cloud Infrastructure
Chang Lou, Cong Chen, Peng Huang et al.
SHORTSTACK: Distributed, Fault-tolerant, Oblivious Data Access
Midhul Vuppalapati, Kushal Babel, Anurag Khandelwal et al.
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute
Ningxin Zheng, Bin Lin, Quanlu Zhang et al.
Tiger: Disk-Adaptive Redundancy Without Placement Restrictions
Saurabh Kadekodi, Francisco Maturana, Sanjith Athlur et al.
TriCache: A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs
Guanyu Feng, Huanqi Cao, Xiaowei Zhu et al.
Trinity: High-Performance Mobile Emulation through Graphics Projection
Di Gao, Hao Lin, Zhenhua Li et al.