Papers
441 papers found
AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels
Nicholas Roberts, Xintong Li, Tzu-Heng Huang et al.
The trade-offs of model size in large recommendation models : 100GB to 10MB Criteo-tb DLRM model
Aditya Desai, Anshumali Shrivastava
DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps
Cheng Lu, Yuhao Zhou, Fan Bao et al.
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark
Jiaxi Gu, Xiaojun Meng, Guansong Lu et al.
Objaverse-XL: A Universe of 10M+ 3D Objects
Matt Deitke, Ruoshi Liu, Matthew Wallingford et al.
Saving 100x Storage: Prototype Replay for Reconstructing Training Sample Distribution in Class-Incremental Semantic Segmentation
Jinpeng Chen, Runmin Cong, Yuxuan LUO et al.
DISCO-10M: A Large-Scale Music Dataset
Luca Lanzendörfer, Florian Grötschla, Emil Funke et al.
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Coleman Hooper, Sehoon Kim, Hiva Mohammadzadeh et al.
Scaling the Codebook Size of VQ-GAN to 100,000 with a Utilization Rate of 99%
Lei Zhu, Fangyun Wei, Yanye Lu et al.
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens
Anas Awadalla, Le Xue, Oscar Lo et al.
The Multimodal Universe: Enabling Large-Scale Machine Learning with 100 TB of Astronomical Scientific Data
Eirini Angeloudi, Jeroen Audenaert, Micah Bowles et al.
Swift Sampler: Efficient Learning of Sampler by 10 Parameters
Jiawei Yao, Chuming Li, Canran Xiao
Just Add $100 More: Augmenting Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem
Mincheol Chang, Siyeong Lee, Jinkyu Kim et al.
HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting
Yuanhao Cai, Zihao Xiao, Yixun Liang et al.
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection
Yuxuan Li, Xiang Li, Weijie Li et al.
Slice-100K: A Multimodal Dataset for Extrusion-based 3D Printing
Anushrut Jignasu, Kelly O. Marshall, Ankush Kumar Mishra et al.
Up to 100x Faster Data-Free Knowledge Distillation
Gongfan Fang, Kanya Mo, Xinchao Wang et al.
TinyNeRF: Towards 100 x Compression of Voxel Radiance Fields
Tianli Zhao, Jiayuan Chen, Cong Leng et al.
CowClip: Reducing CTR Prediction Model Training Time from 12 Hours to 10 Minutes on 1 GPU
Zangwei Zheng, Pengtai Xu, Xuan Zou et al.
Foundations of Autonomous Vehicles: A Curriculum Model for Developing Competencies in Artificial Intelligence and the Internet of Things for Grades 7–10
Elham Buxton, Elahe Javadi, Matthew Hagaman
Queries, Representation & Detection: The Next 100 Model Fingerprinting Schemes
Augustin Godinot, Erwan Le Merrer, Camilla Penzo et al.
SP-10K: A Large-scale Evaluation Set for Selectional Preference Acquisition
Hongming Zhang, Hantian Ding, Yangqiu Song
CODA-19: Using a Non-Expert Crowd to Annotate Research Aspects on 10,000+ Abstracts in the COVID-19 Open Research Dataset
Ting-Hao Kenneth Huang, Chieh-Yang Huang, Chien-Kuang Cornelia Ding et al.