Jie Fu
85 papers · 2014–2025 · 15 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π§ Keyword Pioneer π£ Hot Topic Early Bird πΊοΈ Taxonomy Completionist (17) π Interdisciplinary Bridge π Conference Polyglot (15)
π
Renaissance Researcher
(11)
π
Academic Marathon
(11)
πΊοΈ
Taxonomy Completionist
(17)
π
Conference Loyalist
(21)
π
Grand Slam
π
Triple Crown
π₯
Mega-Team
(40)
π€
Dynamic Duo
(21)
ποΈ
Keyword Collector
(326)
β
The Questioner
(3)
β‘
Prolific Year
(25)
π
Conference Pioneer
π
Trend Setter
π
Century Club
(85)
π₯
Unstoppable
(7)
Conferences
ACL (21)
EMNLP (16)
ICLR (12)
NIPS (7)
IJCAI (6)
NAACL (6)
AAAI (5)
ICML (3)
IJCNLP (3)
AACL (1)
COLING (1)
CORL (1)
ECCV (1)
ICCV (1)
RSS (1)
Top co-authors
Research topics
Keywords
large language model
(15)
text generation
(5)
question answering
(4)
instruction tuning
(4)
machine reading comprehension
(4)
graph neural network
(4)
zero-shot learning
(3)
reinforcement learning
(3)
text classification
(3)
temporal logic
(3)
transfer learning
(3)
knowledge distillation
(3)
interactive learning
(3)
markov decision process
(3)
transformer architecture
(2)
data augmentation
(2)
unsupervised learning
(2)
machine translation
(2)
gradient-based optimization
(2)
deep reinforcement learning
(2)
Papers
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
ACL 2025
Efficient Domain Continual pretraining by Mitigating the Stability Gap
ACL 2025
PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment
ACL 2025
Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking
ACL 2025
Enhancing Language Model Hypernetworks with Restart: A Study on Optimization
NAACL 2025
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning
NAACL 2025
A Closer Look into Mixture-of-Experts in Large Language Models
NAACL 2025
PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative APIs
ICML 2025
Layerwise Recurrent Router for Mixture-of-Experts
ICLR 2025
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
ICLR 2025
Huatuo-26M, a Large-scale Chinese Medical QA Dataset
NAACL 2025
Learning Nash Equilibrium of Markov Potential Games with a Shared Constraint via Primal-Dual Optimization
AAAI 2025
Sequential Decision Making in Stochastic Games with Incomplete Preferences over Temporal Objectives
AAAI 2025
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text
ICLR 2025
MuPT: A Generative Symbolic Music Pretrained Transformer
ICLR 2025
Fine-Grained Manipulation of Arithmetic Neurons
EMNLP 2025
MIO: A Foundation Model on Multimodal Tokens
EMNLP 2025
Information-Theoretic Opacity-Enforcement in Markov Decision Processes
IJCAI 2024
Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training
NIPS 2024
D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models
NIPS 2024
Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers
AAAI 2024
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
ACL 2024
HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts
ACL 2024
E2-LLM: Efficient and Extreme Length Extension of Large Language Models
ACL 2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM
ACL 2024
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models
ACL 2024
SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval
ACL 2024
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
ACL 2024
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
ACL 2024
CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation
COLING 2024
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers
ECCV 2024
LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
EMNLP 2024
Unlocking Continual Learning Abilities in Language Models
EMNLP 2024
PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness
EMNLP 2024
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
ICLR 2024
Massive Editing for Large Language Models via Meta Learning
ICLR 2024
Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs
ICLR 2024
ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate
ICLR 2024
Think Before You Act: Decision Transformers with Working Memory
ICML 2024
AutoAgents: A Framework for Automatic Agent Generation
IJCAI 2024
Unlocking Emergent Modularity in Large Language Models
NAACL 2024
Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-Tuning
NAACL 2024
Probabilistic Planning with Prioritized Preferences over Temporal Logic Objectives
IJCAI 2023
MARBLE: Music Audio Representation Benchmark for Universal Evaluation
NIPS 2023
Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias
NIPS 2023
GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning
NIPS 2023
Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models
EMNLP 2023
Unifying Discrete and Continuous Representations for Unsupervised Paraphrase Generation
EMNLP 2023
Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning
EMNLP 2023
When Do Graph Neural Networks Help with Node Classification? Investigating the Homophily Principle on Node Distinguishability
NIPS 2023
Text Editing as Imitation Game
EMNLP 2022
1Cademy @ Causal News Corpus 2022: Leveraging Self-Training in Causality Classification of Socio-Political Event Data
EMNLP 2022
1Cademy @ Causal News Corpus 2022: Enhance Causal Span Detection via Beam-Search-based Position Selector
EMNLP 2022
Bidirectional Learning for Offline Infinite-width Model-based Optimization
NIPS 2022
Unifying Likelihood-free Inference with Black-box Optimization and Beyond
ICLR 2022
HERB: Measuring Hierarchical Regional Bias in Pre-trained Language Models
AACL 2022
Biological Sequence Design with GFlowNets
ICML 2022
Learning Multi-Objective Curricula for Robotic Policy Learning
CORL 2022
Reconciliation of Pre-trained Models and Prototypical Neural Networks in Few-shot Named Entity Recognition
EMNLP 2022
FloW: A Dataset and Benchmark for Floating Waste Detection in Inland Waters
ICCV 2021
On Orthogonality Constraints for Transformers
ACL 2021
GLGE: A New General Language Generation Evaluation Benchmark
ACL 2021
On Orthogonality Constraints for Transformers
IJCNLP 2021
CoCon: A Self-Supervised Approach for Controlled Text Generation
ICLR 2021
Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters
ICLR 2021
GLGE: A New General Language Generation Evaluation Benchmark
IJCNLP 2021
Jacobian Adversarially Regularized Networks for Robustness
ICLR 2020
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space
EMNLP 2020
Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation
EMNLP 2020
Synthesis of Deceptive Strategies in Reachability Games with Action Misperception
IJCAI 2020
Semi-Dynamic Hypergraph Neural Network for 3D Pose Estimation
IJCAI 2020
RikiNet: Reading Wikipedia Pages for Natural Question Answering
ACL 2020
Would you Rather? A New Benchmark for Learning Machine Alignment with Cultural Values and Social Preferences
ACL 2020
Interactive Machine Comprehension with Information Seeking Agents
ACL 2020
Revision in Continuous Space: Unsupervised Text Style Transfer without Adversarial Learning
AAAI 2020
Graph Neural Networks with Generated Parameters for Relation Extraction
ACL 2019
Interactive Language Learning by Question Answering
IJCNLP 2019
Learning Multi-Task Communication with Message Passing for Sequence Learning
AAAI 2019
Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives
ACL 2019
TIGS: An Inference Algorithm for Text Infilling with Gradient Search
ACL 2019
Interactive Language Learning by Question Answering
EMNLP 2019
Lightweight and Efficient Neural Natural Language Processing with Quaternion Networks
ACL 2019
Structure Learning for Neural Module Networks
EMNLP 2019
DrMAD: Distilling Reverse-Mode Automatic Differentiation for Optimizing Hyperparameters of Deep Neural Networks
IJCAI 2016
Probably Approximately Correct MDP Learning and Control With Temporal Logic Constraints
RSS 2014