| 1 |
Distilling Reasoning Capabilities into Smaller Language Models
Kumar Shridhar, Alessandro Stolfo, Mrinmaya Sachan
|
📜
Death by README
|
cs.LG
|
224 |
3 years ago |
| 2 |
A Survey of Deep Learning for Mathematical Reasoning
Pan Lu, Liang Qiu, ... (+3 more)
|
📜
Death by README
|
cs.AI
|
187 |
3 years ago |
| 3 |
When Federated Learning Meets Pre-trained Language Models' Parameter-Efficient Tuning Methods
Zhuo Zhang, Yuanhang Yang, ... (+3 more)
|
📜
Death by README
|
cs.LG
|
118 |
3 years ago |
| 4 |
Explicit Alignment Objectives for Multilingual Bidirectional Encoders
Junjie Hu, Melvin Johnson, ... (+3 more)
|
📜
Death by README
|
cs.CL
|
63 |
5 years ago |
| 5 |
WikiBERT models: deep transfer learning for many languages
Sampo Pyysalo, Jenna Kanerva, ... (+2 more)
|
📜
Death by README
|
cs.CL
|
40 |
5 years ago |
| 6 |
Overcoming Language Priors in Visual Question Answering via Distinguishing Superficially Similar Instances
Yike Wu, Yu Zhao, ... (+5 more)
|
📜
Death by README
|
cs.CL
|
25 |
3 years ago |
| 7 |
Template-free Data-to-Text Generation of Finnish Sports News
Jenna Kanerva, Samuel Rönnqvist, ... (+3 more)
|
📜
Death by README
|
cs.CL
|
21 |
6 years ago |
| 8 |
Context Dependent Semantic Parsing: A Survey
Zhuang Li, Lizhen Qu, Gholamreza Haffari
|
📜
Death by README
|
cs.CL
|
21 |
5 years ago |
| 9 |
SLM-Mod: Small Language Models Surpass LLMs at Content Moderation
Xianyang Zhan, Agam Goyal, ... (+3 more)
|
📜
Death by README
|
cs.CL
|
20 |
1 year ago |
| 10 |
Controllable Text Summarization: Unraveling Challenges, Approaches, and Prospects -- A Survey
Ashok Urlana, Pruthwik Mishra, ... (+2 more)
|
📜
Death by README
|
cs.CL
|
18 |
2 years ago |
| 11 |
BPP-Search: Enhancing Tree of Thought Reasoning for Mathematical Modeling Problem Solving
Teng Wang, Wing-Yin Yu, ... (+9 more)
|
📜
Death by README
|
cs.AI
|
11 |
1 year ago |
| 12 |
KoCHET: a Korean Cultural Heritage corpus for Entity-related Tasks
Gyeongmin Kim, Jinsung Kim, ... (+2 more)
|
📜
Death by README
|
cs.CL
|
9 |
3 years ago |
| 13 |
IndicIRSuite: Multilingual Dataset and Neural Information Models for Indian Languages
Saiful Haq, Ashutosh Sharma, Pushpak Bhattacharyya
|
📜
Death by README
|
cs.IR
|
7 |
2 years ago |
| 14 |
From Generalist to Specialist: A Survey of Large Language Models for Chemistry
Yang Han, Ziping Wan, ... (+3 more)
|
📜
Death by README
|
physics.chem-ph
|
7 |
1 year ago |
| 15 |
Obliviate: Neutralizing Task-agnostic Backdoors within the Parameter-efficient Fine-tuning Paradigm
Jaehan Kim, Minkyoo Song, ... (+2 more)
|
📜
Death by README
|
cs.CL
|
6 |
1 year ago |
| 16 |
EIT: Enhanced Interactive Transformer
Tong Zheng, Bei Li, ... (+3 more)
|
📜
Death by README
|
cs.CL
|
3 |
3 years ago |
| 17 |
GA-S$^3$: Comprehensive Social Network Simulation with Group Agents
Yunyao Zhang, Zikai Song, ... (+5 more)
|
📜
Death by README
|
cs.SI
|
3 |
9 months ago |
| 18 |
SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving
Xueliang Zhao, Xinting Huang, ... (+2 more)
|
📜
Death by README
|
cs.CL
|
1 |
2 years ago |
| 19 |
ECoRAG: Evidentiality-guided Compression for Long Context RAG
Yeonseok Jeong, Jinsu Kim, ... (+2 more)
|
📜
Death by README
|
cs.CL
|
1 |
9 months ago |