Papers | Notion
Accepted Papers – ACL 2022 Workshop “BigScience – Challenges & Perspectives in Creating Large Language Models”
What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization? (2022)
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts (2022)
Paper Submission: Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources (2022)
🗄️ Paper Submission: Data Governance
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP (2021)
Multitask Prompted Training Enables Zero-Shot Task Generalization (2021)
Masader: Metadata Sourcing for Arabic Text and Speech Data Resources (2021)