The Place of Language Models in the Information-Theoretic Science of Language

What:

Talk

Part of:

Day 1

When:

9:00 AM, Monday, June 3, 2024 EDT (1 hour 30 minutes)

Where:

Université du Québec à Montréal

Theme:

Large Language Models & Understanding

Language models succeed in part because they share information-processing constraints with humans. These constraints concern neither the specific neural-network architecture nor any hardwired formal structure; they arise from the core task that language models and the brain share: predicting upcoming input. I show that universals of language can be explained in terms of generic information-theoretic constraints, and that the same constraints explain language model performance when learning human-like versus non-human-like languages. I argue that this information-theoretic approach provides a deeper explanation for the nature of human language than purely symbolic approaches, and that it links the science of language with neuroscience and machine learning.

References

Futrell, R., & Hahn, M. (2024). Linguistic structure from a bottleneck on sequential information processing. arXiv preprint arXiv:2405.12109.

Kallini, J., Papadimitriou, I., Futrell, R., Mahowald, K., & Potts, C. (2024). Mission: Impossible language models. arXiv preprint arXiv:2401.06416.

Wilcox, E. G., Futrell, R., & Levy, R. (2023). Using computational models to test syntactic learnability. Linguistic Inquiry, 1-44.

Richard Futrell

Speaker
