Towards an Ethical Compression of Large Language Models - Equipe de Recherche en Ingénierie des Connaissances
Conference paper, 2024


Abstract

This proposal explores the fairness of compressed large language models (LLMs). Motivated by recent studies, we focus on the ethical implications of applying efficient compression techniques, particularly quantization, to generative LLMs. While quantization is known to improve inference efficiency, as shown in existing work, our research focuses on understanding its effects on token-level confidence and on the predictive probability distributions. We also identify significant influences on LLM behaviour during text generation, shedding light on potential biases and ethical concerns. Having measured the difference in output probability distributions before and after compression, we aim to use this observation to propose a debiasing quantization approach.
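The kind of token-level comparison the abstract describes can be illustrated with a minimal sketch. This is a hypothetical example, not the authors' code: it simulates the effect of low-precision storage by coarsely rounding next-token logits, then compares the original and "quantized" probability distributions via KL divergence and the shift in top-token confidence.

```python
# Hypothetical sketch (not the authors' method): measuring how a simulated
# quantization step shifts a model's next-token probability distribution.
import math

def softmax(logits):
    """Convert logits to a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]

def fake_quantize(logits, step=0.5):
    """Simulate low-precision storage by rounding logits to a coarse grid."""
    return [round(x / step) * step for x in logits]

def kl_divergence(p, q):
    """KL(p || q): how much the quantized distribution q diverges from p."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Toy next-token logits for a 5-token vocabulary (illustrative values only).
logits = [2.3, 1.1, 0.2, -0.7, -1.5]
p = softmax(logits)                 # full-precision distribution
q = softmax(fake_quantize(logits))  # distribution after simulated quantization

print(f"KL(p || q) = {kl_divergence(p, q):.6f}")
print(f"top-token confidence shift = {q[0] - p[0]:+.6f}")
```

In practice, the same comparison would be run on the actual output distributions of a full-precision model and its quantized counterpart over a corpus, rather than on rounded toy logits.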
Main file

soumission_ethique_tal-2.pdf (50.69 Ko)
Origin: Files produced by the author(s)

Dates and versions

hal-04646400 , version 1 (12-07-2024)

Identifiers

  • HAL Id : hal-04646400 , version 1

Cite

Irina Proskurina, Guillaume Metzler, Julien Velcin. Towards an Ethical Compression of Large Language Models. Journée Éthique et TAL 2024, Apr 2024, Nancy, France. ⟨hal-04646400⟩