PatentEval: Understanding Errors in Patent Generation - PaRis AI Research InstitutE
Conference paper, Year: 2024

PatentEval: Understanding Errors in Patent Generation

Abstract

In this work, we introduce a comprehensive error typology specifically designed for evaluating two distinct tasks in machine-generated patent texts: claims-to-abstract generation, and the generation of the next claim given previous ones. We have also developed a benchmark, PatentEval, for systematically assessing language models in this context. Our study includes a comparative analysis, annotated by humans, of various models. These range from those specifically adapted during training for tasks within the patent domain to the latest general-purpose large language models (LLMs). Furthermore, we explored and evaluated some metrics to approximate human judgments in patent text evaluation, analyzing the extent to which these metrics align with expert assessments. These approaches provide valuable insights into the capabilities and limitations of current language models in the specialized field of patent text generation.
Main file
acl_latex.pdf (753.11 KB)
NAACL2024___PatentEval.pdf (781.92 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-04595013, version 1 (04-06-2024)

Identifiers

  • HAL Id: hal-04595013, version 1

Cite

You Zuo, Kim Gerdes, Eric Villemonte de La Clergerie, Benoît Sagot. PatentEval: Understanding Errors in Patent Generation. NAACL2024 - 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Jun 2024, Mexico City, Mexico. ⟨hal-04595013⟩