Filtern nach
Letzte Suchanfragen

Ergebnisse für *

Zeige Ergebnisse 1 bis 3 von 3.

  1. Building and using comparable corpora for multilingual natural language processing
    Erschienen: [2023]
    Verlag:  Springer, Cham, Switzerland

    This book provides a comprehensive overview of methods to build comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing. The authors begin with... mehr

    Universitätsbibliothek Stuttgart
    keine Fernleihe

     

    This book provides a comprehensive overview of methods to build comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing. The authors begin with a brief history on the topic followed by a comparison to parallel resources and an explanation of why comparable corpora have become more widely used. In particular, they provide the basis for the multilingual capabilities of pre-trained models, such as BERT or GPT. The book then focuses on building comparable corpora, aligning their sentences to create a database of suitable translations, and using these sentence translations to produce dictionaries and term banks. Then, it is explained how comparable corpora can be used to build machine translation engines and to develop a wide variety of multilingual applications

     

    Export in Literaturverwaltung   RIS-Format
      BibTeX-Format
    Hinweise zum Inhalt
    Cover (lizenzpflichtig)
    Quelle: Verbundkataloge
    Sprache: Englisch
    Medientyp: Buch (Monographie)
    Format: Druck
    ISBN: 9783031313837
    Schriftenreihe: Synthesis lectures on human language technologies
    Schlagworte: Angewandte Informatik; COM094000; COMPUTERS / Computer Science; COMPUTERS / Natural Language Processing; Computational linguistics; Computerlinguistik und Korpuslinguistik; Information technology: general issues; Machine learning; Maschinelles Lernen; Natural language & machine translation; Natürliche Sprachen und maschinelle Übersetzung
    Umfang: viii, 133 Seiten, Illustrationen, Diagramme
    Bemerkung(en):

    Chapter 1 Introduction.- Chapter 2 Basic principles of cross-lingual models.- Chapter 3 Building comparable corpora.- Chapter 4 Extraction of parallel sentences.- Chapter 5 Induction of bilingual Dictionaries.- Chapter 6 Comparable and Parallel Corpora for Machine Translation.- Chapter 7 Other applications of comparable corpora.- Chapter 8 Conclusions and future research.- Index.

  2. Building and using comparable corpora for multilingual Natural language processing
    Erschienen: [2023]
    Verlag:  Springer International Publishing AG, Cham

    Intro -- Contents -- About the Authors -- 1 Introduction -- 1.1 Rationale for Working with Comparable Corpora -- 1.1.1 Availability of Truly Parallel Data -- 1.1.2 Translationese in Parallel Data -- 1.2 Levels of Comparability -- 1.3 Methodology for... mehr

    Zugang:
    Aggregator (lizenzpflichtig)
    Universitätsbibliothek Leipzig
    keine Fernleihe

     

    Intro -- Contents -- About the Authors -- 1 Introduction -- 1.1 Rationale for Working with Comparable Corpora -- 1.1.1 Availability of Truly Parallel Data -- 1.1.2 Translationese in Parallel Data -- 1.2 Levels of Comparability -- 1.3 Methodology for Dealing with Comparable Resources -- 2 Basic Principles of Cross-Lingual Models -- [DELETE] -- 2.1 Monolingual VSMs -- 2.2 Cross-Lingual VSMs -- 2.3 Contextual Embeddings -- 3 Building Comparable Corpora -- [DELETE] -- 3.1 Measures for Document Similarity Across Languages -- 3.2 Evaluation Methods and Datasets -- 3.3 Natural Annotation: Building Strongly Comparable Corpora -- 3.4 Low-Hanging Fruit: Building Weakly Comparable Corpora -- 3.5 Large Scale Document Alignment -- 3.5.1 Structural Similarity -- 3.5.2 Lexical Similarity -- 3.6 Comparable Corpora of Unrelated Documents -- 4 Extraction of Parallel Sentences -- [DELETE] -- 4.1 Extraction from Parallel Corpora -- 4.2 Assessing Cross-Lingual Sentence Similarity -- 4.3 Datasets and Evaluation -- 4.4 General Principles -- 4.5 Pre-neural Methods -- 4.6 Supervised Neural Methods -- 4.7 Limitations of Supervised Methods in Low-Resource Settings -- 4.8 Unsupervised Neural Methods -- 5 Induction of Bilingual Dictionaries -- 5.1 Setting the Task -- 5.2 Bilingual Lexicon Induction From Parallel Corpora -- 5.3 Matching Contexts -- 5.4 Geometric Properties of Word Embedding Spaces -- 5.5 Alignment of Word Embeddings -- 5.6 Alignment of Contextual Embeddings -- 5.7 Evaluation -- 5.7.1 Evaluation Experiments for BLI -- 5.7.2 Evaluation on Multilingual Termbanks -- 5.8 The BUCC 2020 Shared Task on Bilingual Dictionary Induction -- 5.8.1 Resources -- 5.8.2 Evaluation -- 5.9 The BUCC 2022 Shared Task on Bilingual Terminology Extraction -- 5.9.1 Specifications of the Task -- 5.9.2 Shared Task Results -- 6 Comparable and Parallel Corpora for Machine Translation.

     

    Export in Literaturverwaltung   RIS-Format
      BibTeX-Format
    Quelle: Verbundkataloge
    Sprache: Englisch
    Medientyp: Ebook
    Format: Online
    ISBN: 9783031313844
    Schriftenreihe: Synthesis lectures on human language technologies series
    Schlagworte: Angewandte Informatik; COM094000; COMPUTERS / Computer Science; COMPUTERS / Natural Language Processing; Computational linguistics; Computerlinguistik und Korpuslinguistik; Information technology: general issues; Machine learning; Maschinelles Lernen; Natural language & machine translation; Natürliche Sprachen und maschinelle Übersetzung
    Umfang: 1 Online-Ressource (viii, 133 Seiten)
    Bemerkung(en):

    Description based on publisher supplied metadata and other sources

  3. Building and Using Comparable Corpora
    Beteiligt: Sharoff, Serge (Herausgeber); Rapp, Reinhard (Herausgeber); Zweigenbaum, Pierre (Herausgeber); Fung, Pascale (Herausgeber)
    Erschienen: 2013
    Verlag:  Springer Berlin Heidelberg, Berlin, Heidelberg ; Springer International Publishing AG, Cham

    Bibliothek der Hochschule Darmstadt, Zentralbibliothek
    keine Fernleihe
    Universitätsbibliothek J. C. Senckenberg, Zentralbibliothek (ZB)
    keine Fernleihe
    Export in Literaturverwaltung   RIS-Format
      BibTeX-Format
    Quelle: Verbundkataloge
    Beteiligt: Sharoff, Serge (Herausgeber); Rapp, Reinhard (Herausgeber); Zweigenbaum, Pierre (Herausgeber); Fung, Pascale (Herausgeber)
    Sprache: Englisch
    Medientyp: Ebook
    Format: Online
    ISBN: 9783642201288; 3642201288
    Weitere Identifier:
    RVK Klassifikation: ES 960
    DDC Klassifikation: Datenverarbeitung; Informatik (004); Sprache (400)
    Auflage/Ausgabe: 1st ed. 2013
    Schlagworte: Computerlinguistik; Übersetzung; Korpus <Linguistik>; Vergleichbarkeit; Natural language processing (Computer science); Computational linguistics; Application software; Natural Language Processing (NLP); Computational Linguistics; Computer and Information Systems Applications
    Umfang: 1 Online-Ressource (XII, 335 Seiten), 70 illus., 14 illus. in color.