347 search results
ATHENA RC Repository

A parallel corpus collected from the European Constitution. 21 languages, 210 bitexts; total number of files: 986; total number of tokens: 3.01M; total number of sentence fragments: 0.22M.

Languages:
Maltese
English
French
Swedish
...
Media Type:
text
Keywords:
multilingual
European Constitution
alignment
parallel

92 languages, 4,101 bitexts; total number of files: 75,535; total number of tokens: 60.75M; total number of sentence fragments: 8.89M.

Languages:
Turkish
Swedish
Slovak
Khmer
...
Media Type:
text
Keywords:
parallel
alignment
multilingual
technical texts

The EUconst subcorpus (DE-EL) (Moses) is a parallel subcorpus for German and Greek, a subset of EUconst, a parallel corpus collected from the European Constitution. 21 languages, 210 bitexts; total number of files: 986; total number of tokens: 3.01M; total number of sentence fragm ...

Languages:
German
Modern Greek (1453-)
Media Type:
text
Keywords:
parallel
alignment
bilingual
administrative texts

The EUconst subcorpus (DE-EL) (TMX) is a parallel subcorpus for German and Greek, a subset of EUconst, a parallel corpus collected from the European Constitution. 21 languages, 210 bitexts; total number of files: 986; total number of tokens: 3.01M; total number of sentence fragmen ...

Languages:
German
Modern Greek (1453-)
Media Type:
text
Keywords:
parallel
alignment
bilingual
administrative texts

The EUconst subcorpus EL-PT (Moses) is a parallel subcorpus for Greek and Portuguese, a subset of EUconst, a parallel corpus collected from the European Constitution. 21 languages, 210 bitexts; total number of files: 986; total number of tokens: 3.01M; total number of sentence fra ...

Languages:
Modern Greek (1453-)
Portuguese
Media Type:
text
Keywords:
parallel
alignment
bilingual
administrative texts

The EUconst subcorpus EL-PT (TMX) is a parallel subcorpus for Greek and Portuguese, a subset of EUconst, a parallel corpus collected from the European Constitution. 21 languages, 210 bitexts; total number of files: 986; total number of tokens: 3.01M; total number of sentence fragm ...

Languages:
Modern Greek (1453-)
Portuguese
Media Type:
text
Keywords:
parallel
alignment
bilingual
administrative texts

The EUconst subcorpus EN-EL (Moses) is a parallel subcorpus for English and Greek, a subset of EUconst, a parallel corpus collected from the European Constitution. 21 languages, 210 bitexts; total number of files: 986; total number of tokens: 3.01M; total number of sentence fragme ...

Languages:
Modern Greek (1453-)
English
Media Type:
text
Keywords:
parallel
alignment
bilingual
administrative texts

The EUconst subcorpus EN-EL (TMX) is a parallel subcorpus for English and Greek, a subset of EUconst, a parallel corpus collected from the European Constitution. 21 languages, 210 bitexts; total number of files: 986; total number of tokens: 3.01M; total number of sentence fragment ...

Languages:
Modern Greek (1453-)
English
Media Type:
text
Keywords:
parallel
alignment
bilingual
administrative texts

A collection of parallel sentences for seven under-resourced languages (Croatian, Estonian, Greek, Latvian, Lithuanian, Romanian and Slovenian), together with English and German.

Languages:
Lithuanian
English
German
Modern Greek (1453-)
...
Media Type:
text
Keywords:
written language
multilingual
parallel

The corpus contains comparable sentence pairs extracted from comparable news corpora for 12 language pairs: English-Croatian, English-Greek, English-Estonian, English-Latvian, English-Lithuanian, English-Romanian, English-Slovenian, Greek-Romanian, Latvian-Lithuanian, Romanian-G ...

Languages:
Lithuanian
English
German
Modern Greek (1453-)
...
Media Type:
text
Keywords:
written language
multilingual
comparable

All Greek to me!™ is the first complete system for automatic transcription of Greeklish into Greek and vice versa with correct spelling. It is the result of long-standing research at ILSP / "Athena" RC, and it is very accurate and efficient in handling mixed-multil ...

Language:
Modern Greek (1453-)
Keywords:
automatic transcription system
text
processing service

GrNE-Tagger is a natural language processing web service designed to automatically detect and classify Named Entities (proper names) in Greek texts into the following five types: PERSON, LOCATION, ORGANIZATION, FACILITY, GPE (Geo-political entity). You can use this service to pro ...

Language:
Modern Greek (1453-)
Keywords:
named entities
text

The COMBO Tool extracts words or letter combinations from CLEAN, a pre-processed corpus of the Greek language, based on specific spelling or pronunciation patterns as well as quantitative criteria set by the user, such as word length (by letters, phones ...

Language:
Modern Greek (1453-)
Keywords:
text
quantitative measures
statistical information

ConceptNet-el

Lexical/Conceptual resource

A computational lexicon of Modern Greek that is conceptually organised as a linguistic ontology. It currently comprises c. 34K tokens (i.e. unique combinations of meaning and form). Both single- and multi-word entries are included in the lexicon. These are specified for ( ...

Language:
Modern Greek (1453-)
Media Type:
text
Keywords:
morphological information
computational lexicon
monolingual
syntactic frames

Anthology of Greek literary texts (prose, poetry, drama, essays) from the 18th century onwards.

Language:
Modern Greek (1453-)
Media Type:
text
Keywords:
written language
monolingual
cultural

The DGT-Acquis is a family of several multilingual parallel corpora extracted from the Official Journal of the European Union (OJ) in Formex 4 (XML) format, consisting of documents from the middle of 2004 to the end of 2011 in up to 23 languages. The original data of the OJ has bee ...

Languages:
Maltese
English
French
Swedish
...
Media Type:
text
Keywords:
Official Journal of the European Union
bilingual
parallel

The DGT-Acquis subcorpus DE-EL is a parallel subcorpus for German and Greek, a subset of the DGT-Acquis. The DGT-Acquis is a family of several multilingual parallel corpora extracted from the Official Journal of the European Union (OJ) in Formex 4 (XML) format, consisting of document ...

Languages:
German
Modern Greek (1453-)
Media Type:
text
Keywords:
parallel
alignment
bilingual
Official Journal of the European Union

The DGT-Acquis subcorpus EL-PT is a parallel subcorpus for Greek and Portuguese, a subset of the DGT-Acquis. The DGT-Acquis is a family of several multilingual parallel corpora extracted from the Official Journal of the European Union (OJ) in Formex 4 (XML) format, consisting of docu ...

Languages:
Modern Greek (1453-)
Portuguese
Media Type:
text
Keywords:
parallel
alignment
bilingual
Official Journal of the European Union

The DGT-Acquis subcorpus EN-EL is a parallel subcorpus for English and Greek, a subset of the DGT-Acquis. The DGT-Acquis is a family of several multilingual parallel corpora extracted from the Official Journal of the European Union (OJ) in Formex 4 (XML) format, consisting of documen ...

Languages:
Modern Greek (1453-)
English
Media Type:
text
Keywords:
parallel
alignment
bilingual
Official Journal of the European Union

Since November 2007 the European Commission's Directorate-General for Translation has made its multilingual Translation Memory for the Acquis Communautaire, DGT-TM, publicly accessible in order to foster the European Commission’s general effort to support multilingualism, languag ...

Languages:
Maltese
English
French
Swedish
...
Media Type:
text
Keywords:
parallel
alignment
translation memory
multilingual