Score Combination for Improved Parallel Corpus Filtering for Low Resource Conditions Article Swipe
Muhammad ElNokrashy
,
Amr Hendy
,
Mohamed Abdelghaffar
,
Mohamed Afify
,
Ahmed Y. Tawfik
,
Hany Hassan Awadalla
·
YOU?
·
· 2020
· Open Access
·
· DOI: https://doi.org/10.48550/arxiv.2011.07933
YOU?
·
· 2020
· Open Access
·
· DOI: https://doi.org/10.48550/arxiv.2011.07933
This paper describes our submission to the WMT20 sentence filtering task. We combine scores from (1) a custom LASER built for each source language, (2) a classifier built to distinguish positive and negative pairs by semantic alignment, and (3) the original scores included in the task devkit. For the mBART finetuning setup, provided by the organizers, our method shows 7% and 5% relative improvement over baseline, in sacreBLEU score on the test set for Pashto and Khmer respectively.
Related Topics
Concepts
Metadata
- Type
- preprint
- Language
- en
- Landing Page
- http://arxiv.org/abs/2011.07933
- https://arxiv.org/pdf/2011.07933
- OA Status
- green
- Cited By
- 1
- References
- 6
- Related Works
- 20
- OpenAlex ID
- https://openalex.org/W3098515495
All OpenAlex metadata
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W3098515495Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.48550/arxiv.2011.07933Digital Object Identifier
- Title
-
Score Combination for Improved Parallel Corpus Filtering for Low Resource ConditionsWork title
- Type
-
preprintOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2020Year of publication
- Publication date
-
2020-11-16Full publication date if available
- Authors
-
Muhammad ElNokrashy, Amr Hendy, Mohamed Abdelghaffar, Mohamed Afify, Ahmed Y. Tawfik, Hany Hassan AwadallaList of authors in order
- Landing page
-
https://arxiv.org/abs/2011.07933Publisher landing page
- PDF URL
-
https://arxiv.org/pdf/2011.07933Direct link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://arxiv.org/pdf/2011.07933Direct OA link when available
- Concepts
-
Computer science, Sentence, Baseline (sea), Classifier (UML), Natural language processing, Task (project management), Artificial intelligence, Test set, Set (abstract data type), Engineering, Programming language, Geology, Oceanography, Systems engineeringTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
1Total citation count in OpenAlex
- Citations by year (recent)
-
2021: 1Per-year citation counts (last 5 years)
- References (count)
-
6Number of works referenced by this work
- Related works (count)
-
20Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W3098515495 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2011.07933 |
| ids.doi | https://doi.org/10.48550/arxiv.2011.07933 |
| ids.mag | 3098515495 |
| ids.openalex | https://openalex.org/W3098515495 |
| fwci | |
| type | preprint |
| title | Score Combination for Improved Parallel Corpus Filtering for Low Resource Conditions |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | 951 |
| biblio.first_page | 947 |
| topics[0].id | https://openalex.org/T10181 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9998999834060669 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Natural Language Processing Techniques |
| topics[1].id | https://openalex.org/T10028 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9995999932289124 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1702 |
| topics[1].subfield.display_name | Artificial Intelligence |
| topics[1].display_name | Topic Modeling |
| topics[2].id | https://openalex.org/T11714 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9955000281333923 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1707 |
| topics[2].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[2].display_name | Multimodal Machine Learning Applications |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C41008148 |
| concepts[0].level | 0 |
| concepts[0].score | 0.7129659652709961 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[0].display_name | Computer science |
| concepts[1].id | https://openalex.org/C2777530160 |
| concepts[1].level | 2 |
| concepts[1].score | 0.6709764003753662 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q41796 |
| concepts[1].display_name | Sentence |
| concepts[2].id | https://openalex.org/C12725497 |
| concepts[2].level | 2 |
| concepts[2].score | 0.644458532333374 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q810247 |
| concepts[2].display_name | Baseline (sea) |
| concepts[3].id | https://openalex.org/C95623464 |
| concepts[3].level | 2 |
| concepts[3].score | 0.6284494996070862 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q1096149 |
| concepts[3].display_name | Classifier (UML) |
| concepts[4].id | https://openalex.org/C204321447 |
| concepts[4].level | 1 |
| concepts[4].score | 0.6269187927246094 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q30642 |
| concepts[4].display_name | Natural language processing |
| concepts[5].id | https://openalex.org/C2780451532 |
| concepts[5].level | 2 |
| concepts[5].score | 0.5772109627723694 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q759676 |
| concepts[5].display_name | Task (project management) |
| concepts[6].id | https://openalex.org/C154945302 |
| concepts[6].level | 1 |
| concepts[6].score | 0.5564249753952026 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[6].display_name | Artificial intelligence |
| concepts[7].id | https://openalex.org/C169903167 |
| concepts[7].level | 2 |
| concepts[7].score | 0.5060035586357117 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q3985153 |
| concepts[7].display_name | Test set |
| concepts[8].id | https://openalex.org/C177264268 |
| concepts[8].level | 2 |
| concepts[8].score | 0.48050153255462646 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q1514741 |
| concepts[8].display_name | Set (abstract data type) |
| concepts[9].id | https://openalex.org/C127413603 |
| concepts[9].level | 0 |
| concepts[9].score | 0.08977550268173218 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q11023 |
| concepts[9].display_name | Engineering |
| concepts[10].id | https://openalex.org/C199360897 |
| concepts[10].level | 1 |
| concepts[10].score | 0.08691051602363586 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q9143 |
| concepts[10].display_name | Programming language |
| concepts[11].id | https://openalex.org/C127313418 |
| concepts[11].level | 0 |
| concepts[11].score | 0.0 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q1069 |
| concepts[11].display_name | Geology |
| concepts[12].id | https://openalex.org/C111368507 |
| concepts[12].level | 1 |
| concepts[12].score | 0.0 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q43518 |
| concepts[12].display_name | Oceanography |
| concepts[13].id | https://openalex.org/C201995342 |
| concepts[13].level | 1 |
| concepts[13].score | 0.0 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q682496 |
| concepts[13].display_name | Systems engineering |
| keywords[0].id | https://openalex.org/keywords/computer-science |
| keywords[0].score | 0.7129659652709961 |
| keywords[0].display_name | Computer science |
| keywords[1].id | https://openalex.org/keywords/sentence |
| keywords[1].score | 0.6709764003753662 |
| keywords[1].display_name | Sentence |
| keywords[2].id | https://openalex.org/keywords/baseline |
| keywords[2].score | 0.644458532333374 |
| keywords[2].display_name | Baseline (sea) |
| keywords[3].id | https://openalex.org/keywords/classifier |
| keywords[3].score | 0.6284494996070862 |
| keywords[3].display_name | Classifier (UML) |
| keywords[4].id | https://openalex.org/keywords/natural-language-processing |
| keywords[4].score | 0.6269187927246094 |
| keywords[4].display_name | Natural language processing |
| keywords[5].id | https://openalex.org/keywords/task |
| keywords[5].score | 0.5772109627723694 |
| keywords[5].display_name | Task (project management) |
| keywords[6].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[6].score | 0.5564249753952026 |
| keywords[6].display_name | Artificial intelligence |
| keywords[7].id | https://openalex.org/keywords/test-set |
| keywords[7].score | 0.5060035586357117 |
| keywords[7].display_name | Test set |
| keywords[8].id | https://openalex.org/keywords/set |
| keywords[8].score | 0.48050153255462646 |
| keywords[8].display_name | Set (abstract data type) |
| keywords[9].id | https://openalex.org/keywords/engineering |
| keywords[9].score | 0.08977550268173218 |
| keywords[9].display_name | Engineering |
| keywords[10].id | https://openalex.org/keywords/programming-language |
| keywords[10].score | 0.08691051602363586 |
| keywords[10].display_name | Programming language |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2011.07933 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2011.07933 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2011.07933 |
| locations[1].id | mag:3098515495 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | submittedVersion |
| locations[1].raw_type | |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | False |
| locations[1].raw_source_name | arXiv (Cornell University) |
| locations[1].landing_page_url | https://arxiv.org/abs/2011.07933v1 |
| locations[2].id | doi:10.48550/arxiv.2011.07933 |
| locations[2].is_oa | True |
| locations[2].source.id | https://openalex.org/S4306400194 |
| locations[2].source.issn | |
| locations[2].source.type | repository |
| locations[2].source.is_oa | True |
| locations[2].source.issn_l | |
| locations[2].source.is_core | False |
| locations[2].source.is_in_doaj | False |
| locations[2].source.display_name | arXiv (Cornell University) |
| locations[2].source.host_organization | https://openalex.org/I205783295 |
| locations[2].source.host_organization_name | Cornell University |
| locations[2].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[2].license | cc-by |
| locations[2].pdf_url | |
| locations[2].version | |
| locations[2].raw_type | article |
| locations[2].license_id | https://openalex.org/licenses/cc-by |
| locations[2].is_accepted | False |
| locations[2].is_published | |
| locations[2].raw_source_name | |
| locations[2].landing_page_url | https://doi.org/10.48550/arxiv.2011.07933 |
| locations[3].id | mag:3118434027 |
| locations[3].is_oa | True |
| locations[3].source.id | https://openalex.org/S4306400194 |
| locations[3].source.issn | |
| locations[3].source.type | repository |
| locations[3].source.is_oa | True |
| locations[3].source.issn_l | |
| locations[3].source.is_core | False |
| locations[3].source.is_in_doaj | False |
| locations[3].source.display_name | arXiv (Cornell University) |
| locations[3].source.host_organization | https://openalex.org/I205783295 |
| locations[3].source.host_organization_name | Cornell University |
| locations[3].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[3].license | |
| locations[3].pdf_url | |
| locations[3].version | |
| locations[3].raw_type | |
| locations[3].license_id | |
| locations[3].is_accepted | False |
| locations[3].is_published | |
| locations[3].raw_source_name | arXiv (Cornell University) |
| locations[3].landing_page_url | https://arxiv.org/pdf/2011.07933 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5073249931 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Muhammad ElNokrashy |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Muhammad N. ElNokrashy |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5007758583 |
| authorships[1].author.orcid | |
| authorships[1].author.display_name | Amr Hendy |
| authorships[1].countries | US |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I1290206253 |
| authorships[1].affiliations[0].raw_affiliation_string | Microsoft (United States), Redmond, United States |
| authorships[1].institutions[0].id | https://openalex.org/I1290206253 |
| authorships[1].institutions[0].ror | https://ror.org/00d0nc645 |
| authorships[1].institutions[0].type | company |
| authorships[1].institutions[0].lineage | https://openalex.org/I1290206253 |
| authorships[1].institutions[0].country_code | US |
| authorships[1].institutions[0].display_name | Microsoft (United States) |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Amr Hendy |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | Microsoft (United States), Redmond, United States |
| authorships[2].author.id | https://openalex.org/A5040098231 |
| authorships[2].author.orcid | |
| authorships[2].author.display_name | Mohamed Abdelghaffar |
| authorships[2].countries | US |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I1290206253 |
| authorships[2].affiliations[0].raw_affiliation_string | Microsoft (United States), Redmond, United States |
| authorships[2].institutions[0].id | https://openalex.org/I1290206253 |
| authorships[2].institutions[0].ror | https://ror.org/00d0nc645 |
| authorships[2].institutions[0].type | company |
| authorships[2].institutions[0].lineage | https://openalex.org/I1290206253 |
| authorships[2].institutions[0].country_code | US |
| authorships[2].institutions[0].display_name | Microsoft (United States) |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Mohamed Abdelghaffar |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | Microsoft (United States), Redmond, United States |
| authorships[3].author.id | https://openalex.org/A5021938376 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-4445-9767 |
| authorships[3].author.display_name | Mohamed Afify |
| authorships[3].countries | US |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I1290206253 |
| authorships[3].affiliations[0].raw_affiliation_string | Microsoft (United States), Redmond, United States |
| authorships[3].institutions[0].id | https://openalex.org/I1290206253 |
| authorships[3].institutions[0].ror | https://ror.org/00d0nc645 |
| authorships[3].institutions[0].type | company |
| authorships[3].institutions[0].lineage | https://openalex.org/I1290206253 |
| authorships[3].institutions[0].country_code | US |
| authorships[3].institutions[0].display_name | Microsoft (United States) |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Mohamed Afify |
| authorships[3].is_corresponding | False |
| authorships[3].raw_affiliation_strings | Microsoft (United States), Redmond, United States |
| authorships[4].author.id | https://openalex.org/A5048873489 |
| authorships[4].author.orcid | https://orcid.org/0000-0003-3561-3248 |
| authorships[4].author.display_name | Ahmed Y. Tawfik |
| authorships[4].countries | GB |
| authorships[4].affiliations[0].institution_ids | https://openalex.org/I133837150 |
| authorships[4].affiliations[0].raw_affiliation_string | University of Huddersfield, Huddersfield, United Kingdom |
| authorships[4].institutions[0].id | https://openalex.org/I133837150 |
| authorships[4].institutions[0].ror | https://ror.org/05t1h8f27 |
| authorships[4].institutions[0].type | education |
| authorships[4].institutions[0].lineage | https://openalex.org/I133837150 |
| authorships[4].institutions[0].country_code | GB |
| authorships[4].institutions[0].display_name | University of Huddersfield |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Ahmed Tawfik |
| authorships[4].is_corresponding | False |
| authorships[4].raw_affiliation_strings | University of Huddersfield, Huddersfield, United Kingdom |
| authorships[5].author.id | https://openalex.org/A5030937723 |
| authorships[5].author.orcid | |
| authorships[5].author.display_name | Hany Hassan Awadalla |
| authorships[5].countries | US |
| authorships[5].affiliations[0].institution_ids | https://openalex.org/I1290206253 |
| authorships[5].affiliations[0].raw_affiliation_string | Microsoft (United States), Redmond, United States |
| authorships[5].institutions[0].id | https://openalex.org/I1290206253 |
| authorships[5].institutions[0].ror | https://ror.org/00d0nc645 |
| authorships[5].institutions[0].type | company |
| authorships[5].institutions[0].lineage | https://openalex.org/I1290206253 |
| authorships[5].institutions[0].country_code | US |
| authorships[5].institutions[0].display_name | Microsoft (United States) |
| authorships[5].author_position | last |
| authorships[5].raw_author_name | Hany Hassan Awadalla |
| authorships[5].is_corresponding | False |
| authorships[5].raw_affiliation_strings | Microsoft (United States), Redmond, United States |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2011.07933 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Score Combination for Improved Parallel Corpus Filtering for Low Resource Conditions |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10181 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9998999834060669 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Natural Language Processing Techniques |
| related_works | https://openalex.org/W3118434027, https://openalex.org/W3119188361, https://openalex.org/W2917025446, https://openalex.org/W2127293685, https://openalex.org/W2917098612, https://openalex.org/W2464151748, https://openalex.org/W2395971734, https://openalex.org/W3120955312, https://openalex.org/W2000026602, https://openalex.org/W9218159, https://openalex.org/W2957558225, https://openalex.org/W2963842834, https://openalex.org/W2395386636, https://openalex.org/W2463861136, https://openalex.org/W2740984590, https://openalex.org/W2759035416, https://openalex.org/W2970950209, https://openalex.org/W3034775979, https://openalex.org/W2003484529, https://openalex.org/W2092910966 |
| cited_by_count | 1 |
| counts_by_year[0].year | 2021 |
| counts_by_year[0].cited_by_count | 1 |
| locations_count | 4 |
| best_oa_location.id | pmh:oai:arXiv.org:2011.07933 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2011.07933 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2011.07933 |
| primary_location.id | pmh:oai:arXiv.org:2011.07933 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2011.07933 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2011.07933 |
| publication_date | 2020-11-16 |
| publication_year | 2020 |
| referenced_works | https://openalex.org/W2593864460, https://openalex.org/W2963919854, https://openalex.org/W3001434439, https://openalex.org/W2952509486, https://openalex.org/W2962735107, https://openalex.org/W3025490068 |
| referenced_works_count | 6 |
| abstract_inverted_index.a | 16, 25 |
| abstract_inverted_index.5% | 61 |
| abstract_inverted_index.7% | 59 |
| abstract_inverted_index.We | 11 |
| abstract_inverted_index.by | 34, 53 |
| abstract_inverted_index.in | 43, 66 |
| abstract_inverted_index.on | 69 |
| abstract_inverted_index.to | 5, 28 |
| abstract_inverted_index.(1) | 15 |
| abstract_inverted_index.(2) | 24 |
| abstract_inverted_index.(3) | 38 |
| abstract_inverted_index.For | 47 |
| abstract_inverted_index.and | 31, 37, 60, 75 |
| abstract_inverted_index.for | 20, 73 |
| abstract_inverted_index.our | 3, 56 |
| abstract_inverted_index.set | 72 |
| abstract_inverted_index.the | 6, 39, 44, 48, 54, 70 |
| abstract_inverted_index.This | 0 |
| abstract_inverted_index.each | 21 |
| abstract_inverted_index.from | 14 |
| abstract_inverted_index.over | 64 |
| abstract_inverted_index.task | 45 |
| abstract_inverted_index.test | 71 |
| abstract_inverted_index.Khmer | 76 |
| abstract_inverted_index.LASER | 18 |
| abstract_inverted_index.WMT20 | 7 |
| abstract_inverted_index.built | 19, 27 |
| abstract_inverted_index.mBART | 49 |
| abstract_inverted_index.pairs | 33 |
| abstract_inverted_index.paper | 1 |
| abstract_inverted_index.score | 68 |
| abstract_inverted_index.shows | 58 |
| abstract_inverted_index.task. | 10 |
| abstract_inverted_index.Pashto | 74 |
| abstract_inverted_index.custom | 17 |
| abstract_inverted_index.method | 57 |
| abstract_inverted_index.scores | 13, 41 |
| abstract_inverted_index.setup, | 51 |
| abstract_inverted_index.source | 22 |
| abstract_inverted_index.combine | 12 |
| abstract_inverted_index.devkit. | 46 |
| abstract_inverted_index.included | 42 |
| abstract_inverted_index.negative | 32 |
| abstract_inverted_index.original | 40 |
| abstract_inverted_index.positive | 30 |
| abstract_inverted_index.provided | 52 |
| abstract_inverted_index.relative | 62 |
| abstract_inverted_index.semantic | 35 |
| abstract_inverted_index.sentence | 8 |
| abstract_inverted_index.baseline, | 65 |
| abstract_inverted_index.describes | 2 |
| abstract_inverted_index.filtering | 9 |
| abstract_inverted_index.language, | 23 |
| abstract_inverted_index.sacreBLEU | 67 |
| abstract_inverted_index.alignment, | 36 |
| abstract_inverted_index.classifier | 26 |
| abstract_inverted_index.finetuning | 50 |
| abstract_inverted_index.submission | 4 |
| abstract_inverted_index.distinguish | 29 |
| abstract_inverted_index.improvement | 63 |
| abstract_inverted_index.organizers, | 55 |
| abstract_inverted_index.respectively. | 77 |
| cited_by_percentile_year | |
| countries_distinct_count | 2 |
| institutions_distinct_count | 6 |
| citation_normalized_percentile |