Classification and similarity detection of Indonesian scientific journal articles Article Swipe
YOU?
·
· 2025
· Open Access
·
· DOI: https://doi.org/10.11591/csit.v6i2.p147-158
The development of technology is accelerating in finding references to scientific articles or journals related to research topics. One of the sources of national aggregator services to find references is Garba Rujukan Digital (GARUDA), developed by the Ministry of Education, Culture, Research, and Technology (Kemendikbudristek) of the Republic of Indonesia. The naïve Bayes method classifies articles into several categories based on titles and abstracts. The system achieves an F1-score of 98%, which indicates high classification accuracy, and the classification process takes less than 60 minutes. Article similarity detection is done using the cosine similarity method, and a similarity score of 0.071 reflects the degree of similarity between the title and the abstract that has been concatenated, while a score close to 1 indicates a higher similarity. Searching for similar scientific articles based on title and abstract, sort articles based on the results of the highest similarity score are the most similar articles, and generating article categories. The results of the research show that the proposed method significantly improves the classification and search processes in GARUDA, as well as accurate and efficient similarity detection.
Related Topics
- Type
- article
- Language
- en
- Landing Page
- https://doi.org/10.11591/csit.v6i2.p147-158
- OA Status
- diamond
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4411501415
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4411501415Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.11591/csit.v6i2.p147-158Digital Object Identifier
- Title
-
Classification and similarity detection of Indonesian scientific journal articlesWork title
- Type
-
articleOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2025Year of publication
- Publication date
-
2025-06-20Full publication date if available
- Authors
-
Nyimas Sabilina Cahyani, Deris Stiawan, Abdiansah Abdiansah, Nurul Afifah, Dhıas Fajar Wıdya PermanaList of authors in order
- Landing page
-
https://doi.org/10.11591/csit.v6i2.p147-158Publisher landing page
- Open access
-
YesWhether a free full text is available
- OA status
-
diamondOpen access status per OpenAlex
- OA URL
-
https://doi.org/10.11591/csit.v6i2.p147-158Direct OA link when available
- Concepts
-
Similarity (geometry), Information retrieval, Computer science, Cosine similarity, Christian ministry, sort, Indonesian, Data mining, Artificial intelligence, Pattern recognition (psychology), Political science, Linguistics, Philosophy, Image (mathematics), LawTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
0Total citation count in OpenAlex
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4411501415 |
|---|---|
| doi | https://doi.org/10.11591/csit.v6i2.p147-158 |
| ids.doi | https://doi.org/10.11591/csit.v6i2.p147-158 |
| ids.openalex | https://openalex.org/W4411501415 |
| fwci | 0.0 |
| type | article |
| title | Classification and similarity detection of Indonesian scientific journal articles |
| biblio.issue | 2 |
| biblio.volume | 6 |
| biblio.last_page | 158 |
| biblio.first_page | 147 |
| topics[0].id | https://openalex.org/T13559 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9932000041007996 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Edcuational Technology Systems |
| topics[1].id | https://openalex.org/T13373 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9613999724388123 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1710 |
| topics[1].subfield.display_name | Information Systems |
| topics[1].display_name | Data Mining and Machine Learning Applications |
| topics[2].id | https://openalex.org/T13083 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9332000017166138 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1702 |
| topics[2].subfield.display_name | Artificial Intelligence |
| topics[2].display_name | Advanced Text Analysis Techniques |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C103278499 |
| concepts[0].level | 3 |
| concepts[0].score | 0.7963690757751465 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q254465 |
| concepts[0].display_name | Similarity (geometry) |
| concepts[1].id | https://openalex.org/C23123220 |
| concepts[1].level | 1 |
| concepts[1].score | 0.5625991821289062 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q816826 |
| concepts[1].display_name | Information retrieval |
| concepts[2].id | https://openalex.org/C41008148 |
| concepts[2].level | 0 |
| concepts[2].score | 0.5570961833000183 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[2].display_name | Computer science |
| concepts[3].id | https://openalex.org/C2780762811 |
| concepts[3].level | 3 |
| concepts[3].score | 0.5530609488487244 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q1784941 |
| concepts[3].display_name | Cosine similarity |
| concepts[4].id | https://openalex.org/C521751864 |
| concepts[4].level | 2 |
| concepts[4].score | 0.461925208568573 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q1729207 |
| concepts[4].display_name | Christian ministry |
| concepts[5].id | https://openalex.org/C88548561 |
| concepts[5].level | 2 |
| concepts[5].score | 0.4427903890609741 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q347599 |
| concepts[5].display_name | sort |
| concepts[6].id | https://openalex.org/C2779207338 |
| concepts[6].level | 2 |
| concepts[6].score | 0.4403468072414398 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q9240 |
| concepts[6].display_name | Indonesian |
| concepts[7].id | https://openalex.org/C124101348 |
| concepts[7].level | 1 |
| concepts[7].score | 0.370898962020874 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q172491 |
| concepts[7].display_name | Data mining |
| concepts[8].id | https://openalex.org/C154945302 |
| concepts[8].level | 1 |
| concepts[8].score | 0.34541571140289307 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[8].display_name | Artificial intelligence |
| concepts[9].id | https://openalex.org/C153180895 |
| concepts[9].level | 2 |
| concepts[9].score | 0.2530931830406189 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q7148389 |
| concepts[9].display_name | Pattern recognition (psychology) |
| concepts[10].id | https://openalex.org/C17744445 |
| concepts[10].level | 0 |
| concepts[10].score | 0.10139608383178711 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q36442 |
| concepts[10].display_name | Political science |
| concepts[11].id | https://openalex.org/C41895202 |
| concepts[11].level | 1 |
| concepts[11].score | 0.09846848249435425 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q8162 |
| concepts[11].display_name | Linguistics |
| concepts[12].id | https://openalex.org/C138885662 |
| concepts[12].level | 0 |
| concepts[12].score | 0.0 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q5891 |
| concepts[12].display_name | Philosophy |
| concepts[13].id | https://openalex.org/C115961682 |
| concepts[13].level | 2 |
| concepts[13].score | 0.0 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q860623 |
| concepts[13].display_name | Image (mathematics) |
| concepts[14].id | https://openalex.org/C199539241 |
| concepts[14].level | 1 |
| concepts[14].score | 0.0 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q7748 |
| concepts[14].display_name | Law |
| keywords[0].id | https://openalex.org/keywords/similarity |
| keywords[0].score | 0.7963690757751465 |
| keywords[0].display_name | Similarity (geometry) |
| keywords[1].id | https://openalex.org/keywords/information-retrieval |
| keywords[1].score | 0.5625991821289062 |
| keywords[1].display_name | Information retrieval |
| keywords[2].id | https://openalex.org/keywords/computer-science |
| keywords[2].score | 0.5570961833000183 |
| keywords[2].display_name | Computer science |
| keywords[3].id | https://openalex.org/keywords/cosine-similarity |
| keywords[3].score | 0.5530609488487244 |
| keywords[3].display_name | Cosine similarity |
| keywords[4].id | https://openalex.org/keywords/christian-ministry |
| keywords[4].score | 0.461925208568573 |
| keywords[4].display_name | Christian ministry |
| keywords[5].id | https://openalex.org/keywords/sort |
| keywords[5].score | 0.4427903890609741 |
| keywords[5].display_name | sort |
| keywords[6].id | https://openalex.org/keywords/indonesian |
| keywords[6].score | 0.4403468072414398 |
| keywords[6].display_name | Indonesian |
| keywords[7].id | https://openalex.org/keywords/data-mining |
| keywords[7].score | 0.370898962020874 |
| keywords[7].display_name | Data mining |
| keywords[8].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[8].score | 0.34541571140289307 |
| keywords[8].display_name | Artificial intelligence |
| keywords[9].id | https://openalex.org/keywords/pattern-recognition |
| keywords[9].score | 0.2530931830406189 |
| keywords[9].display_name | Pattern recognition (psychology) |
| keywords[10].id | https://openalex.org/keywords/political-science |
| keywords[10].score | 0.10139608383178711 |
| keywords[10].display_name | Political science |
| keywords[11].id | https://openalex.org/keywords/linguistics |
| keywords[11].score | 0.09846848249435425 |
| keywords[11].display_name | Linguistics |
| language | en |
| locations[0].id | doi:10.11591/csit.v6i2.p147-158 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4210218571 |
| locations[0].source.issn | 2722-3221, 2722-323X |
| locations[0].source.type | journal |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | 2722-3221 |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | Computer Science and Information Technologies |
| locations[0].source.host_organization | https://openalex.org/P4310315009 |
| locations[0].source.host_organization_name | Institute of Advanced Engineering and Science (IAES) |
| locations[0].source.host_organization_lineage | https://openalex.org/P4310315009 |
| locations[0].source.host_organization_lineage_names | Institute of Advanced Engineering and Science (IAES) |
| locations[0].license | |
| locations[0].pdf_url | |
| locations[0].version | publishedVersion |
| locations[0].raw_type | journal-article |
| locations[0].license_id | |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | Computer Science and Information Technologies |
| locations[0].landing_page_url | https://doi.org/10.11591/csit.v6i2.p147-158 |
| indexed_in | crossref |
| authorships[0].author.id | https://openalex.org/A5118516222 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Nyimas Sabilina Cahyani |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Nyimas Sabilina Cahyani |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5053568232 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-9302-1868 |
| authorships[1].author.display_name | Deris Stiawan |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Deris Stiawan |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5012805707 |
| authorships[2].author.orcid | https://orcid.org/0000-0003-1484-379X |
| authorships[2].author.display_name | Abdiansah Abdiansah |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Abdiansah Abdiansah |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5103014558 |
| authorships[3].author.orcid | https://orcid.org/0009-0009-4820-7207 |
| authorships[3].author.display_name | Nurul Afifah |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Nurul Afifah |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5032738357 |
| authorships[4].author.orcid | |
| authorships[4].author.display_name | Dhıas Fajar Wıdya Permana |
| authorships[4].author_position | last |
| authorships[4].raw_author_name | Dendi Renaldo Permana |
| authorships[4].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://doi.org/10.11591/csit.v6i2.p147-158 |
| open_access.oa_status | diamond |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Classification and similarity detection of Indonesian scientific journal articles |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T13559 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9932000041007996 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Edcuational Technology Systems |
| related_works | https://openalex.org/W2389818373, https://openalex.org/W4392445444, https://openalex.org/W4396220545, https://openalex.org/W2220831889, https://openalex.org/W4312683641, https://openalex.org/W3027421045, https://openalex.org/W4405933298, https://openalex.org/W3013312691, https://openalex.org/W2980386803, https://openalex.org/W3215994059 |
| cited_by_count | 0 |
| locations_count | 1 |
| best_oa_location.id | doi:10.11591/csit.v6i2.p147-158 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4210218571 |
| best_oa_location.source.issn | 2722-3221, 2722-323X |
| best_oa_location.source.type | journal |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | 2722-3221 |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | Computer Science and Information Technologies |
| best_oa_location.source.host_organization | https://openalex.org/P4310315009 |
| best_oa_location.source.host_organization_name | Institute of Advanced Engineering and Science (IAES) |
| best_oa_location.source.host_organization_lineage | https://openalex.org/P4310315009 |
| best_oa_location.source.host_organization_lineage_names | Institute of Advanced Engineering and Science (IAES) |
| best_oa_location.license | |
| best_oa_location.pdf_url | |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | journal-article |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | Computer Science and Information Technologies |
| best_oa_location.landing_page_url | https://doi.org/10.11591/csit.v6i2.p147-158 |
| primary_location.id | doi:10.11591/csit.v6i2.p147-158 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4210218571 |
| primary_location.source.issn | 2722-3221, 2722-323X |
| primary_location.source.type | journal |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | 2722-3221 |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | Computer Science and Information Technologies |
| primary_location.source.host_organization | https://openalex.org/P4310315009 |
| primary_location.source.host_organization_name | Institute of Advanced Engineering and Science (IAES) |
| primary_location.source.host_organization_lineage | https://openalex.org/P4310315009 |
| primary_location.source.host_organization_lineage_names | Institute of Advanced Engineering and Science (IAES) |
| primary_location.license | |
| primary_location.pdf_url | |
| primary_location.version | publishedVersion |
| primary_location.raw_type | journal-article |
| primary_location.license_id | |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | Computer Science and Information Technologies |
| primary_location.landing_page_url | https://doi.org/10.11591/csit.v6i2.p147-158 |
| publication_date | 2025-06-20 |
| publication_year | 2025 |
| referenced_works_count | 0 |
| abstract_inverted_index.1 | 121 |
| abstract_inverted_index.a | 96, 117, 123 |
| abstract_inverted_index.60 | 83 |
| abstract_inverted_index.an | 67 |
| abstract_inverted_index.as | 175, 177 |
| abstract_inverted_index.by | 35 |
| abstract_inverted_index.in | 6, 173 |
| abstract_inverted_index.is | 4, 29, 88 |
| abstract_inverted_index.of | 2, 19, 22, 38, 45, 48, 69, 99, 104, 142, 158 |
| abstract_inverted_index.on | 60, 132, 139 |
| abstract_inverted_index.or | 12 |
| abstract_inverted_index.to | 9, 15, 26, 120 |
| abstract_inverted_index.One | 18 |
| abstract_inverted_index.The | 0, 50, 64, 156 |
| abstract_inverted_index.and | 42, 62, 76, 95, 109, 134, 152, 170, 179 |
| abstract_inverted_index.are | 147 |
| abstract_inverted_index.for | 127 |
| abstract_inverted_index.has | 113 |
| abstract_inverted_index.the | 20, 36, 46, 77, 91, 102, 107, 110, 140, 143, 148, 159, 163, 168 |
| abstract_inverted_index.98%, | 70 |
| abstract_inverted_index.been | 114 |
| abstract_inverted_index.done | 89 |
| abstract_inverted_index.find | 27 |
| abstract_inverted_index.high | 73 |
| abstract_inverted_index.into | 56 |
| abstract_inverted_index.less | 81 |
| abstract_inverted_index.most | 149 |
| abstract_inverted_index.show | 161 |
| abstract_inverted_index.sort | 136 |
| abstract_inverted_index.than | 82 |
| abstract_inverted_index.that | 112, 162 |
| abstract_inverted_index.well | 176 |
| abstract_inverted_index.0.071 | 100 |
| abstract_inverted_index.Bayes | 52 |
| abstract_inverted_index.Garba | 30 |
| abstract_inverted_index.based | 59, 131, 138 |
| abstract_inverted_index.close | 119 |
| abstract_inverted_index.score | 98, 118, 146 |
| abstract_inverted_index.takes | 80 |
| abstract_inverted_index.title | 108, 133 |
| abstract_inverted_index.using | 90 |
| abstract_inverted_index.which | 71 |
| abstract_inverted_index.while | 116 |
| abstract_inverted_index.cosine | 92 |
| abstract_inverted_index.degree | 103 |
| abstract_inverted_index.higher | 124 |
| abstract_inverted_index.method | 53, 165 |
| abstract_inverted_index.naïve | 51 |
| abstract_inverted_index.search | 171 |
| abstract_inverted_index.system | 65 |
| abstract_inverted_index.titles | 61 |
| abstract_inverted_index.Article | 85 |
| abstract_inverted_index.Digital | 32 |
| abstract_inverted_index.GARUDA, | 174 |
| abstract_inverted_index.Rujukan | 31 |
| abstract_inverted_index.article | 154 |
| abstract_inverted_index.between | 106 |
| abstract_inverted_index.finding | 7 |
| abstract_inverted_index.highest | 144 |
| abstract_inverted_index.method, | 94 |
| abstract_inverted_index.process | 79 |
| abstract_inverted_index.related | 14 |
| abstract_inverted_index.results | 141, 157 |
| abstract_inverted_index.several | 57 |
| abstract_inverted_index.similar | 128, 150 |
| abstract_inverted_index.sources | 21 |
| abstract_inverted_index.topics. | 17 |
| abstract_inverted_index.Culture, | 40 |
| abstract_inverted_index.F1-score | 68 |
| abstract_inverted_index.Ministry | 37 |
| abstract_inverted_index.Republic | 47 |
| abstract_inverted_index.abstract | 111 |
| abstract_inverted_index.accurate | 178 |
| abstract_inverted_index.achieves | 66 |
| abstract_inverted_index.articles | 11, 55, 130, 137 |
| abstract_inverted_index.improves | 167 |
| abstract_inverted_index.journals | 13 |
| abstract_inverted_index.minutes. | 84 |
| abstract_inverted_index.national | 23 |
| abstract_inverted_index.proposed | 164 |
| abstract_inverted_index.reflects | 101 |
| abstract_inverted_index.research | 16, 160 |
| abstract_inverted_index.services | 25 |
| abstract_inverted_index.(GARUDA), | 33 |
| abstract_inverted_index.Research, | 41 |
| abstract_inverted_index.Searching | 126 |
| abstract_inverted_index.abstract, | 135 |
| abstract_inverted_index.accuracy, | 75 |
| abstract_inverted_index.articles, | 151 |
| abstract_inverted_index.detection | 87 |
| abstract_inverted_index.developed | 34 |
| abstract_inverted_index.efficient | 180 |
| abstract_inverted_index.indicates | 72, 122 |
| abstract_inverted_index.processes | 172 |
| abstract_inverted_index.Education, | 39 |
| abstract_inverted_index.Indonesia. | 49 |
| abstract_inverted_index.Technology | 43 |
| abstract_inverted_index.abstracts. | 63 |
| abstract_inverted_index.aggregator | 24 |
| abstract_inverted_index.categories | 58 |
| abstract_inverted_index.classifies | 54 |
| abstract_inverted_index.detection. | 182 |
| abstract_inverted_index.generating | 153 |
| abstract_inverted_index.references | 8, 28 |
| abstract_inverted_index.scientific | 10, 129 |
| abstract_inverted_index.similarity | 86, 93, 97, 105, 145, 181 |
| abstract_inverted_index.technology | 3 |
| abstract_inverted_index.categories. | 155 |
| abstract_inverted_index.development | 1 |
| abstract_inverted_index.similarity. | 125 |
| abstract_inverted_index.accelerating | 5 |
| abstract_inverted_index.concatenated, | 115 |
| abstract_inverted_index.significantly | 166 |
| abstract_inverted_index.classification | 74, 78, 169 |
| abstract_inverted_index.(Kemendikbudristek) | 44 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 5 |
| sustainable_development_goals[0].id | https://metadata.un.org/sdg/4 |
| sustainable_development_goals[0].score | 0.5799999833106995 |
| sustainable_development_goals[0].display_name | Quality Education |
| citation_normalized_percentile.value | 0.10444523 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | True |