LLMSecCode: Evaluating Large Language Models for Secure Coding Article Swipe
YOU?
·
· 2024
· Open Access
·
· DOI: https://doi.org/10.48550/arxiv.2408.16100
The rapid deployment of Large Language Models (LLMs) requires careful consideration of their effect on cybersecurity. Our work aims to improve the selection process of LLMs that are suitable for facilitating Secure Coding (SC). This raises challenging research questions, such as (RQ1) Which functionality can streamline the LLM evaluation? (RQ2) What should the evaluation measure? (RQ3) How to attest that the evaluation process is impartial? To address these questions, we introduce LLMSecCode, an open-source evaluation framework designed to assess LLM SC capabilities objectively. We validate the LLMSecCode implementation through experiments. When varying parameters and prompts, we find a 10% and 9% difference in performance, respectively. We also compare some results to reliable external actors, where our results show a 5% difference. We strive to ensure the ease of use of our open-source framework and encourage further development by external actors. With LLMSecCode, we hope to encourage the standardization and benchmarking of LLMs' capabilities in security-oriented code and tasks.
Related Topics
- Type
- preprint
- Language
- en
- Landing Page
- http://arxiv.org/abs/2408.16100
- https://arxiv.org/pdf/2408.16100
- OA Status
- green
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4402706364
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4402706364Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.48550/arxiv.2408.16100Digital Object Identifier
- Title
-
LLMSecCode: Evaluating Large Language Models for Secure CodingWork title
- Type
-
preprintOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2024Year of publication
- Publication date
-
2024-08-28Full publication date if available
- Authors
-
Anton Rydén, Erik Näslund, Elad Michael Schiller, Magnus AlmgrenList of authors in order
- Landing page
-
https://arxiv.org/abs/2408.16100Publisher landing page
- PDF URL
-
https://arxiv.org/pdf/2408.16100Direct link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://arxiv.org/pdf/2408.16100Direct OA link when available
- Concepts
-
Computer science, Coding (social sciences), Natural language processing, Mathematics, StatisticsTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
0Total citation count in OpenAlex
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4402706364 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2408.16100 |
| ids.doi | https://doi.org/10.48550/arxiv.2408.16100 |
| ids.openalex | https://openalex.org/W4402706364 |
| fwci | |
| type | preprint |
| title | LLMSecCode: Evaluating Large Language Models for Secure Coding |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T11287 |
| topics[0].field.id | https://openalex.org/fields/13 |
| topics[0].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[0].score | 0.6322000026702881 |
| topics[0].domain.id | https://openalex.org/domains/1 |
| topics[0].domain.display_name | Life Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1306 |
| topics[0].subfield.display_name | Cancer Research |
| topics[0].display_name | Cancer Genomics and Diagnostics |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C41008148 |
| concepts[0].level | 0 |
| concepts[0].score | 0.6742909550666809 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[0].display_name | Computer science |
| concepts[1].id | https://openalex.org/C179518139 |
| concepts[1].level | 2 |
| concepts[1].score | 0.525265634059906 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q5140297 |
| concepts[1].display_name | Coding (social sciences) |
| concepts[2].id | https://openalex.org/C204321447 |
| concepts[2].level | 1 |
| concepts[2].score | 0.33164697885513306 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q30642 |
| concepts[2].display_name | Natural language processing |
| concepts[3].id | https://openalex.org/C33923547 |
| concepts[3].level | 0 |
| concepts[3].score | 0.14535808563232422 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q395 |
| concepts[3].display_name | Mathematics |
| concepts[4].id | https://openalex.org/C105795698 |
| concepts[4].level | 1 |
| concepts[4].score | 0.05863049626350403 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q12483 |
| concepts[4].display_name | Statistics |
| keywords[0].id | https://openalex.org/keywords/computer-science |
| keywords[0].score | 0.6742909550666809 |
| keywords[0].display_name | Computer science |
| keywords[1].id | https://openalex.org/keywords/coding |
| keywords[1].score | 0.525265634059906 |
| keywords[1].display_name | Coding (social sciences) |
| keywords[2].id | https://openalex.org/keywords/natural-language-processing |
| keywords[2].score | 0.33164697885513306 |
| keywords[2].display_name | Natural language processing |
| keywords[3].id | https://openalex.org/keywords/mathematics |
| keywords[3].score | 0.14535808563232422 |
| keywords[3].display_name | Mathematics |
| keywords[4].id | https://openalex.org/keywords/statistics |
| keywords[4].score | 0.05863049626350403 |
| keywords[4].display_name | Statistics |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2408.16100 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2408.16100 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2408.16100 |
| locations[1].id | doi:10.48550/arxiv.2408.16100 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | cc-by |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | https://openalex.org/licenses/cc-by |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2408.16100 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5107570291 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Anton Rydén |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Rydén, Anton |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5034453769 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-0166-6344 |
| authorships[1].author.display_name | Erik Näslund |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Näslund, Erik |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5107608981 |
| authorships[2].author.orcid | |
| authorships[2].author.display_name | Elad Michael Schiller |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Schiller, Elad Michael |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5014602488 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-3383-9617 |
| authorships[3].author.display_name | Magnus Almgren |
| authorships[3].author_position | last |
| authorships[3].raw_author_name | Almgren, Magnus |
| authorships[3].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2408.16100 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | LLMSecCode: Evaluating Large Language Models for Secure Coding |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T11287 |
| primary_topic.field.id | https://openalex.org/fields/13 |
| primary_topic.field.display_name | Biochemistry, Genetics and Molecular Biology |
| primary_topic.score | 0.6322000026702881 |
| primary_topic.domain.id | https://openalex.org/domains/1 |
| primary_topic.domain.display_name | Life Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1306 |
| primary_topic.subfield.display_name | Cancer Research |
| primary_topic.display_name | Cancer Genomics and Diagnostics |
| related_works | https://openalex.org/W4391375266, https://openalex.org/W2748952813, https://openalex.org/W2390279801, https://openalex.org/W2358668433, https://openalex.org/W4396701345, https://openalex.org/W2376932109, https://openalex.org/W2001405890, https://openalex.org/W4396696052, https://openalex.org/W2382290278, https://openalex.org/W4395014643 |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2408.16100 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2408.16100 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2408.16100 |
| primary_location.id | pmh:oai:arXiv.org:2408.16100 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2408.16100 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2408.16100 |
| publication_date | 2024-08-28 |
| publication_year | 2024 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 97, 118 |
| abstract_inverted_index.5% | 119 |
| abstract_inverted_index.9% | 100 |
| abstract_inverted_index.SC | 80 |
| abstract_inverted_index.To | 65 |
| abstract_inverted_index.We | 83, 105, 121 |
| abstract_inverted_index.an | 72 |
| abstract_inverted_index.as | 40 |
| abstract_inverted_index.by | 137 |
| abstract_inverted_index.in | 102, 153 |
| abstract_inverted_index.is | 63 |
| abstract_inverted_index.of | 3, 11, 24, 127, 129, 150 |
| abstract_inverted_index.on | 14 |
| abstract_inverted_index.to | 19, 57, 77, 110, 123, 144 |
| abstract_inverted_index.we | 69, 95, 142 |
| abstract_inverted_index.10% | 98 |
| abstract_inverted_index.How | 56 |
| abstract_inverted_index.LLM | 47, 79 |
| abstract_inverted_index.Our | 16 |
| abstract_inverted_index.The | 0 |
| abstract_inverted_index.and | 93, 99, 133, 148, 156 |
| abstract_inverted_index.are | 27 |
| abstract_inverted_index.can | 44 |
| abstract_inverted_index.for | 29 |
| abstract_inverted_index.our | 115, 130 |
| abstract_inverted_index.the | 21, 46, 52, 60, 85, 125, 146 |
| abstract_inverted_index.use | 128 |
| abstract_inverted_index.LLMs | 25 |
| abstract_inverted_index.This | 34 |
| abstract_inverted_index.What | 50 |
| abstract_inverted_index.When | 90 |
| abstract_inverted_index.With | 140 |
| abstract_inverted_index.aims | 18 |
| abstract_inverted_index.also | 106 |
| abstract_inverted_index.code | 155 |
| abstract_inverted_index.ease | 126 |
| abstract_inverted_index.find | 96 |
| abstract_inverted_index.hope | 143 |
| abstract_inverted_index.show | 117 |
| abstract_inverted_index.some | 108 |
| abstract_inverted_index.such | 39 |
| abstract_inverted_index.that | 26, 59 |
| abstract_inverted_index.work | 17 |
| abstract_inverted_index.(RQ1) | 41 |
| abstract_inverted_index.(RQ2) | 49 |
| abstract_inverted_index.(RQ3) | 55 |
| abstract_inverted_index.(SC). | 33 |
| abstract_inverted_index.LLMs' | 151 |
| abstract_inverted_index.Large | 4 |
| abstract_inverted_index.Which | 42 |
| abstract_inverted_index.rapid | 1 |
| abstract_inverted_index.their | 12 |
| abstract_inverted_index.these | 67 |
| abstract_inverted_index.where | 114 |
| abstract_inverted_index.(LLMs) | 7 |
| abstract_inverted_index.Coding | 32 |
| abstract_inverted_index.Models | 6 |
| abstract_inverted_index.Secure | 31 |
| abstract_inverted_index.assess | 78 |
| abstract_inverted_index.attest | 58 |
| abstract_inverted_index.effect | 13 |
| abstract_inverted_index.ensure | 124 |
| abstract_inverted_index.raises | 35 |
| abstract_inverted_index.should | 51 |
| abstract_inverted_index.strive | 122 |
| abstract_inverted_index.tasks. | 157 |
| abstract_inverted_index.actors, | 113 |
| abstract_inverted_index.actors. | 139 |
| abstract_inverted_index.address | 66 |
| abstract_inverted_index.careful | 9 |
| abstract_inverted_index.compare | 107 |
| abstract_inverted_index.further | 135 |
| abstract_inverted_index.improve | 20 |
| abstract_inverted_index.process | 23, 62 |
| abstract_inverted_index.results | 109, 116 |
| abstract_inverted_index.through | 88 |
| abstract_inverted_index.varying | 91 |
| abstract_inverted_index.Language | 5 |
| abstract_inverted_index.designed | 76 |
| abstract_inverted_index.external | 112, 138 |
| abstract_inverted_index.measure? | 54 |
| abstract_inverted_index.prompts, | 94 |
| abstract_inverted_index.reliable | 111 |
| abstract_inverted_index.requires | 8 |
| abstract_inverted_index.research | 37 |
| abstract_inverted_index.suitable | 28 |
| abstract_inverted_index.validate | 84 |
| abstract_inverted_index.encourage | 134, 145 |
| abstract_inverted_index.framework | 75, 132 |
| abstract_inverted_index.introduce | 70 |
| abstract_inverted_index.selection | 22 |
| abstract_inverted_index.LLMSecCode | 86 |
| abstract_inverted_index.deployment | 2 |
| abstract_inverted_index.difference | 101 |
| abstract_inverted_index.evaluation | 53, 61, 74 |
| abstract_inverted_index.impartial? | 64 |
| abstract_inverted_index.parameters | 92 |
| abstract_inverted_index.questions, | 38, 68 |
| abstract_inverted_index.streamline | 45 |
| abstract_inverted_index.LLMSecCode, | 71, 141 |
| abstract_inverted_index.challenging | 36 |
| abstract_inverted_index.development | 136 |
| abstract_inverted_index.difference. | 120 |
| abstract_inverted_index.evaluation? | 48 |
| abstract_inverted_index.open-source | 73, 131 |
| abstract_inverted_index.benchmarking | 149 |
| abstract_inverted_index.capabilities | 81, 152 |
| abstract_inverted_index.experiments. | 89 |
| abstract_inverted_index.facilitating | 30 |
| abstract_inverted_index.objectively. | 82 |
| abstract_inverted_index.performance, | 103 |
| abstract_inverted_index.consideration | 10 |
| abstract_inverted_index.functionality | 43 |
| abstract_inverted_index.respectively. | 104 |
| abstract_inverted_index.cybersecurity. | 15 |
| abstract_inverted_index.implementation | 87 |
| abstract_inverted_index.standardization | 147 |
| abstract_inverted_index.security-oriented | 154 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 4 |
| citation_normalized_percentile |