Exploring Data Caching Policy with Data Access Patterns from dCache System
2025 · Open Access · DOI: https://doi.org/10.1051/epjconf/202533701340
The dCache storage system at Brookhaven National Laboratory (BNL) serves as a critical cache for the ATLAS collaboration, enabling efficient access to petabytes of data located on tape, remote repositories, and cold storage. Effective cache management is vital to minimize access latency, particularly as operators have observed persistent high-demand datasets that warrant prolonged retention (“pinning”) in disk cache. This study evaluates machine learning (ML) techniques to automate dataset pinning decisions by predicting future access patterns. Our models, which integrate temporal trends and request-specific features, achieve predictive errors significantly below the inherent variability of dataset access patterns. We further explore dynamic updates to these predictions using real-time dCache access logs, enabling adaptive pinning strategies for high-priority datasets. Ongoing work focuses on validating system-wide performance gains under realistic user workloads, with the goal of optimizing resource utilization for large-scale scientific data infrastructures.
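As a rough illustration of the idea in the abstract, the sketch below forecasts each dataset's next-period access count from its recent history, compares the forecast error with the spread of the observed counts, and pins the datasets with the highest forecasts. Everything in it is an assumption made for illustration: the dataset names, the access counts, the moving-average forecaster (a stand-in for the ML models, which the abstract does not specify), and the `forecast_next`, `backtest_mae`, and `choose_pins` helpers are hypothetical and are not taken from the paper or from dCache.

```python
from statistics import mean, pstdev


def forecast_next(counts, window=3):
    """Naive stand-in forecaster: mean of the last `window` access counts."""
    return mean(counts[-window:])


def backtest_mae(counts, window=3):
    """Mean absolute error of one-step-ahead forecasts replayed over the history."""
    errors = [
        abs(forecast_next(counts[:t], window) - counts[t])
        for t in range(window, len(counts))
    ]
    return mean(errors)


def choose_pins(history, capacity, window=3):
    """Return up to `capacity` dataset names, ranked by forecast demand."""
    ranked = sorted(
        history,
        key=lambda name: forecast_next(history[name], window),
        reverse=True,
    )
    return ranked[:capacity]


# Hypothetical per-period access counts for three datasets.
history = {
    "dataset_a": [120, 130, 125, 128, 131, 127],
    "dataset_b": [5, 0, 40, 2, 0, 35],
    "dataset_c": [60, 62, 58, 61, 63, 60],
}

for name, counts in history.items():
    # Compare forecast error against the variability of the series itself.
    print(name, "MAE:", round(backtest_mae(counts), 2),
          "std dev:", round(pstdev(counts), 2))

print("pin:", choose_pins(history, capacity=2))
```

A forecaster is only informative for pinning when its error is small relative to the series' own variability, which is the comparison the abstract reports for the authors' models; the toy loop above prints both numbers side by side so that comparison can be made per dataset.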
- Type: article
- Language: en
- Landing Page: https://doi.org/10.1051/epjconf/202533701340, https://www.epj-conferences.org/articles/epjconf/pdf/2025/22/epjconf_chep2025_01340.pdf
- OA Status: diamond
- References: 6
- OpenAlex ID: https://openalex.org/W4414917971
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4414917971 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.1051/epjconf/202533701340 (Digital Object Identifier)
- Title: Exploring Data Caching Policy with Data Access Patterns from dCache System
- Type: article (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2025
- Publication date: 2025-01-01 (full publication date if available)
- Authors: J. M. Aldrich, Alex Sim, Kesheng Wu, Shinjae Yoo, Hiro Ito, V. Garonne, E. Lançon (in order)
- Landing page: https://doi.org/10.1051/epjconf/202533701340 (publisher landing page)
- PDF URL: https://www.epj-conferences.org/articles/epjconf/pdf/2025/22/epjconf_chep2025_01340.pdf (direct link to full-text PDF)
- Open access: Yes (a free full text is available)
- OA status: diamond (open access status per OpenAlex)
- OA URL: https://www.epj-conferences.org/articles/epjconf/pdf/2025/22/epjconf_chep2025_01340.pdf (direct OA link when available)
- Cited by: 0 (total citation count in OpenAlex)
- References (count): 6 (number of works referenced by this work)
Full payload
| Field | Value |
|---|---|
| id | https://openalex.org/W4414917971 |
| doi | https://doi.org/10.1051/epjconf/202533701340 |
| ids.doi | https://doi.org/10.1051/epjconf/202533701340 |
| ids.openalex | https://openalex.org/W4414917971 |
| fwci | 0.0 |
| type | article |
| title | Exploring Data Caching Policy with Data Access Patterns from dCache System |
| biblio.issue | |
| biblio.volume | 337 |
| biblio.last_page | 01340 |
| biblio.first_page | 01340 |
| topics[0].id | https://openalex.org/T11478 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9998999834060669 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1705 |
| topics[0].subfield.display_name | Computer Networks and Communications |
| topics[0].display_name | Caching and Content Delivery |
| topics[1].id | https://openalex.org/T11181 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9918000102043152 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1705 |
| topics[1].subfield.display_name | Computer Networks and Communications |
| topics[1].display_name | Advanced Data Storage Technologies |
| topics[2].id | https://openalex.org/T10742 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9890000224113464 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1705 |
| topics[2].subfield.display_name | Computer Networks and Communications |
| topics[2].display_name | Peer-to-Peer Network Technologies |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| language | en |
| locations[0].id | doi:10.1051/epjconf/202533701340 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S19068271 |
| locations[0].source.issn | 2100-014X, 2101-6275 |
| locations[0].source.type | journal |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | 2100-014X |
| locations[0].source.is_core | True |
| locations[0].source.is_in_doaj | True |
| locations[0].source.display_name | EPJ Web of Conferences |
| locations[0].source.host_organization | https://openalex.org/P4310319748 |
| locations[0].source.host_organization_name | EDP Sciences |
| locations[0].source.host_organization_lineage | https://openalex.org/P4310319748 |
| locations[0].source.host_organization_lineage_names | EDP Sciences |
| locations[0].license | cc-by |
| locations[0].pdf_url | https://www.epj-conferences.org/articles/epjconf/pdf/2025/22/epjconf_chep2025_01340.pdf |
| locations[0].version | publishedVersion |
| locations[0].raw_type | journal-article |
| locations[0].license_id | https://openalex.org/licenses/cc-by |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | EPJ Web of Conferences |
| locations[0].landing_page_url | https://doi.org/10.1051/epjconf/202533701340 |
| locations[1].id | pmh:oai:doaj.org/article:246738845ffd482f8936f685c4ea2b49 |
| locations[1].is_oa | False |
| locations[1].source.id | https://openalex.org/S4306401280 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | DOAJ (DOAJ: Directory of Open Access Journals) |
| locations[1].source.host_organization | |
| locations[1].source.host_organization_name | |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | submittedVersion |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | False |
| locations[1].raw_source_name | EPJ Web of Conferences, Vol 337, p 01340 (2025) |
| locations[1].landing_page_url | https://doaj.org/article/246738845ffd482f8936f685c4ea2b49 |
| indexed_in | crossref, doaj |
| authorships[0].author.id | https://openalex.org/A5109723649 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | J. M. Aldrich |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Jacob Aldrich |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5068293431 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-6295-1982 |
| authorships[1].author.display_name | Alex Sim |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Alex Sim |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5043129695 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-6907-3393 |
| authorships[2].author.display_name | Kesheng Wu |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Kesheng Wu |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5048176207 |
| authorships[3].author.orcid | https://orcid.org/0000-0003-4378-6448 |
| authorships[3].author.display_name | Shinjae Yoo |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Shinjae Yoo |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5105820237 |
| authorships[4].author.orcid | |
| authorships[4].author.display_name | Hiro Ito |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Hiro Ito |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5017423930 |
| authorships[5].author.orcid | https://orcid.org/0000-0001-7169-9160 |
| authorships[5].author.display_name | V. Garonne |
| authorships[5].author_position | middle |
| authorships[5].raw_author_name | Vincent Garonne |
| authorships[5].is_corresponding | False |
| authorships[6].author.id | https://openalex.org/A5058458958 |
| authorships[6].author.orcid | https://orcid.org/0000-0002-0225-187X |
| authorships[6].author.display_name | E. Lançon |
| authorships[6].author_position | last |
| authorships[6].raw_author_name | Eric Lancon |
| authorships[6].is_corresponding | False |
| has_content.pdf | True |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://www.epj-conferences.org/articles/epjconf/pdf/2025/22/epjconf_chep2025_01340.pdf |
| open_access.oa_status | diamond |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-08T00:00:00 |
| display_name | Exploring Data Caching Policy with Data Access Patterns from dCache System |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T11478 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9998999834060669 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1705 |
| primary_topic.subfield.display_name | Computer Networks and Communications |
| primary_topic.display_name | Caching and Content Delivery |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | doi:10.1051/epjconf/202533701340 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S19068271 |
| best_oa_location.source.issn | 2100-014X, 2101-6275 |
| best_oa_location.source.type | journal |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | 2100-014X |
| best_oa_location.source.is_core | True |
| best_oa_location.source.is_in_doaj | True |
| best_oa_location.source.display_name | EPJ Web of Conferences |
| best_oa_location.source.host_organization | https://openalex.org/P4310319748 |
| best_oa_location.source.host_organization_name | EDP Sciences |
| best_oa_location.source.host_organization_lineage | https://openalex.org/P4310319748 |
| best_oa_location.source.host_organization_lineage_names | EDP Sciences |
| best_oa_location.license | cc-by |
| best_oa_location.pdf_url | https://www.epj-conferences.org/articles/epjconf/pdf/2025/22/epjconf_chep2025_01340.pdf |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | journal-article |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | EPJ Web of Conferences |
| best_oa_location.landing_page_url | https://doi.org/10.1051/epjconf/202533701340 |
| primary_location.id | doi:10.1051/epjconf/202533701340 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S19068271 |
| primary_location.source.issn | 2100-014X, 2101-6275 |
| primary_location.source.type | journal |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | 2100-014X |
| primary_location.source.is_core | True |
| primary_location.source.is_in_doaj | True |
| primary_location.source.display_name | EPJ Web of Conferences |
| primary_location.source.host_organization | https://openalex.org/P4310319748 |
| primary_location.source.host_organization_name | EDP Sciences |
| primary_location.source.host_organization_lineage | https://openalex.org/P4310319748 |
| primary_location.source.host_organization_lineage_names | EDP Sciences |
| primary_location.license | cc-by |
| primary_location.pdf_url | https://www.epj-conferences.org/articles/epjconf/pdf/2025/22/epjconf_chep2025_01340.pdf |
| primary_location.version | publishedVersion |
| primary_location.raw_type | journal-article |
| primary_location.license_id | https://openalex.org/licenses/cc-by |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | EPJ Web of Conferences |
| primary_location.landing_page_url | https://doi.org/10.1051/epjconf/202533701340 |
| publication_date | 2025-01-01 |
| publication_year | 2025 |
| referenced_works | https://openalex.org/W1984567653, https://openalex.org/W3169979683, https://openalex.org/W2141405340, https://openalex.org/W2969429039, https://openalex.org/W4396681430, https://openalex.org/W2949676527 |
| referenced_works_count | 6 |
| abstract_inverted_index.a | 11 |
| abstract_inverted_index.We | 96 |
| abstract_inverted_index.as | 10, 43 |
| abstract_inverted_index.at | 4 |
| abstract_inverted_index.by | 70 |
| abstract_inverted_index.in | 55 |
| abstract_inverted_index.is | 36 |
| abstract_inverted_index.of | 23, 92, 131 |
| abstract_inverted_index.on | 26, 119 |
| abstract_inverted_index.to | 21, 38, 65, 101 |
| abstract_inverted_index.Our | 75 |
| abstract_inverted_index.The | 0 |
| abstract_inverted_index.and | 30, 81 |
| abstract_inverted_index.for | 14, 113, 135 |
| abstract_inverted_index.the | 15, 89, 129 |
| abstract_inverted_index.(ML) | 63 |
| abstract_inverted_index.This | 58 |
| abstract_inverted_index.cold | 31 |
| abstract_inverted_index.data | 24, 138 |
| abstract_inverted_index.disk | 56 |
| abstract_inverted_index.goal | 130 |
| abstract_inverted_index.have | 45 |
| abstract_inverted_index.that | 50 |
| abstract_inverted_index.user | 126 |
| abstract_inverted_index.with | 128 |
| abstract_inverted_index.work | 117 |
| abstract_inverted_index.(BNL) | 8 |
| abstract_inverted_index.ATLAS | 16 |
| abstract_inverted_index.below | 88 |
| abstract_inverted_index.cache | 13, 34 |
| abstract_inverted_index.gains | 123 |
| abstract_inverted_index.logs, | 108 |
| abstract_inverted_index.study | 59 |
| abstract_inverted_index.tape, | 27 |
| abstract_inverted_index.these | 102 |
| abstract_inverted_index.under | 124 |
| abstract_inverted_index.using | 104 |
| abstract_inverted_index.vital | 37 |
| abstract_inverted_index.which | 77 |
| abstract_inverted_index.access | 20, 40, 73, 94, 107 |
| abstract_inverted_index.cache. | 57 |
| abstract_inverted_index.dCache | 1, 106 |
| abstract_inverted_index.errors | 86 |
| abstract_inverted_index.future | 72 |
| abstract_inverted_index.remote | 28 |
| abstract_inverted_index.serves | 9 |
| abstract_inverted_index.system | 3 |
| abstract_inverted_index.trends | 80 |
| abstract_inverted_index.Ongoing | 116 |
| abstract_inverted_index.achieve | 84 |
| abstract_inverted_index.dataset | 67, 93 |
| abstract_inverted_index.dynamic | 99 |
| abstract_inverted_index.explore | 98 |
| abstract_inverted_index.focuses | 118 |
| abstract_inverted_index.further | 97 |
| abstract_inverted_index.located | 25 |
| abstract_inverted_index.machine | 61 |
| abstract_inverted_index.models, | 76 |
| abstract_inverted_index.pinning | 68, 111 |
| abstract_inverted_index.storage | 2 |
| abstract_inverted_index.updates | 100 |
| abstract_inverted_index.warrant | 51 |
| abstract_inverted_index.National | 6 |
| abstract_inverted_index.adaptive | 110 |
| abstract_inverted_index.automate | 66 |
| abstract_inverted_index.critical | 12 |
| abstract_inverted_index.datasets | 49 |
| abstract_inverted_index.enabling | 18, 109 |
| abstract_inverted_index.inherent | 90 |
| abstract_inverted_index.latency, | 41 |
| abstract_inverted_index.learning | 62 |
| abstract_inverted_index.minimize | 39 |
| abstract_inverted_index.observed | 46 |
| abstract_inverted_index.resource | 133 |
| abstract_inverted_index.storage. | 32 |
| abstract_inverted_index.temporal | 79 |
| abstract_inverted_index.Effective | 33 |
| abstract_inverted_index.datasets. | 115 |
| abstract_inverted_index.decisions | 69 |
| abstract_inverted_index.efficient | 19 |
| abstract_inverted_index.evaluates | 60 |
| abstract_inverted_index.features, | 83 |
| abstract_inverted_index.integrate | 78 |
| abstract_inverted_index.operators | 44 |
| abstract_inverted_index.patterns. | 74, 95 |
| abstract_inverted_index.petabytes | 22 |
| abstract_inverted_index.prolonged | 52 |
| abstract_inverted_index.real-time | 105 |
| abstract_inverted_index.realistic | 125 |
| abstract_inverted_index.retention | 53 |
| abstract_inverted_index.Brookhaven | 5 |
| abstract_inverted_index.Laboratory | 7 |
| abstract_inverted_index.management | 35 |
| abstract_inverted_index.optimizing | 132 |
| abstract_inverted_index.persistent | 47 |
| abstract_inverted_index.predicting | 71 |
| abstract_inverted_index.predictive | 85 |
| abstract_inverted_index.scientific | 137 |
| abstract_inverted_index.strategies | 112 |
| abstract_inverted_index.techniques | 64 |
| abstract_inverted_index.validating | 120 |
| abstract_inverted_index.workloads, | 127 |
| abstract_inverted_index.high-demand | 48 |
| abstract_inverted_index.large-scale | 136 |
| abstract_inverted_index.performance | 122 |
| abstract_inverted_index.predictions | 103 |
| abstract_inverted_index.system-wide | 121 |
| abstract_inverted_index.utilization | 134 |
| abstract_inverted_index.variability | 91 |
| abstract_inverted_index.particularly | 42 |
| abstract_inverted_index.high-priority | 114 |
| abstract_inverted_index.repositories, | 29 |
| abstract_inverted_index.significantly | 87 |
| abstract_inverted_index.collaboration, | 17 |
| abstract_inverted_index.(“pinning”) | 54 |
| abstract_inverted_index.infrastructures. | 139 |
| abstract_inverted_index.request-specific | 82 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 7 |
| citation_normalized_percentile.value | 0.56281608 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | True |
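
The `abstract_inverted_index.*` rows in the payload store the abstract as a map from each token to the word positions where it occurs. A minimal sketch of turning such a map back into running text follows; the `rebuild_abstract` helper is a hypothetical name, and `sample` is only a small subset of the rows above (it keeps just the first position of `dCache`).

```python
def rebuild_abstract(inverted_index):
    """Rebuild text from a {token: [positions]} inverted index."""
    by_position = {}
    for token, positions in inverted_index.items():
        for pos in positions:
            by_position[pos] = token
    # Emit tokens in ascending position order, separated by single spaces.
    return " ".join(by_position[pos] for pos in sorted(by_position))


# Subset of the payload rows, covering positions 0 through 8.
sample = {
    "The": [0],
    "dCache": [1],
    "storage": [2],
    "system": [3],
    "at": [4],
    "Brookhaven": [5],
    "National": [6],
    "Laboratory": [7],
    "(BNL)": [8],
}

print(rebuild_abstract(sample))
# prints: The dCache storage system at Brookhaven National Laboratory (BNL)
```

Tokens that occur more than once, such as `access` at positions 20, 40, 73, 94, and 107, are handled by the inner loop, which writes the same token into every listed position.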