RAGStack: A Privacy-First GenAI Retrieval-Augmented Generation Architecture for Secure Enterprise Document Intelligence Article Swipe
RAGStack is a fully local, privacy-first Retrieval-Augmented Generation (RAG) framework for enterprise document intelligence and question answering. It integrates PyMuPDF for parsing, SentenceTransformers for embedding generation, FAISS for local semantic retrieval, and Ollama-hosted large language models (LLMs) for response generation. Featuring auto-indexing, manifest-based deduplication, self-healing index rebuilding, and normalized (0–1) similarity scoring, the framework enables compliance-ready AI deployment in offline or air-gapped enterprise environments. Recommended Citation: Srivastava, M. (2025). *RAGStack: A Privacy-First GenAI Retrieval-Augmented Generation Architecture for Secure Enterprise Document Intelligence (v1.0.1)*. Zenodo. https://doi.org/10.5281/zenodo.17878948
Related Topics
- Type
- other
- Landing Page
- https://doi.org/10.5281/zenodo.17878948
- OA Status
- green
- OpenAlex ID
- https://openalex.org/W7114783641
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W7114783641Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.5281/zenodo.17878948Digital Object Identifier
- Title
-
RAGStack: A Privacy-First GenAI Retrieval-Augmented Generation Architecture for Secure Enterprise Document IntelligenceWork title
- Type
-
otherOpenAlex work type
- Publication year
-
2025Year of publication
- Publication date
-
2025-12-10Full publication date if available
- Authors
-
Srivastava, MohitList of authors in order
- Landing page
-
https://doi.org/10.5281/zenodo.17878948Publisher landing page
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://doi.org/10.5281/zenodo.17878948Direct OA link when available
- Concepts
-
Computer science, Enterprise architecture, Enterprise information security architecture, Enterprise integration, Software deployment, NIST Enterprise Architecture Model, Embedding, Architecture domain, Enterprise information system, Enterprise architecture framework, Architecture, Software engineering, Enterprise architecture management, Key (lock), Semantics (computer science), Knowledge management, Enterprise software, World Wide Web, Intelligence analysis, Enterprise life cycle, Enterprise modelling, Artificial intelligence, Similarity (geometry), View model, Architecture framework, Applications architecture, Intrusion detection system, Applications of artificial intelligence, Index (typography), Business architecture, Enterprise data management, Integrated enterprise modeling, Database, Solution architecture, Enterprise system, Information retrievalTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
0Total citation count in OpenAlex
Full payload
| id | https://openalex.org/W7114783641 |
|---|---|
| doi | https://doi.org/10.5281/zenodo.17878948 |
| ids.doi | https://doi.org/10.5281/zenodo.17878948 |
| ids.openalex | https://openalex.org/W7114783641 |
| fwci | |
| type | other |
| title | RAGStack: A Privacy-First GenAI Retrieval-Augmented Generation Architecture for Secure Enterprise Document Intelligence |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C41008148 |
| concepts[0].level | 0 |
| concepts[0].score | 0.7559585571289062 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[0].display_name | Computer science |
| concepts[1].id | https://openalex.org/C10590034 |
| concepts[1].level | 3 |
| concepts[1].score | 0.588892936706543 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q1048431 |
| concepts[1].display_name | Enterprise architecture |
| concepts[2].id | https://openalex.org/C31139447 |
| concepts[2].level | 2 |
| concepts[2].score | 0.5212608575820923 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q5380386 |
| concepts[2].display_name | Enterprise information security architecture |
| concepts[3].id | https://openalex.org/C53996427 |
| concepts[3].level | 3 |
| concepts[3].score | 0.4727044701576233 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q5380387 |
| concepts[3].display_name | Enterprise integration |
| concepts[4].id | https://openalex.org/C105339364 |
| concepts[4].level | 2 |
| concepts[4].score | 0.4678921699523926 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q2297740 |
| concepts[4].display_name | Software deployment |
| concepts[5].id | https://openalex.org/C48461290 |
| concepts[5].level | 5 |
| concepts[5].score | 0.46599557995796204 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q6954385 |
| concepts[5].display_name | NIST Enterprise Architecture Model |
| concepts[6].id | https://openalex.org/C41608201 |
| concepts[6].level | 2 |
| concepts[6].score | 0.4590342938899994 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q980509 |
| concepts[6].display_name | Embedding |
| concepts[7].id | https://openalex.org/C194167682 |
| concepts[7].level | 5 |
| concepts[7].score | 0.4166257679462433 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q4787088 |
| concepts[7].display_name | Architecture domain |
| concepts[8].id | https://openalex.org/C27295321 |
| concepts[8].level | 2 |
| concepts[8].score | 0.40550586581230164 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q831795 |
| concepts[8].display_name | Enterprise information system |
| concepts[9].id | https://openalex.org/C27591593 |
| concepts[9].level | 4 |
| concepts[9].score | 0.3907126486301422 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q4380897 |
| concepts[9].display_name | Enterprise architecture framework |
| concepts[10].id | https://openalex.org/C123657996 |
| concepts[10].level | 2 |
| concepts[10].score | 0.37746453285217285 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q12271 |
| concepts[10].display_name | Architecture |
| concepts[11].id | https://openalex.org/C115903868 |
| concepts[11].level | 1 |
| concepts[11].score | 0.3772774338722229 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q80993 |
| concepts[11].display_name | Software engineering |
| concepts[12].id | https://openalex.org/C163352659 |
| concepts[12].level | 4 |
| concepts[12].score | 0.37415602803230286 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q5380367 |
| concepts[12].display_name | Enterprise architecture management |
| concepts[13].id | https://openalex.org/C26517878 |
| concepts[13].level | 2 |
| concepts[13].score | 0.3578612804412842 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q228039 |
| concepts[13].display_name | Key (lock) |
| concepts[14].id | https://openalex.org/C184337299 |
| concepts[14].level | 2 |
| concepts[14].score | 0.3517034649848938 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q1437428 |
| concepts[14].display_name | Semantics (computer science) |
| concepts[15].id | https://openalex.org/C56739046 |
| concepts[15].level | 1 |
| concepts[15].score | 0.34969720244407654 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q192060 |
| concepts[15].display_name | Knowledge management |
| concepts[16].id | https://openalex.org/C185765463 |
| concepts[16].level | 2 |
| concepts[16].score | 0.31992411613464355 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q1318054 |
| concepts[16].display_name | Enterprise software |
| concepts[17].id | https://openalex.org/C136764020 |
| concepts[17].level | 1 |
| concepts[17].score | 0.31831094622612 |
| concepts[17].wikidata | https://www.wikidata.org/wiki/Q466 |
| concepts[17].display_name | World Wide Web |
| concepts[18].id | https://openalex.org/C517642484 |
| concepts[18].level | 2 |
| concepts[18].score | 0.3128313720226288 |
| concepts[18].wikidata | https://www.wikidata.org/wiki/Q2388514 |
| concepts[18].display_name | Intelligence analysis |
| concepts[19].id | https://openalex.org/C82419060 |
| concepts[19].level | 2 |
| concepts[19].score | 0.307586133480072 |
| concepts[19].wikidata | https://www.wikidata.org/wiki/Q5380394 |
| concepts[19].display_name | Enterprise life cycle |
| concepts[20].id | https://openalex.org/C138824270 |
| concepts[20].level | 4 |
| concepts[20].score | 0.30409589409828186 |
| concepts[20].wikidata | https://www.wikidata.org/wiki/Q3318162 |
| concepts[20].display_name | Enterprise modelling |
| concepts[21].id | https://openalex.org/C154945302 |
| concepts[21].level | 1 |
| concepts[21].score | 0.29575127363204956 |
| concepts[21].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[21].display_name | Artificial intelligence |
| concepts[22].id | https://openalex.org/C103278499 |
| concepts[22].level | 3 |
| concepts[22].score | 0.2878878116607666 |
| concepts[22].wikidata | https://www.wikidata.org/wiki/Q254465 |
| concepts[22].display_name | Similarity (geometry) |
| concepts[23].id | https://openalex.org/C65936242 |
| concepts[23].level | 5 |
| concepts[23].score | 0.28478196263313293 |
| concepts[23].wikidata | https://www.wikidata.org/wiki/Q925858 |
| concepts[23].display_name | View model |
| concepts[24].id | https://openalex.org/C53619493 |
| concepts[24].level | 3 |
| concepts[24].score | 0.2831442654132843 |
| concepts[24].wikidata | https://www.wikidata.org/wiki/Q4787093 |
| concepts[24].display_name | Architecture framework |
| concepts[25].id | https://openalex.org/C41065761 |
| concepts[25].level | 4 |
| concepts[25].score | 0.28230413794517517 |
| concepts[25].wikidata | https://www.wikidata.org/wiki/Q2193309 |
| concepts[25].display_name | Applications architecture |
| concepts[26].id | https://openalex.org/C35525427 |
| concepts[26].level | 2 |
| concepts[26].score | 0.2813214957714081 |
| concepts[26].wikidata | https://www.wikidata.org/wiki/Q745881 |
| concepts[26].display_name | Intrusion detection system |
| concepts[27].id | https://openalex.org/C157170001 |
| concepts[27].level | 2 |
| concepts[27].score | 0.2739616334438324 |
| concepts[27].wikidata | https://www.wikidata.org/wiki/Q4781507 |
| concepts[27].display_name | Applications of artificial intelligence |
| concepts[28].id | https://openalex.org/C2777382242 |
| concepts[28].level | 2 |
| concepts[28].score | 0.2718777358531952 |
| concepts[28].wikidata | https://www.wikidata.org/wiki/Q6017816 |
| concepts[28].display_name | Index (typography) |
| concepts[29].id | https://openalex.org/C94184115 |
| concepts[29].level | 4 |
| concepts[29].score | 0.2718134820461273 |
| concepts[29].wikidata | https://www.wikidata.org/wiki/Q3643219 |
| concepts[29].display_name | Business architecture |
| concepts[30].id | https://openalex.org/C136227091 |
| concepts[30].level | 3 |
| concepts[30].score | 0.27107489109039307 |
| concepts[30].wikidata | https://www.wikidata.org/wiki/Q5380376 |
| concepts[30].display_name | Enterprise data management |
| concepts[31].id | https://openalex.org/C29418244 |
| concepts[31].level | 5 |
| concepts[31].score | 0.27007153630256653 |
| concepts[31].wikidata | https://www.wikidata.org/wiki/Q6043081 |
| concepts[31].display_name | Integrated enterprise modeling |
| concepts[32].id | https://openalex.org/C77088390 |
| concepts[32].level | 1 |
| concepts[32].score | 0.2657821476459503 |
| concepts[32].wikidata | https://www.wikidata.org/wiki/Q8513 |
| concepts[32].display_name | Database |
| concepts[33].id | https://openalex.org/C26063835 |
| concepts[33].level | 5 |
| concepts[33].score | 0.2589060366153717 |
| concepts[33].wikidata | https://www.wikidata.org/wiki/Q7558977 |
| concepts[33].display_name | Solution architecture |
| concepts[34].id | https://openalex.org/C67571701 |
| concepts[34].level | 2 |
| concepts[34].score | 0.2559782862663269 |
| concepts[34].wikidata | https://www.wikidata.org/wiki/Q1318054 |
| concepts[34].display_name | Enterprise system |
| concepts[35].id | https://openalex.org/C23123220 |
| concepts[35].level | 1 |
| concepts[35].score | 0.2526615560054779 |
| concepts[35].wikidata | https://www.wikidata.org/wiki/Q816826 |
| concepts[35].display_name | Information retrieval |
| keywords[0].id | https://openalex.org/keywords/enterprise-architecture |
| keywords[0].score | 0.588892936706543 |
| keywords[0].display_name | Enterprise architecture |
| keywords[1].id | https://openalex.org/keywords/enterprise-information-security-architecture |
| keywords[1].score | 0.5212608575820923 |
| keywords[1].display_name | Enterprise information security architecture |
| keywords[2].id | https://openalex.org/keywords/enterprise-integration |
| keywords[2].score | 0.4727044701576233 |
| keywords[2].display_name | Enterprise integration |
| keywords[3].id | https://openalex.org/keywords/software-deployment |
| keywords[3].score | 0.4678921699523926 |
| keywords[3].display_name | Software deployment |
| keywords[4].id | https://openalex.org/keywords/nist-enterprise-architecture-model |
| keywords[4].score | 0.46599557995796204 |
| keywords[4].display_name | NIST Enterprise Architecture Model |
| keywords[5].id | https://openalex.org/keywords/embedding |
| keywords[5].score | 0.4590342938899994 |
| keywords[5].display_name | Embedding |
| keywords[6].id | https://openalex.org/keywords/architecture-domain |
| keywords[6].score | 0.4166257679462433 |
| keywords[6].display_name | Architecture domain |
| keywords[7].id | https://openalex.org/keywords/enterprise-information-system |
| keywords[7].score | 0.40550586581230164 |
| keywords[7].display_name | Enterprise information system |
| keywords[8].id | https://openalex.org/keywords/enterprise-architecture-framework |
| keywords[8].score | 0.3907126486301422 |
| keywords[8].display_name | Enterprise architecture framework |
| keywords[9].id | https://openalex.org/keywords/architecture |
| keywords[9].score | 0.37746453285217285 |
| keywords[9].display_name | Architecture |
| language | |
| locations[0].id | doi:10.5281/zenodo.17878948 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400562 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | Zenodo (CERN European Organization for Nuclear Research) |
| locations[0].source.host_organization | https://openalex.org/I67311998 |
| locations[0].source.host_organization_name | European Organization for Nuclear Research |
| locations[0].source.host_organization_lineage | https://openalex.org/I67311998 |
| locations[0].license | |
| locations[0].pdf_url | |
| locations[0].version | |
| locations[0].raw_type | article |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | https://doi.org/10.5281/zenodo.17878948 |
| indexed_in | datacite |
| authorships[0].author.id | |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Srivastava, Mohit |
| authorships[0].countries | GB |
| authorships[0].institutions[0].id | https://openalex.org/I4210092225 |
| authorships[0].institutions[0].ror | https://ror.org/00hsg6w24 |
| authorships[0].institutions[0].type | company |
| authorships[0].institutions[0].lineage | https://openalex.org/I4210092225 |
| authorships[0].institutions[0].country_code | GB |
| authorships[0].institutions[0].display_name | ET Enterprises (United Kingdom) |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Srivastava, Mohit |
| authorships[0].is_corresponding | True |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://doi.org/10.5281/zenodo.17878948 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-12-11T00:00:00 |
| display_name | RAGStack: A Privacy-First GenAI Retrieval-Augmented Generation Architecture for Secure Enterprise Document Intelligence |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-12-11T23:13:37.075516 |
| primary_topic | |
| cited_by_count | 0 |
| locations_count | 1 |
| best_oa_location.id | doi:10.5281/zenodo.17878948 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400562 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | Zenodo (CERN European Organization for Nuclear Research) |
| best_oa_location.source.host_organization | https://openalex.org/I67311998 |
| best_oa_location.source.host_organization_name | European Organization for Nuclear Research |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I67311998 |
| best_oa_location.license | |
| best_oa_location.pdf_url | |
| best_oa_location.version | |
| best_oa_location.raw_type | article |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | https://doi.org/10.5281/zenodo.17878948 |
| primary_location.id | doi:10.5281/zenodo.17878948 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400562 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | Zenodo (CERN European Organization for Nuclear Research) |
| primary_location.source.host_organization | https://openalex.org/I67311998 |
| primary_location.source.host_organization_name | European Organization for Nuclear Research |
| primary_location.source.host_organization_lineage | https://openalex.org/I67311998 |
| primary_location.license | |
| primary_location.pdf_url | |
| primary_location.version | |
| primary_location.raw_type | article |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | https://doi.org/10.5281/zenodo.17878948 |
| publication_date | 2025-12-10 |
| publication_year | 2025 |
| referenced_works_count | 0 |
| abstract_inverted_index.A | 70 |
| abstract_inverted_index.a | 2 |
| abstract_inverted_index.AI | 56 |
| abstract_inverted_index.It | 17 |
| abstract_inverted_index.M. | 67 |
| abstract_inverted_index.in | 58 |
| abstract_inverted_index.is | 1 |
| abstract_inverted_index.or | 60 |
| abstract_inverted_index.and | 14, 31, 47 |
| abstract_inverted_index.for | 10, 20, 23, 27, 37, 76 |
| abstract_inverted_index.the | 52 |
| abstract_inverted_index.(RAG) | 8 |
| abstract_inverted_index.FAISS | 26 |
| abstract_inverted_index.GenAI | 72 |
| abstract_inverted_index.fully | 3 |
| abstract_inverted_index.index | 45 |
| abstract_inverted_index.large | 33 |
| abstract_inverted_index.local | 28 |
| abstract_inverted_index.(LLMs) | 36 |
| abstract_inverted_index.Secure | 77 |
| abstract_inverted_index.local, | 4 |
| abstract_inverted_index.models | 35 |
| abstract_inverted_index.(0–1) | 49 |
| abstract_inverted_index.(2025). | 68 |
| abstract_inverted_index.PyMuPDF | 19 |
| abstract_inverted_index.Zenodo. | 82 |
| abstract_inverted_index.enables | 54 |
| abstract_inverted_index.offline | 59 |
| abstract_inverted_index.Document | 79 |
| abstract_inverted_index.RAGStack | 0 |
| abstract_inverted_index.document | 12 |
| abstract_inverted_index.language | 34 |
| abstract_inverted_index.parsing, | 21 |
| abstract_inverted_index.question | 15 |
| abstract_inverted_index.response | 38 |
| abstract_inverted_index.scoring, | 51 |
| abstract_inverted_index.semantic | 29 |
| abstract_inverted_index.Citation: | 65 |
| abstract_inverted_index.Featuring | 40 |
| abstract_inverted_index.embedding | 24 |
| abstract_inverted_index.framework | 9, 53 |
| abstract_inverted_index.(v1.0.1)*. | 81 |
| abstract_inverted_index.*RAGStack: | 69 |
| abstract_inverted_index.Enterprise | 78 |
| abstract_inverted_index.Generation | 7, 74 |
| abstract_inverted_index.air-gapped | 61 |
| abstract_inverted_index.answering. | 16 |
| abstract_inverted_index.deployment | 57 |
| abstract_inverted_index.enterprise | 11, 62 |
| abstract_inverted_index.integrates | 18 |
| abstract_inverted_index.normalized | 48 |
| abstract_inverted_index.retrieval, | 30 |
| abstract_inverted_index.similarity | 50 |
| abstract_inverted_index.Recommended | 64 |
| abstract_inverted_index.Srivastava, | 66 |
| abstract_inverted_index.generation, | 25 |
| abstract_inverted_index.generation. | 39 |
| abstract_inverted_index.rebuilding, | 46 |
| abstract_inverted_index.Architecture | 75 |
| abstract_inverted_index.Intelligence | 80 |
| abstract_inverted_index.intelligence | 13 |
| abstract_inverted_index.self-healing | 44 |
| abstract_inverted_index.Ollama-hosted | 32 |
| abstract_inverted_index.Privacy-First | 71 |
| abstract_inverted_index.environments. | 63 |
| abstract_inverted_index.privacy-first | 5 |
| abstract_inverted_index.auto-indexing, | 41 |
| abstract_inverted_index.deduplication, | 43 |
| abstract_inverted_index.manifest-based | 42 |
| abstract_inverted_index.compliance-ready | 55 |
| abstract_inverted_index.Retrieval-Augmented | 6, 73 |
| abstract_inverted_index.SentenceTransformers | 22 |
| abstract_inverted_index.https://doi.org/10.5281/zenodo.17878948 | 83 |
| cited_by_percentile_year | |
| countries_distinct_count | 1 |
| institutions_distinct_count | 1 |
| citation_normalized_percentile |