PUMA pipeline output
2021 · Open Access · DOI: https://doi.org/10.5281/zenodo.4545741
Output of the PUMA (PUblications Metadata Augmentation) software pipeline which takes a list of journal articles and augments it with metadata from external sources. This augmented metadata is then processed to generate data files and an explorable/searchable set of HTML pages.
The PUMA pipeline is available at: https://github.com/OllyButters/puma and is described at: https://doi.org/10.12688/f1000research.25484.1
These attached files are the result of running the pipeline on the list of publications described at: https://doi.org/10.12688/wellcomeopenres.14986.1 on 2021-01-15. Rerunning the pipeline on this list may result in slightly different outputs due to the changing content of the external metadata sources.
Screenshots of the output HTML pages:
- PUMA_home_2021-01-15.png - Summary of all publications.
- PUMA_2011_2021-01-15.png - All publications from 2011.
- PUMA_map_2021-01-15.png - Choropleth map of first author's country.
- PUMA_asthma_2021-01-15.png - All publications with an asthma MeSH.
- PUMA_metrics_2021-01-15.png - Simple metrics.
- PUMA_word_cloud_2021-01-15.png - Word cloud of abstract text.
- PUMA_coverage_2021-01-15.png - Table showing completeness of metadata.
Generated data files:
- authors.csv - Frequency of authors.
- first_authors.csv - Frequency of first authors.
- first_authors_inst.csv - Frequency of first authors' institutes.
- journals.csv - Frequency of journals published in.
- abstract_lemmatized.csv - Frequency of lemmatized abstract words.
- abstract_lemmatized_by_year.csv - Frequency of lemmatized abstract words broken down by year.
- title_lemmatized.csv - Frequency of lemmatized title words.
- title_lemmatized_by_year.csv - Frequency of lemmatized title words broken down by year.
- keywords_lemmatized.csv - Frequency of lemmatized keywords.
- keywords_lemmatized_by_year.csv - Frequency of lemmatized keywords broken down by year.
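The frequency files named above (authors.csv, first_authors.csv) are described only as counts of how often each author, or each first author, appears. As a rough illustration of what such a count involves, here is a minimal sketch. It is not PUMA's own code: the `publications` list, the author names, and the `name,count` print layout are all hypothetical, and the real column layout of the CSV files is not stated in this record.

```python
from collections import Counter

# Hypothetical input: one list of author names per publication, in author order.
# These names are placeholders, not data from the PUMA output.
publications = [
    ["A. Author", "B. Author"],
    ["B. Author", "C. Author"],
    ["B. Author"],
]


def author_frequencies(pubs):
    """Return (all-author counts, first-author counts) for a list of author lists."""
    all_authors = Counter(name for authors in pubs for name in authors)
    # Skip publications with an empty author list when counting first authors.
    first_authors = Counter(authors[0] for authors in pubs if authors)
    return all_authors, first_authors


all_counts, first_counts = author_frequencies(publications)

# Print in an assumed "name,count" layout, most frequent first.
for name, count in all_counts.most_common():
    print(f"{name},{count}")
```

The same counting pattern would apply to journals or to lemmatized words, given a suitable list of tokens per publication.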
Related Topics
- Type
- dataset
- Language
- en
- Landing Page
- https://doi.org/10.5281/zenodo.4545741
- OA Status
- green
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4393830107
Raw OpenAlex JSON
- OpenAlex ID
- https://openalex.org/W4393830107 (Canonical identifier for this work in OpenAlex)
- DOI
- https://doi.org/10.5281/zenodo.4545741 (Digital Object Identifier)
- Title
- PUMA pipeline output (Work title)
- Type
- dataset (OpenAlex work type)
- Language
- en (Primary language)
- Publication year
- 2021 (Year of publication)
- Publication date
- 2021-02-26 (Full publication date if available)
- Authors
- O. W. Butters, Rebecca Wilson, Hugh Garner, Thomas Burton (List of authors in order)
- Landing page
- https://doi.org/10.5281/zenodo.4545741 (Publisher landing page)
- Open access
- Yes (Whether a free full text is available)
- OA status
- green (Open access status per OpenAlex)
- OA URL
- https://doi.org/10.5281/zenodo.4545741 (Direct OA link when available)
- Concepts
- Puma, Pipeline (software), Computer science, Chemistry, Operating system, Gene, Biochemistry (Top concepts (fields/topics) attached by OpenAlex)
- Cited by
- 0 (Total citation count in OpenAlex)
- Related works (count)
- 10 (Other works algorithmically related by OpenAlex)
Full payload
| id | https://openalex.org/W4393830107 |
|---|---|
| doi | https://doi.org/10.5281/zenodo.4545741 |
| ids.doi | https://doi.org/10.5281/zenodo.4545741 |
| ids.openalex | https://openalex.org/W4393830107 |
| fwci | |
| type | dataset |
| title | PUMA pipeline output |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10323 |
| topics[0].field.id | https://openalex.org/fields/22 |
| topics[0].field.display_name | Engineering |
| topics[0].score | 0.6669999957084656 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/2204 |
| topics[0].subfield.display_name | Biomedical Engineering |
| topics[0].display_name | Analog and Mixed-Signal Circuit Design |
| topics[1].id | https://openalex.org/T12564 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.5976999998092651 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1705 |
| topics[1].subfield.display_name | Computer Networks and Communications |
| topics[1].display_name | Sensor Technology and Measurement Systems |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C2777417711 |
| concepts[0].level | 3 |
| concepts[0].score | 0.8311856389045715 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q270748 |
| concepts[0].display_name | Puma |
| concepts[1].id | https://openalex.org/C43521106 |
| concepts[1].level | 2 |
| concepts[1].score | 0.565383791923523 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q2165493 |
| concepts[1].display_name | Pipeline (software) |
| concepts[2].id | https://openalex.org/C41008148 |
| concepts[2].level | 0 |
| concepts[2].score | 0.33923739194869995 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[2].display_name | Computer science |
| concepts[3].id | https://openalex.org/C185592680 |
| concepts[3].level | 0 |
| concepts[3].score | 0.14950346946716309 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q2329 |
| concepts[3].display_name | Chemistry |
| concepts[4].id | https://openalex.org/C111919701 |
| concepts[4].level | 1 |
| concepts[4].score | 0.13546308875083923 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q9135 |
| concepts[4].display_name | Operating system |
| concepts[5].id | https://openalex.org/C104317684 |
| concepts[5].level | 2 |
| concepts[5].score | 0.0 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q7187 |
| concepts[5].display_name | Gene |
| concepts[6].id | https://openalex.org/C55493867 |
| concepts[6].level | 1 |
| concepts[6].score | 0.0 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q7094 |
| concepts[6].display_name | Biochemistry |
| keywords[0].id | https://openalex.org/keywords/puma |
| keywords[0].score | 0.8311856389045715 |
| keywords[0].display_name | Puma |
| keywords[1].id | https://openalex.org/keywords/pipeline |
| keywords[1].score | 0.565383791923523 |
| keywords[1].display_name | Pipeline (software) |
| keywords[2].id | https://openalex.org/keywords/computer-science |
| keywords[2].score | 0.33923739194869995 |
| keywords[2].display_name | Computer science |
| keywords[3].id | https://openalex.org/keywords/chemistry |
| keywords[3].score | 0.14950346946716309 |
| keywords[3].display_name | Chemistry |
| keywords[4].id | https://openalex.org/keywords/operating-system |
| keywords[4].score | 0.13546308875083923 |
| keywords[4].display_name | Operating system |
| language | en |
| locations[0].id | doi:10.5281/zenodo.4545741 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400562 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | Zenodo (CERN European Organization for Nuclear Research) |
| locations[0].source.host_organization | https://openalex.org/I67311998 |
| locations[0].source.host_organization_name | European Organization for Nuclear Research |
| locations[0].source.host_organization_lineage | https://openalex.org/I67311998 |
| locations[0].license | cc-by |
| locations[0].pdf_url | |
| locations[0].version | |
| locations[0].raw_type | dataset |
| locations[0].license_id | https://openalex.org/licenses/cc-by |
| locations[0].is_accepted | False |
| locations[0].is_published | |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | https://doi.org/10.5281/zenodo.4545741 |
| indexed_in | datacite |
| authorships[0].author.id | https://openalex.org/A5088042834 |
| authorships[0].author.orcid | https://orcid.org/0000-0003-0354-8461 |
| authorships[0].author.display_name | O. W. Butters |
| authorships[0].countries | GB |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I146655781 |
| authorships[0].affiliations[0].raw_affiliation_string | University of Liverpool |
| authorships[0].institutions[0].id | https://openalex.org/I146655781 |
| authorships[0].institutions[0].ror | https://ror.org/04xs57h96 |
| authorships[0].institutions[0].type | education |
| authorships[0].institutions[0].lineage | https://openalex.org/I146655781 |
| authorships[0].institutions[0].country_code | GB |
| authorships[0].institutions[0].display_name | University of Liverpool |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Olly Butters |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | University of Liverpool |
| authorships[1].author.id | https://openalex.org/A5084170452 |
| authorships[1].author.orcid | |
| authorships[1].author.display_name | Rebecca Wilson |
| authorships[1].countries | GB |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I146655781 |
| authorships[1].affiliations[0].raw_affiliation_string | University of Liverpool |
| authorships[1].institutions[0].id | https://openalex.org/I146655781 |
| authorships[1].institutions[0].ror | https://ror.org/04xs57h96 |
| authorships[1].institutions[0].type | education |
| authorships[1].institutions[0].lineage | https://openalex.org/I146655781 |
| authorships[1].institutions[0].country_code | GB |
| authorships[1].institutions[0].display_name | University of Liverpool |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Rebecca Wilson |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | University of Liverpool |
| authorships[2].author.id | https://openalex.org/A5021716048 |
| authorships[2].author.orcid | https://orcid.org/0000-0003-1707-0139 |
| authorships[2].author.display_name | Hugh Garner |
| authorships[2].countries | GB |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I84884186 |
| authorships[2].affiliations[0].raw_affiliation_string | Newcastle University |
| authorships[2].institutions[0].id | https://openalex.org/I84884186 |
| authorships[2].institutions[0].ror | https://ror.org/01kj2bm70 |
| authorships[2].institutions[0].type | education |
| authorships[2].institutions[0].lineage | https://openalex.org/I84884186 |
| authorships[2].institutions[0].country_code | GB |
| authorships[2].institutions[0].display_name | Newcastle University |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Hugh Garner |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | Newcastle University |
| authorships[3].author.id | https://openalex.org/A5083679447 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-6632-0950 |
| authorships[3].author.display_name | Thomas Burton |
| authorships[3].countries | GB |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I40120149 |
| authorships[3].affiliations[0].raw_affiliation_string | University of Oxford |
| authorships[3].institutions[0].id | https://openalex.org/I40120149 |
| authorships[3].institutions[0].ror | https://ror.org/052gg0110 |
| authorships[3].institutions[0].type | education |
| authorships[3].institutions[0].lineage | https://openalex.org/I40120149 |
| authorships[3].institutions[0].country_code | GB |
| authorships[3].institutions[0].display_name | University of Oxford |
| authorships[3].author_position | last |
| authorships[3].raw_author_name | Thomas Burton |
| authorships[3].is_corresponding | False |
| authorships[3].raw_affiliation_strings | University of Oxford |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://doi.org/10.5281/zenodo.4545741 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | PUMA pipeline output |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10323 |
| primary_topic.field.id | https://openalex.org/fields/22 |
| primary_topic.field.display_name | Engineering |
| primary_topic.score | 0.6669999957084656 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/2204 |
| primary_topic.subfield.display_name | Biomedical Engineering |
| primary_topic.display_name | Analog and Mixed-Signal Circuit Design |
| related_works | https://openalex.org/W2748952813, https://openalex.org/W3029624080, https://openalex.org/W2383247791, https://openalex.org/W2383847661, https://openalex.org/W2977641071, https://openalex.org/W2112968535, https://openalex.org/W2255064539, https://openalex.org/W2347355260, https://openalex.org/W1967092074, https://openalex.org/W2049426872 |
| cited_by_count | 0 |
| locations_count | 1 |
| best_oa_location.id | doi:10.5281/zenodo.4545741 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400562 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | Zenodo (CERN European Organization for Nuclear Research) |
| best_oa_location.source.host_organization | https://openalex.org/I67311998 |
| best_oa_location.source.host_organization_name | European Organization for Nuclear Research |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I67311998 |
| best_oa_location.license | cc-by |
| best_oa_location.pdf_url | |
| best_oa_location.version | |
| best_oa_location.raw_type | dataset |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | https://doi.org/10.5281/zenodo.4545741 |
| primary_location.id | doi:10.5281/zenodo.4545741 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400562 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | Zenodo (CERN European Organization for Nuclear Research) |
| primary_location.source.host_organization | https://openalex.org/I67311998 |
| primary_location.source.host_organization_name | European Organization for Nuclear Research |
| primary_location.source.host_organization_lineage | https://openalex.org/I67311998 |
| primary_location.license | cc-by |
| primary_location.pdf_url | |
| primary_location.version | |
| primary_location.raw_type | dataset |
| primary_location.license_id | https://openalex.org/licenses/cc-by |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | https://doi.org/10.5281/zenodo.4545741 |
| publication_date | 2021-02-26 |
| publication_year | 2021 |
| referenced_works_count | 0 |
| abstract_inverted_index.- | 102, 108, 114, 122, 130, 134, 141, 151, 156, 162, 169, 176, 183, 194, 201, 212, 218 |
| abstract_inverted_index.a | 11 |
| abstract_inverted_index.an | 35, 126 |
| abstract_inverted_index.by | 191, 209, 225 |
| abstract_inverted_index.in | 81 |
| abstract_inverted_index.is | 27, 44, 49 |
| abstract_inverted_index.it | 18 |
| abstract_inverted_index.of | 1, 13, 38, 59, 66, 90, 96, 104, 117, 137, 145, 153, 158, 164, 171, 178, 185, 196, 203, 214, 220 |
| abstract_inverted_index.on | 63, 71, 76 |
| abstract_inverted_index.to | 30, 86 |
| abstract_inverted_index.All | 109, 123 |
| abstract_inverted_index.The | 41 |
| abstract_inverted_index.all | 105 |
| abstract_inverted_index.and | 16, 34, 48 |
| abstract_inverted_index.are | 56 |
| abstract_inverted_index.at: | 46, 51, 69 |
| abstract_inverted_index.due | 85 |
| abstract_inverted_index.in. | 174 |
| abstract_inverted_index.map | 116 |
| abstract_inverted_index.may | 79 |
| abstract_inverted_index.set | 37 |
| abstract_inverted_index.the | 2, 57, 61, 64, 74, 87, 91, 97 |
| abstract_inverted_index.HTML | 39, 99 |
| abstract_inverted_index.PUMA | 3, 42 |
| abstract_inverted_index.This | 24 |
| abstract_inverted_index.Word | 135 |
| abstract_inverted_index.data | 32, 148 |
| abstract_inverted_index.down | 190, 208, 224 |
| abstract_inverted_index.from | 21, 111 |
| abstract_inverted_index.list | 12, 65, 78 |
| abstract_inverted_index.then | 28 |
| abstract_inverted_index.this | 77 |
| abstract_inverted_index.with | 19, 125 |
| abstract_inverted_index.2011. | 112 |
| abstract_inverted_index.MeSH. | 128 |
| abstract_inverted_index.Table | 142 |
| abstract_inverted_index.These | 53 |
| abstract_inverted_index.cloud | 136 |
| abstract_inverted_index.files | 33, 55 |
| abstract_inverted_index.first | 118, 159, 165 |
| abstract_inverted_index.takes | 10 |
| abstract_inverted_index.text. | 139 |
| abstract_inverted_index.title | 198, 205 |
| abstract_inverted_index.which | 9 |
| abstract_inverted_index.words | 188, 206 |
| abstract_inverted_index.year. | 192, 210, 226 |
| abstract_inverted_index.Output | 0 |
| abstract_inverted_index.Simple | 131 |
| abstract_inverted_index.asthma | 127 |
| abstract_inverted_index.broken | 189, 207, 223 |
| abstract_inverted_index.output | 98 |
| abstract_inverted_index.pages. | 40 |
| abstract_inverted_index.result | 58, 80 |
| abstract_inverted_index.words. | 181, 199 |
| abstract_inverted_index.Summary | 103 |
| abstract_inverted_index.content | 89 |
| abstract_inverted_index.journal | 14 |
| abstract_inverted_index.outputs | 84 |
| abstract_inverted_index.running | 60 |
| abstract_inverted_index.showing | 143 |
| abstract_inverted_index.Metadata | 5 |
| abstract_inverted_index.abstract | 138, 180, 187 |
| abstract_inverted_index.articles | 15 |
| abstract_inverted_index.attached | 54 |
| abstract_inverted_index.augments | 17 |
| abstract_inverted_index.author's | 119 |
| abstract_inverted_index.authors' | 166 |
| abstract_inverted_index.authors. | 154, 160 |
| abstract_inverted_index.changing | 88 |
| abstract_inverted_index.country. | 120 |
| abstract_inverted_index.external | 22, 92 |
| abstract_inverted_index.generate | 31 |
| abstract_inverted_index.journals | 172 |
| abstract_inverted_index.keywords | 222 |
| abstract_inverted_index.metadata | 20, 26, 93 |
| abstract_inverted_index.metrics. | 132 |
| abstract_inverted_index.pipeline | 8, 43, 62, 75 |
| abstract_inverted_index.slightly | 82 |
| abstract_inverted_index.software | 7 |
| abstract_inverted_index.sources. | 23, 94 |
| abstract_inverted_index.Frequency | 152, 157, 163, 170, 177, 184, 195, 202, 213, 219 |
| abstract_inverted_index.Rerunning | 73 |
| abstract_inverted_index.augmented | 25 |
| abstract_inverted_index.available | 45 |
| abstract_inverted_index.described | 50, 68 |
| abstract_inverted_index.different | 83 |
| abstract_inverted_index.keywords. | 216 |
| abstract_inverted_index.metadata. | 146 |
| abstract_inverted_index.processed | 29 |
| abstract_inverted_index.published | 173 |
| abstract_inverted_index.Choropleth | 115 |
| abstract_inverted_index.lemmatized | 179, 186, 197, 204, 215, 221 |
| abstract_inverted_index.2021-01-15. | 72 |
| abstract_inverted_index.authors.csv | 150 |
| abstract_inverted_index.institutes. | 167 |
| abstract_inverted_index.completeness | 144 |
| abstract_inverted_index.journals.csv | 168 |
| abstract_inverted_index.publications | 67, 110, 124 |
| abstract_inverted_index.(PUblications | 4 |
| abstract_inverted_index.Augmentation) | 6 |
| abstract_inverted_index.publications. | 106 |
| abstract_inverted_index.files</strong> | 149 |
| abstract_inverted_index.pages:</strong> | 100 |
| abstract_inverted_index.<strong>Generated | 147 |
| abstract_inverted_index.first_authors.csv | 155 |
| abstract_inverted_index.<strong>Screenshots | 95 |
| abstract_inverted_index.title_lemmatized.csv | 193 |
| abstract_inverted_index.explorable/searchable | 36 |
| abstract_inverted_index.first_authors_inst.csv | 161 |
| abstract_inverted_index.PUMA_map_2021-01-15.png | 113 |
| abstract_inverted_index.abstract_lemmatized.csv | 175 |
| abstract_inverted_index.keywords_lemmatized.csv | 211 |
| abstract_inverted_index.PUMA_2011_2021-01-15.png | 107 |
| abstract_inverted_index.PUMA_home_2021-01-15.png | 101 |
| abstract_inverted_index.PUMA_asthma_2021-01-15.png | 121 |
| abstract_inverted_index.PUMA_metrics_2021-01-15.png | 129 |
| abstract_inverted_index.PUMA_coverage_2021-01-15.png | 140 |
| abstract_inverted_index.title_lemmatized_by_year.csv | 200 |
| abstract_inverted_index.PUMA_word_cloud_2021-01-15.png | 133 |
| abstract_inverted_index.abstract_lemmatized_by_year.csv | 182 |
| abstract_inverted_index.keywords_lemmatized_by_year.csv | 217 |
| abstract_inverted_index.https://github.com/OllyButters/puma | 47 |
| abstract_inverted_index.https://doi.org/10.12688/f1000research.25484.1 | 52 |
| abstract_inverted_index.https://doi.org/10.12688/wellcomeopenres.14986.1 | 70 |
| cited_by_percentile_year | |
| countries_distinct_count | 1 |
| institutions_distinct_count | 4 |
| citation_normalized_percentile | |
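The `abstract_inverted_index.*` rows in the table above store the abstract as a mapping from each token to the word positions at which it occurs. A minimal sketch of turning such a mapping back into running text follows. The function name `rebuild_abstract` is hypothetical, and `sample` is only a hand-copied subset of the rows above (positions 0 to 8), so it reproduces just the opening words of the abstract.

```python
from typing import Dict, List


def rebuild_abstract(inverted_index: Dict[str, List[int]]) -> str:
    """Rebuild text from a token -> list-of-positions mapping."""
    token_at = {}
    for token, positions in inverted_index.items():
        for position in positions:
            token_at[position] = token
    # Join tokens in ascending position order; gaps in the positions are skipped.
    return " ".join(token_at[position] for position in sorted(token_at))


# Subset of the table above, restricted to positions 0-8 for brevity.
sample = {
    "Output": [0],
    "of": [1],
    "the": [2],
    "PUMA": [3],
    "(PUblications": [4],
    "Metadata": [5],
    "Augmentation)": [6],
    "software": [7],
    "pipeline": [8],
}

abstract_start = rebuild_abstract(sample)
print(abstract_start)
```

Note that tokens in the index keep whatever punctuation or markup was attached to them in the source text (for example the `<strong>` fragments in some rows), so a rebuilt abstract may need further cleaning.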