Edward A. Fox
YOU?
Author Swipe
View article: Toward Robust URL Extraction for Open Science: A Study of arXiv File Formats and Temporal Trends
Toward Robust URL Extraction for Open Science: A Study of arXiv File Formats and Temporal Trends Open
In this work, we study how URL extraction results depend on input format. We compiled a pilot dataset by extracting URLs from 10 arXiv papers and used the same heuristic method to extract URLs from four formats derived from the PDF files o…
View article: From Data Deficient to Big Data in Shark Conservation
From Data Deficient to Big Data in Shark Conservation Open
Citizen science is increasingly harnessed worldwide to gather data otherwise requiring a prohibitive investment of funding and time. Meanwhile, the revolution in digital communication offers opportunities from crowdsourcing, big data appro…
View article: What’s in a cue?: Using natural language processing to quantify content characteristics of episodic future thinking in the context of overweight and obesity
What’s in a cue?: Using natural language processing to quantify content characteristics of episodic future thinking in the context of overweight and obesity Open
Episodic future thinking (EFT), an intervention in which participants vividly imagine their future, has been explored as a cognitive intervention to reduce delay discounting and decrease engagement in harmful health behaviors. In these stu…
View article: AI-Facilitated Episodic Future Thinking For Adults with Obesity
AI-Facilitated Episodic Future Thinking For Adults with Obesity Open
Episodic Future Thinking (EFT) involves vividly imagining personal future events and experiences in detail. It has shown promise as an intervention to reduce delay discounting-the tendency to devalue delayed rewards in favor of immediate g…
View article: Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals
Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals Open
As research institutions increasingly commit to supporting the United Nations' Sustainable Development Goals (SDGs), there is a pressing need to accurately assess their research output against these goals. Current approaches, primarily rel…
View article: Automating Chapter-Level Classification for Electronic Theses and Dissertations
Automating Chapter-Level Classification for Electronic Theses and Dissertations Open
Traditional archival practices for describing electronic theses and dissertations (ETDs) rely on broad, high-level metadata schemes that fail to capture the depth, complexity, and interdisciplinary nature of these long scholarly works. The…
View article: VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models
VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models Open
Existing text simplification or paraphrase datasets mainly focus on sentence-level text generation in a general domain. These datasets are typically developed without using domain knowledge. In this paper, we release a novel dataset, VTech…
View article: Analyzing Quality, Bias, and Performance in Text-to-Image Generative Models
Analyzing Quality, Bias, and Performance in Text-to-Image Generative Models Open
Advances in generative models have led to significant interest in image synthesis, demonstrating the ability to generate high-quality images for a diverse range of text prompts. Despite this progress, most studies ignore the presence of bi…
View article: ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations
ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations Open
Electronic theses and dissertations (ETDs) have been proposed, advocated, and generated for more than 25 years. Although ETDs are hosted by commercial or institutional digital library repositories, they are still an understudied type of sc…
View article: ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations
ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations Open
Electronic theses and dissertations (ETDs) have been proposed, advocated, and generated for more than 25 years. Although ETDs are hosted by commercial or institutional digital library repositories, they are still an understudied type of sc…
View article: AI Chatbot for Generating Episodic Future Thinking (EFT) Cue Texts for Health
AI Chatbot for Generating Episodic Future Thinking (EFT) Cue Texts for Health Open
We describe an AI-powered chatbot to aid with health improvement by generating Episodic Future Thinking (EFT) cue texts that should reduce delay discounting. In prior studies, EFT has been shown to address maladaptive health behaviors. Tho…
View article: Prediction and optimization of employee turnover intentions in enterprises based on unbalanced data
Prediction and optimization of employee turnover intentions in enterprises based on unbalanced data Open
The sudden resignation of core employees often brings losses to companies in various aspects. Traditional employee turnover theory cannot analyze the unbalanced data of employees comprehensively, which leads the company to make wrong decis…
View article: ETDPC-ETD500
ETDPC-ETD500 Open
ETDPC has been developed to classify ETD pages into 13 categories. This model uses ETDPC-ETD500, containing 92,371 scanned ETD pages in PNGs. These pages were manually annotated. Later, OCR was performed on all pages using AWS Textract, a …
View article: Retrieval-based Text Selection for Addressing Class-Imbalanced Data in Classification
Retrieval-based Text Selection for Addressing Class-Imbalanced Data in Classification Open
This paper addresses the problem of selecting of a set of texts for annotation in text classification using retrieval methods when there are limits on the number of annotations due to constraints on human resources. An additional challenge…
View article: MetaEnhance-ETDQual500
MetaEnhance-ETDQual500 Open
MetaEnhance-ETDQual500 consists of 500 ETD benchmark evaluations (different from ETD500) by combining subsets (i.e., 4 ETD subsets from university, year, department, and degree fields) sampled using multiple criteria. The selection criteri…
View article: AutoMeta-ETD500
AutoMeta-ETD500 Open
AutoMeta-ETD500 contains 500 scanned Electronic Theses and Dissertations (ETDs). This dataset is used to develop a framework called AutoMeta, which automatically extracts seven key metadata fields (e.g., title, author, advisor, university,…
View article: Web Archiving and Digital Libraries (WADL) 2023
Web Archiving and Digital Libraries (WADL) 2023 Open
The 2023 edition of the Workshop on Web Archiving and Digital Libraries (WADL) will explore the integration of web archiving and digital libraries. The workshop aims at addressing aspects covering the entire life cycle of digital resources…
View article: Maximizing Equitable Reach and Accessibility of ETDs
Maximizing Equitable Reach and Accessibility of ETDs Open
This poster addresses accessibility issues of electronic theses and dissertations (ETDs) in digital libraries (DLs). ETDs are available primarily as PDF files, which present barriers to equitable access, especially for users with visual im…
View article: Who can Submit an Excellent Review for this Manuscript in the Next 30 Days? - Peer Reviewing in the Age of Overload
Who can Submit an Excellent Review for this Manuscript in the Next 30 Days? - Peer Reviewing in the Age of Overload Open
With millions of research articles published yearly, the peer review process is in danger of collapsing, especially in 'hot' areas with popular conferences. Challenges arise from the large number of manuscripts submitted, skyrocketing use …
View article: Integrated Digital Library System for Long Documents and their Elements
Integrated Digital Library System for Long Documents and their Elements Open
We describe a next-generation integrated Digital Library (DL) system that addresses the numerous goals associated with long documents such as Electronic Theses and Dissertations (ETDs). Our extensible workflow-centric design supports a var…
View article: A New Annotation Method and Dataset for Layout Analysis of Long Documents
A New Annotation Method and Dataset for Layout Analysis of Long Documents Open
Parsing long documents, such as books, theses, and dissertations, is an important component of information extraction from scholarly documents. Layout analysis methods based on object detection have been developed in recent years to help w…
View article: Supplementary Figure 4 from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation
Supplementary Figure 4 from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation Open
PDF file - 348K
View article: Supplementary Figure 4 from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation
Supplementary Figure 4 from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation Open
PDF file - 348K
View article: Data from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation
Data from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation Open
The relative timing of genetic alterations that contribute to follicular lymphoma remains unknown. We analyzed a donor–recipient pair who both developed grade 2/3A follicular lymphoma 7 years after allogeneic transplantation and donor lymp…
View article: Interview with Dr. Weinstock from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation
Interview with Dr. Weinstock from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation Open
mp3 file (7.8 MB). In the January edition of the Cancer Discovery podcast, Executive Editor Mark Landis talks with David Weinstock about his paper, in which analysis of a donor-recipient pair of sisters with follicular lymphoma reveals the…
View article: Supplementary Methods, Figure Legends 1-4 from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation
Supplementary Methods, Figure Legends 1-4 from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation Open
PDF file - 73K
View article: Supplementary Table 5 from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation
Supplementary Table 5 from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation Open
PDF file - 52K
View article: Supplementary Table 3 from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation
Supplementary Table 3 from Molecular Ontogeny of Donor-Derived Follicular Lymphomas Occurring after Hematopoietic Cell Transplantation Open
PDF file - 114K