From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition
2018 · Open Access · DOI: https://doi.org/10.48550/arxiv.1811.06193
Tracking users' activities on the World Wide Web (WWW) allows researchers to analyze each user's internet behavior over time, including the amount of time spent on a particular domain. This analysis can be used in research design, as researchers can access their participants' behavior while they browse the web. Web search behavior has been a subject of interest because of its real-world applications in marketing, digital advertising, and identifying potential threats online. In this paper, we present an image-processing-based method to extract the domains that a participant visits across multiple browsers during a lab session. This method could provide another way to collect users' activities during an online session, given that the session recorder collected the data. The method can also be used to collect the textual content of web pages that an individual visits for later analysis.
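
The abstract describes turning recorded browsing sessions into a list of visited domains and the time spent on each. The following is a minimal sketch of only the final step, not the authors' implementation: it assumes that an OCR pass (not shown) has already produced one address-bar string per sampled video frame. The function names, the `(timestamp, text)` sample format, and the example URLs are all hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def domain_from_ocr_text(text):
    """Return the host name found in one OCR'd address-bar string, or None."""
    cleaned = text.strip().replace(" ", "")  # assumption: OCR may insert stray spaces
    if not cleaned:
        return None
    if "://" not in cleaned:
        cleaned = "http://" + cleaned  # urlsplit only finds the host after a scheme
    try:
        host = urlsplit(cleaned).hostname
    except ValueError:  # e.g. unbalanced brackets produced by OCR noise
        return None
    if not host or "." not in host:
        return None
    return host[4:] if host.startswith("www.") else host


def seconds_per_domain(samples):
    """Sum time per domain from (timestamp_seconds, ocr_text) samples sorted by time.

    Each sample's domain is credited with the time until the next sample.
    The last sample only marks the end of the recording.
    """
    totals = defaultdict(float)
    for (t0, text), (t1, _) in zip(samples, samples[1:]):
        domain = domain_from_ocr_text(text)
        if domain is not None:
            totals[domain] += t1 - t0
    return dict(totals)


# Hypothetical OCR output, one entry per sampled frame.
samples = [
    (0.0, "https://www.example.com/search?q=ocr"),
    (5.0, "www.example.com/ results"),
    (12.0, "https://arxiv.org/abs/1811.06193"),
    (20.0, ""),
]
print(seconds_per_domain(samples))  # {'example.com': 12.0, 'arxiv.org': 8.0}
```

Crediting each frame with the interval up to the next frame is one simple design choice; a real pipeline would also need to handle OCR misreads of individual characters, which this sketch does not attempt.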
Related Topics
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/1811.06193, https://arxiv.org/pdf/1811.06193
- OA Status: green
- Related Works: 10
- OpenAlex ID: https://openalex.org/W4289288243
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4289288243 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.1811.06193 (Digital Object Identifier)
- Title: From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition (work title)
- Type: preprint (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2018 (year of publication)
- Publication date: 2018-11-15 (full publication date if available)
- Authors: Mojtaba Heidarysafa, James Reed, Kamran Kowsari, April Celeste R. Leviton, Janet I. Warren, Donald E. Brown (list of authors in order)
- Landing page: https://arxiv.org/abs/1811.06193 (publisher landing page)
- PDF URL: https://arxiv.org/pdf/1811.06193 (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: green (open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/1811.06193 (direct OA link when available)
- Concepts: Session (web analytics), World Wide Web, Computer science, The Internet, Web page, Web browser, Multimedia, Static web page, Web navigation, Domain (mathematical analysis), Subject (documents), Character (mathematics), Tracking (education), Information retrieval, Mathematical analysis, Pedagogy, Mathematics, Psychology, Geometry (top concepts, i.e. fields/topics, attached by OpenAlex)
- Cited by: 0 (total citation count in OpenAlex)
- Related works (count): 10 (other works algorithmically related by OpenAlex)
Full payload
| Field | Value |
|---|---|
| id | https://openalex.org/W4289288243 |
| doi | https://doi.org/10.48550/arxiv.1811.06193 |
| ids.openalex | https://openalex.org/W4289288243 |
| fwci | 0.0 |
| type | preprint |
| title | From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T12016 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9509999752044678 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1710 |
| topics[0].subfield.display_name | Information Systems |
| topics[0].display_name | Web Data Mining and Analysis |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C2779182362 |
| concepts[0].level | 2 |
| concepts[0].score | 0.8509633541107178 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q17126187 |
| concepts[0].display_name | Session (web analytics) |
| concepts[1].id | https://openalex.org/C136764020 |
| concepts[1].level | 1 |
| concepts[1].score | 0.7595642805099487 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q466 |
| concepts[1].display_name | World Wide Web |
| concepts[2].id | https://openalex.org/C41008148 |
| concepts[2].level | 0 |
| concepts[2].score | 0.7468540072441101 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[2].display_name | Computer science |
| concepts[3].id | https://openalex.org/C110875604 |
| concepts[3].level | 2 |
| concepts[3].score | 0.6199614405632019 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q75 |
| concepts[3].display_name | The Internet |
| concepts[4].id | https://openalex.org/C21959979 |
| concepts[4].level | 2 |
| concepts[4].score | 0.5353935360908508 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q36774 |
| concepts[4].display_name | Web page |
| concepts[5].id | https://openalex.org/C2983909278 |
| concepts[5].level | 3 |
| concepts[5].score | 0.5296390056610107 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q6368 |
| concepts[5].display_name | Web browser |
| concepts[6].id | https://openalex.org/C49774154 |
| concepts[6].level | 1 |
| concepts[6].score | 0.5168704986572266 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q131765 |
| concepts[6].display_name | Multimedia |
| concepts[7].id | https://openalex.org/C173576120 |
| concepts[7].level | 4 |
| concepts[7].score | 0.489022821187973 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q2641220 |
| concepts[7].display_name | Static web page |
| concepts[8].id | https://openalex.org/C61096286 |
| concepts[8].level | 3 |
| concepts[8].score | 0.4868318438529968 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q7978592 |
| concepts[8].display_name | Web navigation |
| concepts[9].id | https://openalex.org/C36503486 |
| concepts[9].level | 2 |
| concepts[9].score | 0.46505364775657654 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q11235244 |
| concepts[9].display_name | Domain (mathematical analysis) |
| concepts[10].id | https://openalex.org/C2777855551 |
| concepts[10].level | 2 |
| concepts[10].score | 0.43446269631385803 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q12310021 |
| concepts[10].display_name | Subject (documents) |
| concepts[11].id | https://openalex.org/C2780861071 |
| concepts[11].level | 2 |
| concepts[11].score | 0.42597317695617676 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q1062934 |
| concepts[11].display_name | Character (mathematics) |
| concepts[12].id | https://openalex.org/C2775936607 |
| concepts[12].level | 2 |
| concepts[12].score | 0.41067641973495483 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q466845 |
| concepts[12].display_name | Tracking (education) |
| concepts[13].id | https://openalex.org/C23123220 |
| concepts[13].level | 1 |
| concepts[13].score | 0.3982850909233093 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q816826 |
| concepts[13].display_name | Information retrieval |
| concepts[14].id | https://openalex.org/C134306372 |
| concepts[14].level | 1 |
| concepts[14].score | 0.0 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q7754 |
| concepts[14].display_name | Mathematical analysis |
| concepts[15].id | https://openalex.org/C19417346 |
| concepts[15].level | 1 |
| concepts[15].score | 0.0 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q7922 |
| concepts[15].display_name | Pedagogy |
| concepts[16].id | https://openalex.org/C33923547 |
| concepts[16].level | 0 |
| concepts[16].score | 0.0 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q395 |
| concepts[16].display_name | Mathematics |
| concepts[17].id | https://openalex.org/C15744967 |
| concepts[17].level | 0 |
| concepts[17].score | 0.0 |
| concepts[17].wikidata | https://www.wikidata.org/wiki/Q9418 |
| concepts[17].display_name | Psychology |
| concepts[18].id | https://openalex.org/C2524010 |
| concepts[18].level | 1 |
| concepts[18].score | 0.0 |
| concepts[18].wikidata | https://www.wikidata.org/wiki/Q8087 |
| concepts[18].display_name | Geometry |
| keywords[0].id | https://openalex.org/keywords/session |
| keywords[0].score | 0.8509633541107178 |
| keywords[0].display_name | Session (web analytics) |
| keywords[1].id | https://openalex.org/keywords/world-wide-web |
| keywords[1].score | 0.7595642805099487 |
| keywords[1].display_name | World Wide Web |
| keywords[2].id | https://openalex.org/keywords/computer-science |
| keywords[2].score | 0.7468540072441101 |
| keywords[2].display_name | Computer science |
| keywords[3].id | https://openalex.org/keywords/the-internet |
| keywords[3].score | 0.6199614405632019 |
| keywords[3].display_name | The Internet |
| keywords[4].id | https://openalex.org/keywords/web-page |
| keywords[4].score | 0.5353935360908508 |
| keywords[4].display_name | Web page |
| keywords[5].id | https://openalex.org/keywords/web-browser |
| keywords[5].score | 0.5296390056610107 |
| keywords[5].display_name | Web browser |
| keywords[6].id | https://openalex.org/keywords/multimedia |
| keywords[6].score | 0.5168704986572266 |
| keywords[6].display_name | Multimedia |
| keywords[7].id | https://openalex.org/keywords/static-web-page |
| keywords[7].score | 0.489022821187973 |
| keywords[7].display_name | Static web page |
| keywords[8].id | https://openalex.org/keywords/web-navigation |
| keywords[8].score | 0.4868318438529968 |
| keywords[8].display_name | Web navigation |
| keywords[9].id | https://openalex.org/keywords/domain |
| keywords[9].score | 0.46505364775657654 |
| keywords[9].display_name | Domain (mathematical analysis) |
| keywords[10].id | https://openalex.org/keywords/subject |
| keywords[10].score | 0.43446269631385803 |
| keywords[10].display_name | Subject (documents) |
| keywords[11].id | https://openalex.org/keywords/character |
| keywords[11].score | 0.42597317695617676 |
| keywords[11].display_name | Character (mathematics) |
| keywords[12].id | https://openalex.org/keywords/tracking |
| keywords[12].score | 0.41067641973495483 |
| keywords[12].display_name | Tracking (education) |
| keywords[13].id | https://openalex.org/keywords/information-retrieval |
| keywords[13].score | 0.3982850909233093 |
| keywords[13].display_name | Information retrieval |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:1811.06193 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/1811.06193 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/1811.06193 |
| indexed_in | arxiv |
| authorships[0].author.id | https://openalex.org/A5037262149 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Mojtaba Heidarysafa |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Heidarysafa, Mojtaba |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5044939686 |
| authorships[1].author.orcid | https://orcid.org/0000-0001-8188-3914 |
| authorships[1].author.display_name | James Reed |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Reed, James |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5001354047 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-6451-4786 |
| authorships[2].author.display_name | Kamran Kowsari |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Kowsari, Kamran |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5045152092 |
| authorships[3].author.orcid | |
| authorships[3].author.display_name | April Celeste R. Leviton |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Leviton, April Celeste R. |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5056629388 |
| authorships[4].author.orcid | https://orcid.org/0000-0003-3005-1282 |
| authorships[4].author.display_name | Janet I. Warren |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Warren, Janet I. |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5086462231 |
| authorships[5].author.orcid | https://orcid.org/0000-0002-9140-2632 |
| authorships[5].author.display_name | Donald E. Brown |
| authorships[5].author_position | last |
| authorships[5].raw_author_name | Brown, Donald E. |
| authorships[5].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/1811.06193 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2022-08-02T00:00:00 |
| display_name | From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T12016 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9509999752044678 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1710 |
| primary_topic.subfield.display_name | Information Systems |
| primary_topic.display_name | Web Data Mining and Analysis |
| related_works | https://openalex.org/W1519586109, https://openalex.org/W2055154498, https://openalex.org/W2051097555, https://openalex.org/W1541158057, https://openalex.org/W2017818230, https://openalex.org/W2626548695, https://openalex.org/W2268257560, https://openalex.org/W2548348270, https://openalex.org/W2126467347, https://openalex.org/W2885559332 |
| cited_by_count | 0 |
| locations_count | 1 |
| best_oa_location.id | pmh:oai:arXiv.org:1811.06193 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/1811.06193 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/1811.06193 |
| primary_location.id | pmh:oai:arXiv.org:1811.06193 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/1811.06193 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/1811.06193 |
| publication_date | 2018-11-15 |
| publication_year | 2018 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 26, 52, 83, 88 |
| abstract_inverted_index.In | 68 |
| abstract_inverted_index.an | 73, 101, 124 |
| abstract_inverted_index.as | 16 |
| abstract_inverted_index.be | 32, 115 |
| abstract_inverted_index.by | 82 |
| abstract_inverted_index.in | 34, 60 |
| abstract_inverted_index.of | 23, 54, 57 |
| abstract_inverted_index.on | 3, 25 |
| abstract_inverted_index.to | 40, 97, 117 |
| abstract_inverted_index.we | 71 |
| abstract_inverted_index.The | 111 |
| abstract_inverted_index.Web | 7 |
| abstract_inverted_index.and | 19, 64 |
| abstract_inverted_index.are | 80 |
| abstract_inverted_index.can | 31, 113 |
| abstract_inverted_index.for | 20, 127 |
| abstract_inverted_index.has | 50 |
| abstract_inverted_index.its | 58 |
| abstract_inverted_index.lab | 89 |
| abstract_inverted_index.may | 38 |
| abstract_inverted_index.the | 4, 21, 46, 106, 119 |
| abstract_inverted_index.way | 96 |
| abstract_inverted_index.This | 29, 91 |
| abstract_inverted_index.Wide | 6 |
| abstract_inverted_index.also | 114 |
| abstract_inverted_index.been | 51 |
| abstract_inverted_index.each | 12 |
| abstract_inverted_index.over | 85 |
| abstract_inverted_index.that | 105, 123 |
| abstract_inverted_index.this | 69 |
| abstract_inverted_index.time | 17 |
| abstract_inverted_index.used | 33, 116 |
| abstract_inverted_index.(WWW) | 8 |
| abstract_inverted_index.World | 5 |
| abstract_inverted_index.based | 75 |
| abstract_inverted_index.could | 93 |
| abstract_inverted_index.data. | 110 |
| abstract_inverted_index.given | 104 |
| abstract_inverted_index.later | 128 |
| abstract_inverted_index.their | 41 |
| abstract_inverted_index.which | 79 |
| abstract_inverted_index.while | 44 |
| abstract_inverted_index.access | 39 |
| abstract_inverted_index.allows | 9 |
| abstract_inverted_index.amount | 22 |
| abstract_inverted_index.during | 100 |
| abstract_inverted_index.method | 76, 92, 112 |
| abstract_inverted_index.online | 102 |
| abstract_inverted_index.paper, | 70 |
| abstract_inverted_index.passes | 18 |
| abstract_inverted_index.search | 48 |
| abstract_inverted_index.user's | 13 |
| abstract_inverted_index.users' | 1 |
| abstract_inverted_index.visits | 126 |
| abstract_inverted_index.another | 95 |
| abstract_inverted_index.because | 56 |
| abstract_inverted_index.collect | 98, 118 |
| abstract_inverted_index.content | 121 |
| abstract_inverted_index.design, | 36 |
| abstract_inverted_index.digital | 62 |
| abstract_inverted_index.domain. | 28 |
| abstract_inverted_index.domains | 78 |
| abstract_inverted_index.online. | 67 |
| abstract_inverted_index.present | 72 |
| abstract_inverted_index.provide | 94 |
| abstract_inverted_index.session | 103, 107 |
| abstract_inverted_index.subject | 53 |
| abstract_inverted_index.textual | 120 |
| abstract_inverted_index.visited | 81 |
| abstract_inverted_index.Tracking | 0 |
| abstract_inverted_index.analysis | 30 |
| abstract_inverted_index.behavior | 15, 49 |
| abstract_inverted_index.browsing | 45 |
| abstract_inverted_index.interest | 55 |
| abstract_inverted_index.internet | 14 |
| abstract_inverted_index.multiple | 86 |
| abstract_inverted_index.recorder | 108 |
| abstract_inverted_index.research | 35 |
| abstract_inverted_index.session. | 90 |
| abstract_inverted_index.behaviors | 43 |
| abstract_inverted_index.web.\nWeb | 47 |
| abstract_inverted_index.activities | 2 |
| abstract_inverted_index.analysis\n | 129 |
| abstract_inverted_index.individual | 125 |
| abstract_inverted_index.marketing, | 61 |
| abstract_inverted_index.particular | 27 |
| abstract_inverted_index.identifying | 65 |
| abstract_inverted_index.participant | 84 |
| abstract_inverted_index.researchers | 10 |
| abstract_inverted_index.time\nspent | 24 |
| abstract_inverted_index.to\nanalyze | 11 |
| abstract_inverted_index.to\nextract | 77 |
| abstract_inverted_index.of\nweb-pages | 122 |
| abstract_inverted_index.participant's | 42 |
| abstract_inverted_index.advertisement, | 63 |
| abstract_inverted_index.collected\nthe | 109 |
| abstract_inverted_index.as\nresearchers | 37 |
| abstract_inverted_index.browsers\nduring | 87 |
| abstract_inverted_index.image-processing | 74 |
| abstract_inverted_index.potential\nthreats | 66 |
| abstract_inverted_index.users'\nactivities | 99 |
| abstract_inverted_index.real-world\napplications | 59 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 6 |
| citation_normalized_percentile.value | 0.55538503 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |
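
The `abstract_inverted_index.*` rows above store the abstract as a map from each token to the word positions where it occurs. As a minimal sketch of how such a map can be turned back into running text, the snippet below uses a hand-copied excerpt covering positions 0 to 8 only; the function name `rebuild_text` and the dictionary layout are illustrative assumptions, not part of any OpenAlex client library.

```python
def rebuild_text(inverted_index):
    """Rebuild running text from a {token: [positions]} inverted index."""
    by_position = {}
    for token, positions in inverted_index.items():
        for position in positions:
            by_position[position] = token
    # Join tokens in position order; positions missing from the index are skipped.
    return " ".join(by_position[p] for p in sorted(by_position))


# Excerpt copied from the rows above, truncated to positions 0-8
# ("on" and "the" also occur at later positions, omitted here).
excerpt = {
    "Tracking": [0],
    "users'": [1],
    "activities": [2],
    "on": [3],
    "the": [4],
    "World": [5],
    "Wide": [6],
    "Web": [7],
    "(WWW)": [8],
}
print(rebuild_text(excerpt))  # Tracking users' activities on the World Wide Web (WWW)
```

Some keys in the full index contain an embedded line break (for example the row holding position 47), so a complete reconstruction would also need to normalize whitespace inside tokens.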