Options
#iCanHazRobot?: improved robot detection for IR usage statistics
Author(s)
Date Issued
2016-06-14
Date Available
2024-06-21T15:02:49Z
Abstract
Experiment: simple random sample of 2 years of download data (n=341, N=3.3 million for 96.20% certainty), manually checked to determine if robot or human. DSpace 1.8.2 with U. Minho DSpace Statistics Add-on v. 4. Apache Tomcat behind Apache HTTP server; logs in Apache Combined Log Format. Minho registers every download in the PostgreSQL database. Results to be published in July 2016 issue of Library Hi Tech (Greene 2016). This dataset is used to experimentally test different detection techniques used alone and in combination, and different out-of-box techniques used by the major repository platforms DSpace and EPrints. 85% of unfiltered repository downloads come from robots.
Type of Material
Conference Publication
Copyright (Published Version)
2016 the Author
Language
English
Status of Item
Peer reviewed
Conference Details
11th Annual Conference on Open Repositories (OR2016). Dublin, 13-16 June 2016
This item is made available under a Creative Commons License
File(s)
Loading...
Name
JGRobotsTechTrackOR2016-06-07.pdf
Size
1.05 MB
Format
Adobe PDF
Checksum (MD5)
7f62cdd3430c6a12df062b87d2cd3c57
Owning collection