Supervised ranking for plagiarism Source retrieval: Notebook for PAN at CLEF 2013

Kyle Williams, Hung Hsuan Chen, C. Lee Giles

Research output: Contribution to journalConference article

5 Citations (Scopus)

Abstract

Source retrieval involves making use of a search engine to retrieve candidate sources of plagiarism for a given suspicious document so that more accurate comparisons can be made. We describe a strategy for source retrieval that uses a supervised method to classify and rank search engine results as potential sources of plagiarism without retrieving the documents themselves. Evaluation shows the performance of our approach, which achieved the highest precision (0.57) and F1 score (0.47) in the 2014 PAN Source Retrieval task.

Original languageEnglish (US)
Pages (from-to)1021-1026
Number of pages6
JournalCEUR Workshop Proceedings
Volume1180
StatePublished - Jan 1 2014
Event2014 Cross Language Evaluation Forum Conference, CLEF 2014 - Sheffield, United Kingdom
Duration: Sep 15 2014Sep 18 2014

Fingerprint

Search engines

All Science Journal Classification (ASJC) codes

  • Computer Science(all)

Cite this

@article{e2831b2ccefc4838b99066ac1e46800a,
title = "Supervised ranking for plagiarism Source retrieval: Notebook for PAN at CLEF 2013",
abstract = "Source retrieval involves making use of a search engine to retrieve candidate sources of plagiarism for a given suspicious document so that more accurate comparisons can be made. We describe a strategy for source retrieval that uses a supervised method to classify and rank search engine results as potential sources of plagiarism without retrieving the documents themselves. Evaluation shows the performance of our approach, which achieved the highest precision (0.57) and F1 score (0.47) in the 2014 PAN Source Retrieval task.",
author = "Kyle Williams and Chen, {Hung Hsuan} and Giles, {C. Lee}",
year = "2014",
month = "1",
day = "1",
language = "English (US)",
volume = "1180",
pages = "1021--1026",
journal = "CEUR Workshop Proceedings",
issn = "1613-0073",
publisher = "CEUR-WS",

}

Supervised ranking for plagiarism Source retrieval : Notebook for PAN at CLEF 2013. / Williams, Kyle; Chen, Hung Hsuan; Giles, C. Lee.

In: CEUR Workshop Proceedings, Vol. 1180, 01.01.2014, p. 1021-1026.

Research output: Contribution to journalConference article

TY - JOUR

T1 - Supervised ranking for plagiarism Source retrieval

T2 - Notebook for PAN at CLEF 2013

AU - Williams, Kyle

AU - Chen, Hung Hsuan

AU - Giles, C. Lee

PY - 2014/1/1

Y1 - 2014/1/1

N2 - Source retrieval involves making use of a search engine to retrieve candidate sources of plagiarism for a given suspicious document so that more accurate comparisons can be made. We describe a strategy for source retrieval that uses a supervised method to classify and rank search engine results as potential sources of plagiarism without retrieving the documents themselves. Evaluation shows the performance of our approach, which achieved the highest precision (0.57) and F1 score (0.47) in the 2014 PAN Source Retrieval task.

AB - Source retrieval involves making use of a search engine to retrieve candidate sources of plagiarism for a given suspicious document so that more accurate comparisons can be made. We describe a strategy for source retrieval that uses a supervised method to classify and rank search engine results as potential sources of plagiarism without retrieving the documents themselves. Evaluation shows the performance of our approach, which achieved the highest precision (0.57) and F1 score (0.47) in the 2014 PAN Source Retrieval task.

UR - http://www.scopus.com/inward/record.url?scp=84981268159&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84981268159&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:84981268159

VL - 1180

SP - 1021

EP - 1026

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

SN - 1613-0073

ER -