The Center for Education and Research in Information Assurance and Security (CERIAS)

The Center for Education and Research in
Information Assurance and Security (CERIAS)

Authorship attribution of SMS messages using an N-grams approach

Download

Download PDF Document
PDF

Author

Ashwin Mohan, Ibrahim M. Baggili, Marcus K. Rogers

Tech report number

CERIAS TR 2010-11

Entry type

techreport

Abstract

The pervasive use of SMS is increasing the amount of digital evidence available on cellular phones. Consequently it has become important to detect SMS authors, as a post-hoc analysis technique deemed useful in criminal persecution cases. This paper investigates an N-grams based approach for determining the authorship of SMS messages. Despite the scarcity of words in SMS messages and the differences between SMS language and natural language characteristics, the chosen method shows encouraging results in identification of authors. In this paper the effects of the gram size and the similarity scoring technique on the prediction of SMS message authors are also examined.

Download

PDF

Date

2010 – 7 – 12

Key alpha

Mohan, SMS, Authorship, Digital forensics, N-grams

School

Purdue University

Affiliation

CERIAS, College of Technology

Publication Date

2010-07-12

BibTex-formatted data

To refer to this entry, you may select and copy the text below and paste it into your BibTex document. Note that the text may not contain all macros that BibTex supports.