Natural Language Watermarking and Tamperproofing

Download

PDF

Author

Atallah

Tech report number

CERIAS TR 2002-54

Entry type

conference

Abstract

Two main results in the area of information hiding in natural language text are presented. A semantically-based scheme dramatically improves the information-hiding capacity of any text through two techniques: (i) modifying the granularity of meaning of individual sentences, whereas our own previous scheme kept the granularity fixed, and (ii) halving the number of sentences affected by the watermark. No longer a "long text, short watermark" approach, it now makes it possible to watermark short texts like wire agency reports. Using both the above-mentioned semantic marking scheme and our previous syntactically-based method hides information in a way that reveals any non-trivial tampering with the text (while re-formatting is not considered to be tampering; the problem would be solved trivially otherwise by hiding a hash of the text) with a probability 1 - 2^(-b(n+1)), n being its number of sentences and b a small positive integer based on the extent of co-referencing.

Date

2002

URL

http://homes.cerias.purdue.edu/~mercan/IHW-2002.pdf

Key alpha

Atallah

Affiliation

CERIAS

Publication Date

2002-01-01

BibTeX-formatted data

To refer to this entry, you may select and copy the text below and paste it into your BibTeX document. Note that the text may not contain all macros that BibTeX supports.

@Conference{ Atallah,
	title = "Natural Language Watermarking and Tamperproofing",
	author = "Atallah",
	year = "2002",
	abstract = "Two main results in the area of information hiding in natural language
text are presented. A semantically-based scheme dramatically improves the
information-hiding capacity of any text through two techniques: (i) modifying
the granularity of meaning of individual sentences, whereas our own previous
scheme kept the granularity fixed, and (ii) halving the number of sentences
affected by the watermark. No longer a ``long text, short watermark'' approach,
it now makes it possible to watermark short texts like wire agency reports.
Using both the above-mentioned semantic marking scheme and our previous
syntactically-based method hides information in a way that reveals any
non-trivial tampering with the text (while re-formatting is not considered to
be tampering; the problem would be solved trivially otherwise by hiding a hash
of the text) with a probability $1 - 2^{-b(n+1)}$, n being its number of
sentences and b a small positive integer based on the extent of co-referencing.",
	affiliation = "CERIAS",
	url = "http://homes.cerias.purdue.edu/~mercan/IHW-2002.pdf",
}