The Performance Implications of Thread Management Alternatives for Shared-Memory Multiprocessors

Author

Thomas E. Anderson, Edward D. Lazowska, Henry M. Levy

Entry type

techreport

Abstract

Threads ("lightweight" processes) have become a common element of new languages and operating systems. This paper examines the performance implications of several data structure and algorithm alternatives for thread management in shared-memory multiprocessors. Both experimental measurements and analytical model projections are presented. For applications with fine-grained parallelism, small differences in thread management are shown to have significant performance impact, often posing a tradeoff between throughput and latency. Per-processor data structures can be used to improve throughput, and in some circumstances to avoid locking, improving latency as well. The method used by processors to queue for locks is also shown to affect performance significantly. Normal methods of critical resource waiting can substantially degrade performance with moderate numbers of waiting processors. We present an Ethernet-style backoff algorithm that largely eliminates this effect.
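As a rough illustration of the Ethernet-style backoff idea mentioned in the abstract, the sketch below shows a lock whose waiters retry after a random delay drawn from a window that doubles on each failed attempt. It is not the report's algorithm or code: the class name `BackoffLock`, the delay parameters, and the use of a non-blocking `threading.Lock.acquire` as a stand-in for an atomic test-and-set instruction are all assumptions made for this example.

```python
import random
import threading
import time


class BackoffLock:
    """Try-lock with randomized exponential ("Ethernet-style") backoff.

    Illustrative sketch only; names and delay values are hypothetical.
    """

    def __init__(self, base_delay=1e-6, max_delay=1e-3):
        self._flag = threading.Lock()  # stands in for an atomic test-and-set word
        self._base = base_delay        # assumed initial backoff window, in seconds
        self._max = max_delay          # assumed cap on the backoff window

    def acquire(self):
        window = self._base
        # A failed non-blocking acquire plays the role of a failed test-and-set.
        while not self._flag.acquire(blocking=False):
            # Wait a random time within the current window, then double the
            # window so repeated collisions spread the waiters further apart.
            time.sleep(random.uniform(0, window))
            window = min(window * 2, self._max)

    def release(self):
        self._flag.release()


def run_demo(num_threads=4, increments=200):
    """Increment a shared counter under the lock and return the final value."""
    lock = BackoffLock()
    counter = [0]

    def worker():
        for _ in range(increments):
            lock.acquire()
            try:
                counter[0] += 1
            finally:
                lock.release()

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return counter[0]
```

Calling `run_demo(4, 200)` exercises the lock from four threads. The point of the randomized, growing delay is that waiting processors stop hammering the lock word in lockstep, which is the kind of contention the abstract says degrades performance.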

Date

September 1988

URL

http://portal.acm.org/citation.cfm?id=75108.75378

Address

Seattle, WA 98195

Institution

University of Washington

Key alpha

Anderson

Number

88-09-04

Publication Date

2001-01-01

BibTeX-formatted data

To cite this entry, copy the text below and paste it into your BibTeX file. Note that the text may not use every macro that BibTeX supports.

@Techreport{ Anderson,
	title = "The Performance Implications of Thread Management Alternatives for Shared-Memory Multiprocessors",
	author = "Thomas E. Anderson and Edward D. Lazowska and Henry M. Levy",
	year = "1988",
	month = "September",
	address = "Seattle, WA 98195",
	institution = "University of Washington",
	number = "88-09-04",
	abstract = {Threads (``lightweight'' processes) have become a common element of new languages and
operating systems. This paper examines the performance implications of several data
structure and algorithm alternatives for thread management in shared-memory
multiprocessors. Both experimental measurements and analytical model projections are
presented.
For applications with fine-grained parallelism, small differences in thread management
are shown to have significant performance impact, often posing a tradeoff between
throughput and latency. Per-processor data structures can be used to improve
throughput, and in some circumstances to avoid locking, improving latency as well.
The method used by processors to queue for locks is also shown to affect performance
significantly. Normal methods of critical resource waiting can substantially degrade
performance with moderate numbers of waiting processors. We present an Ethernet-style
backoff algorithm that largely eliminates this effect.},
	url = "http://portal.acm.org/citation.cfm?id=75108.75378",
}