![]() |
|
![]() |
|
|   | |
|
Improving the Performance of Large-Scale Shared Memory Systems
by
Advised by
As network hop latency rapidly approaches thousands of processor cycles,
it becomes a major factor in determining parallel programs' performance.
Unfortunately, cc-NUMA designs put the directory access into the critical
path of 3-hop(cache-to-cache) misses, which incurs high overheads.
In the producer-consumer sharing pattern, producer need to access home
directory, send invalidations to all consumers and
collect all invalidation acknowledgements before it could get the
exclusive
ownership. Similarly, consumers wishing to use the new data need to
send requests to home directory and wait for producer to downgrade its
exclusive copy which also involves 3 network hops.
|
School of Computing 50 S. Central Campus Dr. Rm. 3190 Salt Lake City, UT 84112
801-581-8224 Send comments to webmaster@cs.utah.edu
Disclaimer