Memory data organization for improved cache performance in embedded processor applications
- 1 October 1997
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Design Automation of Electronic Systems
- Vol. 2 (4), 384-409
- https://doi.org/10.1145/268424.268464
Abstract
Code generation for embedded processors opens up the possibility for several performance optimization techniques that have been ignored by traditional compilers due to compilation time constraints. We present techniques that take into account the parameters of the data caches for organizing scalar and array variables declared in embedded code into memory, with the objective of improving data cache performance. We present techniques for clustering variables to minimize compulsory cache misses, and for solving the memory assignment problem to minimize conflict cache misses. Our experiments with benchmark code kernels from DSP and other domains on the CW4001 embedded processor from LSI Logic indicate significant improvements in data cache performance by the application of our memory organization technique.Keywords
This publication has 6 references indexed in Scilit:
- Challenges in Code Generation for Embedded ProcessorsPublished by Springer Nature ,2002
- Effective hardware-based data prefetching for high-performance processorsIEEE Transactions on Computers, 1995
- PrefacePublished by Elsevier ,1994
- Vector register allocationIEEE Transactions on Computers, 1992
- A case for direct-mapped cachesComputer, 1988
- Strategies for cache and local memory management by global program transformationJournal of Parallel and Distributed Computing, 1988