In this paper the Scheduled Dataflow (SDF) architecture - a decoupled memory/execution, multithreaded architecture using non-blocking threads - is presented in detail and evaluated against Superscalar architecture. Recent focus in the field of new processor architectures is mainly on VLIW (e.g. IA-64), superscalar and superspeculative designs. This trend allows for better performance but at the expense of increased hardware complexity, and possibly higher power expenditures resulting from dynamic instruction scheduling. Our research deviates from this trend by exploring a simpler, yet powerful execution paradigm that is based on dataflow and multithreading. A program is partitioned into non-blocking execution threads. In addition, all memory accesses are decoupled from the thread's execution. Data is pre-loaded into the thread's context (registers), and all results are post-stored after the completion of the thread's execution. While multithreading and decoupling are possible with control-flow architectures, SDF makes it easier to coordinate the memory accesses and execution of a thread, as well as eliminate unnecessary dependencies among instructions. We have compared the execution cycles required for programs on SDF with the execution cycles required by programs on SimpleScalar (a superscalar simulator) by considering the essential aspects of these architectures in order to have a fair comparison. The results show that our architecture can outperform the superscalar. SDF performance scales better with the number of functional units and allow for a good exploitation of Thread Level Parallelism (TLP) and available chip area.

Kavi, K.m., Giorgi, R., Arul, J. (2001). Scheduled Dataflow: Execution paradigm, architecture, and performance evaluation. IEEE TRANSACTIONS ON COMPUTERS, 50(8), 834-846 [10.1109/TC.2001.947011].

Scheduled Dataflow: Execution paradigm, architecture, and performance evaluation

GIORGI, ROBERTO;
2001-01-01

Abstract

In this paper the Scheduled Dataflow (SDF) architecture - a decoupled memory/execution, multithreaded architecture using non-blocking threads - is presented in detail and evaluated against Superscalar architecture. Recent focus in the field of new processor architectures is mainly on VLIW (e.g. IA-64), superscalar and superspeculative designs. This trend allows for better performance but at the expense of increased hardware complexity, and possibly higher power expenditures resulting from dynamic instruction scheduling. Our research deviates from this trend by exploring a simpler, yet powerful execution paradigm that is based on dataflow and multithreading. A program is partitioned into non-blocking execution threads. In addition, all memory accesses are decoupled from the thread's execution. Data is pre-loaded into the thread's context (registers), and all results are post-stored after the completion of the thread's execution. While multithreading and decoupling are possible with control-flow architectures, SDF makes it easier to coordinate the memory accesses and execution of a thread, as well as eliminate unnecessary dependencies among instructions. We have compared the execution cycles required for programs on SDF with the execution cycles required by programs on SimpleScalar (a superscalar simulator) by considering the essential aspects of these architectures in order to have a fair comparison. The results show that our architecture can outperform the superscalar. SDF performance scales better with the number of functional units and allow for a good exploitation of Thread Level Parallelism (TLP) and available chip area.
2001
Kavi, K.m., Giorgi, R., Arul, J. (2001). Scheduled Dataflow: Execution paradigm, architecture, and performance evaluation. IEEE TRANSACTIONS ON COMPUTERS, 50(8), 834-846 [10.1109/TC.2001.947011].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/46868
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo