



|             | nstruction                             | Instruction using resu                                     | ult Latency                         |
|-------------|----------------------------------------|------------------------------------------------------------|-------------------------------------|
| FP ALU OP   |                                        | FP ALU OP                                                  | 3                                   |
| FP ALU OP   |                                        | Store double                                               | 2                                   |
| Load double |                                        | FP ALU OP                                                  | 1                                   |
| Load Double |                                        | Store double                                               | 0                                   |
| Loop:       | L.D<br>ADD.D<br>S.D<br>DADDUI<br>BNE R | F0,0(R1)<br>F4,F0,F2<br>O(R1),F4<br>R1,R1,#-8<br>1,R2,Loop | or (I=1000;I>0;I++)<br>x[I]=x[I]+s; |

| °Assu<br>oper<br>intec | Exan<br>Ime that<br>ations, 2<br>Jer or bra | n <b>ple</b><br>w can sch<br>FP opera<br>anch | nedule 2<br>itions, a | 2 memory<br>and one    |     |
|------------------------|---------------------------------------------|-----------------------------------------------|-----------------------|------------------------|-----|
| Memory<br>reference 1  | Memory<br>reference 2                       | FP<br>2 operation 1                           | FP<br>op. 2           | Int. op/ Clo<br>branch | ock |
| LD F0,0(R1)            | LD F6,-8(R1)                                |                                               |                       |                        | 1   |
| LD F10,-16(R1)         | LD F14, 24(R1)                              |                                               |                       |                        | 2   |
| LD F18,-32(R1)         | LD F22,-40(R1)                              | ADDD F4,F0,F2                                 | ADDD F8,F6,I          | -2 3                   |     |
| LD F26,-48(R1)         |                                             | ADDD F12,F10,F2                               | ADDD F16,F1           | 4,F2                   | 4   |
|                        |                                             | ADDD F20,F18,F2                               | ADDD F24,F2           | 2,F2                   | 5   |
| SD 0(R1),F4            | SD -8(R1),F8                                | ADDD F28,F26,F2                               |                       |                        | 6   |
| SD -16(R1),F12         | SD -24(R1),F16                              |                                               |                       | DADD R1,R1,#-56        | 7   |
| SD 24(R1),F20          | SD 16(R1),F24                               |                                               |                       |                        | 8   |
| SD 8(R1),F28           |                                             | 7 iterations $cvcles = 1.2$                   | in 9<br>9 c/l         | BNEZ R1,LOOP           | 9   |
| Fall 07                |                                             | CSE4201                                       |                       |                        |     |







| lteration<br>number | Instructions |            | lssues at<br>clock cycle<br>number | Executes at<br>clock cycle<br>number | e clock cycle<br>number | Write CDB at<br>clock cycle<br>number | Comment          |
|---------------------|--------------|------------|------------------------------------|--------------------------------------|-------------------------|---------------------------------------|------------------|
| 1                   | LD           | R2,0(R1)   | 1                                  | 2                                    | 3                       | 4                                     | First issue      |
| . 1                 | DADDIU       | R2,R2,#1   | 1                                  | 5 🔶                                  |                         | 6                                     | Wait for LW      |
| 1                   | SD           | R2,0(R1)   | 2                                  | 3                                    | 7                       |                                       | Wait for DADDIU  |
| 1                   | DADDIU       | R1,R1,#4   | 2                                  | 3                                    |                         | 4                                     | Execute directly |
| 1                   | BNE          | R2,R3,L00P | 3                                  | 7 🖌                                  |                         |                                       | Wait for DADDIU  |
| 2                   | LD           | R2,0(R1)   | 4                                  | 8                                    | 9                       | 10                                    | Wait for BNE     |
| 2                   | DADDIU       | R2,R2,#1   | 4                                  | 11 🗸                                 |                         | 12                                    | Wait for LW      |
| 2                   | SD           | R2,0(R1)   | 5                                  | 9                                    | 13                      |                                       | Wait for DADDI   |
| 2                   | DADDIU       | R1,R1,#4   | 5                                  | 8                                    |                         | 9                                     | Wait for BNE     |
| 2                   | BNE          | R2,R3,L00P | 6                                  | 13                                   |                         |                                       | Wait for DADDIU  |
| 3                   | LD           | R2,0(R1)   | 7                                  | 14                                   | 15                      | 16                                    | Wait for BNE     |
| 3                   | DADDIU       | R2,R2,#1   | 7                                  | 17 🔸                                 |                         | 18                                    | Wait for LW      |
| 3                   | SD           | R2,0(R1)   | 8                                  | 15                                   | 19                      |                                       | Wait for DADDIU  |
| 3                   | DADDIU       | R1,R1,#4   | 8                                  | 14                                   |                         | 15                                    | Wait for BNE     |
| 3                   | BNZ          | R2,R3,L00P | 9                                  | 19 🖌                                 |                         |                                       | Wait for DADDI   |

the strength of speculation. Separate functional units for address calculation, ALU operations, and evaluation allow multiple instructions to execute in the same cycle.
Fall 07 CSE4201

| teration<br>number | Instructions |            | lssues<br>at clock<br>number | Executes<br>at clock<br>number | Read access<br>at clock<br>number | Write<br>CDB at<br>clock<br>number | Commits<br>at clock<br>number | Comment           |
|--------------------|--------------|------------|------------------------------|--------------------------------|-----------------------------------|------------------------------------|-------------------------------|-------------------|
| 1                  | LD           | R2,0(R1)   | 1                            | 2                              | 3                                 | 4                                  | 5                             | First issue       |
| 1                  | DADDIU       | R2,R2,#1   | 1                            | 5                              |                                   | 6                                  | 7                             | Wait for LW       |
| 1                  | SD           | R2,0(R1)   | 2                            | 3                              |                                   |                                    | 7                             | Wait for DADDIU   |
| 1                  | DADDIU       | R1,R1,#4   | 2                            | 3                              |                                   | 4                                  | 8                             | Commit in order   |
| 1                  | BNE          | R2,R3,L00P | 3                            | 7 🖌                            |                                   |                                    | 8                             | Wait for DADDIU   |
| 2                  | LD           | R2,0(R1)   | 4                            | 5                              | 6                                 | 7                                  | 9                             | No execute delay  |
| 2                  | DADDIU       | R2,R2,#1   | 4                            | 8                              |                                   | 9                                  | 10                            | Wait for LW       |
| 2                  | SD           | R2,0(R1)   | 5                            | 6                              |                                   |                                    | 10                            | Wait for DADDIU   |
| 2                  | DADDIU       | R1,R1,#4   | 5                            | 6                              |                                   | 7                                  | 11                            | Commit in order   |
| 2                  | BNE          | R2,R3,L00P | 6                            | 10                             |                                   |                                    | 11                            | Wait for DADDIU   |
| 3                  | LD           | R2,0(R1)   | 7                            | 8                              | 9                                 | 10                                 | 12                            | Earliest possible |
| 3                  | DADDIU       | R2,R2,#1   | 7                            | 11                             |                                   | 12                                 | 13                            | Wait for LW       |
| 3                  | SD           | R2,0(R1)   | 8                            | 9                              |                                   |                                    | 13                            | Wait for DADDIU   |
| 3                  | DADDIU       | R1,R1,#4   | 8                            | 9                              |                                   | 10                                 | 14                            | Executes earlier  |
| 3                  | BNE          | R2,R3,L00P | 9                            | 13                             |                                   |                                    | 14                            | Wait for DADDIU   |

Branches Still Single Issue Fall 07

CSE4201