Answer: The ALU instructions of a (0,3) machine can never have a memory reference as an operand. These are typically called register-register machines.
The ALU instructions of a (1,3) machine are allowed to have up to one (1) memory reference. They are typically called memory-register machines.
So, the instructions (DADD R1, R2, R3) and (DADDI R1, R1,#10) are valid for both ISA's.
The instructions (DADD R1, (R2), R3) and (DADDI R1, (R2), #34) are only valid for the (1,3) ISA.
That is, choose A and B such that any byte loaded from an address,
, where
Answer: This answer is embedded in the answer to problem 1.1. Because memory is little endian, the least significant byte (LSB), which happens to contain the least significant bit (lsb), of a word must be stored at an address , where
That means that the MSB (the most significant byte, which contains the msb)
must be stored at address , where
Similarly, for a double word of 8 bytes, the LSB is at address
That means that the MSB (the most significant byte, which contains the msb)
must be stored at address , where
Answer: Since the actual hit time was not included in the problem, you get to have the blessed variable HT in your answer. Let AT be the memory access time. Then, we have:
While it doesn't make a difference in the answer to this problem if the hit time, HT, is in terms of number of clock cycles or time, it DOES matter if that number, 500, only applies if the HT is in terms of number of clock cycles, and not, say some portion of seconds.
Write an expression for the CPU execution time when the cache is not perfect.
Answer: This was the hard nasty question, because it made you think.
First, you are supposed to know that is the equation for CPU execution time.
Second, you are supposed to recognize that the memory related stuff has NO impact on the number of instructions (IC) and the clock cycle time (CC). So, the problem boils down to one that asks:
What is the CPI, assuming that the cache is NOT perfect?
From the wording of the problem, we have to distinguish between instruction memory access and data memory access. However, the read miss penalty is identical to the write miss penalty. So, we don't have to separate memory reads from writes.
One way to do this is to write:
Note: if the miss penalty differed for reads and writes, we'd have to break down the right hand side further into (data miss rate read miss penalty percentage of loads) added to (data miss rate write miss penalty percentage of stores)
But, we don't have that. So, we get the following pieces to work with.
Now, notice that for this problem, the hit rate (HT) is assumed to be in terms of number of clock cycles.