Vector Optimization – Multiple Lanes
文章目录
- Vector优化 – 多通道
Vector优化 – 多通道
A particular instruction is carried out by an execution pipeline. For example an ADD instruction will be carried out by an ADD function unit pipeline.
一条特定的指令由执行流水线执行。例如, ADD指令将由ADD功能单元流水线执行。
If only one such execution pipeline is available, it operates on the elements of a vector register sequentially. This is shown in the figure on the left. It has a single ADD pipeline and can complete one addition per cycle.
如果只有一个这样的执行管道可用,它将按顺序对vector寄存器的元素进行操作。如左图所示。它有一个ADD管道,每个周期可以完成一次加法。
If however, multiple pipelines are available, each pipeline can consume an element of the vector register. This is shown in the figure on the right. It has four ADD pipelines can can complete four additions per cycle.
然而,如果多个流水线可用,则每个流水线可以消耗vector寄存器的一个元素。如右图所示。它有4条ADD流水线,每个周期可以完成四次加法。
The elements within the vector register are interleaved across the four p