会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Apparatus and method to support pipelining of differing-latency instructions in a multithreaded processor
    • 在多线程处理器中支持不同延迟指令流水线的装置和方法
    • US07478225B1
    • 2009-01-13
    • US10881071
    • 2004-06-30
    • Jeffrey S. BrooksChristopher H. OlsonRobert T. Golla
    • Jeffrey S. BrooksChristopher H. OlsonRobert T. Golla
    • G06F9/30
    • G06F9/3836G06F9/3851G06F9/3857G06F9/3873
    • An apparatus and method to support pipelining of variable-latency instructions in a multithreaded processor. In one embodiment, a processor may include instruction fetch logic configured to issue a first and a second instruction from different ones of a plurality of threads during successive cycles. The processor may also include first and second execution units respectively configured to execute shorter-latency and longer-latency instructions and to respectively write shorter-latency or longer-latency instruction results to a result write port during a first or second writeback stage. The first writeback stage may occur a fewer number of cycles after instruction issue than the second writeback stage. The instruction fetch logic may be further configured to guarantee result write port access by the second execution unit during the second writeback stage by preventing the shorter-latency instruction from issuing during a cycle for which the first writeback stage collides with the second writeback stage.
    • 支持多线程处理器中可变延迟指令流水线的装置和方法。 在一个实施例中,处理器可以包括指令提取逻辑,其被配置为在连续循环期间从多个线程中的不同线程发出第一和第二指令。 处理器还可以包括第一和第二执行单元,其分别被配置为执行较短延迟和较长延迟的指令,并且在第一或第二回写阶段期间分别将较短等待时间或更长延迟的指令结果写入结果写入端口。 指令发布后的第一个回写阶段可能发生的次数比第二个回写阶段少。 指令提取逻辑可以被进一步配置为通过在第一写回阶段与第二回写阶段相冲突的周期期间防止短暂延迟指令发出来保证第二执行单元在第二写回阶段期间的结果写入端口访问。
    • 2. 发明授权
    • Method for selecting between divide instructions associated with respective threads in a multi-threaded processor
    • 用于在多线程处理器中与相应线程相关联的除法指令之间进行选择的方法
    • US07941642B1
    • 2011-05-10
    • US10881216
    • 2004-06-30
    • Robert T. GollaJeffrey S. BrooksChristopher H. Olson
    • Robert T. GollaJeffrey S. BrooksChristopher H. Olson
    • G06F9/30
    • G06F9/3001G06F9/3851
    • In one embodiment, a multithreaded processor includes a multithreaded instruction source that may provide a plurality of instructions each corresponding to a respective one of a plurality of threads. The multithreaded processor also includes a pick unit coupled to the multithreaded instruction source. The pick unit may select in a given cycle, a first divide instruction corresponding to one thread of the plurality of threads and a second divide instruction corresponding to another thread of the plurality of threads based upon a thread selection algorithm. Further, the multithreaded processor includes a storage coupled to a functional unit including a divider configured to execute the first divide instruction and the second divide instruction. The storage may store one of the first and the second divide instructions during execution of the other of the first and the second divide instructions.
    • 在一个实施例中,多线程处理器包括多线程指令源,其可以提供多个指令,每个指令对应于多个线程中的相应一个线程。 多线程处理器还包括耦合到多线程指令源的拾取单元。 拾取单元可以在给定周期中选择对应于多个线程中的一个线程的第一除法指令和基于线程选择算法对应于多个线程中的另一线程的第二除法指令。 此外,多线程处理器包括耦合到功能单元的存储器,该功能单元包括被配置为执行第一除法指令和第二除法指令的分配器。 存储器可以在执行第一和第二除法指令中的另一个指令期间存储第一和第二除法指令之一。
    • 3. 发明申请
    • PROCESSOR AND METHOD FOR IMPLEMENTING INSTRUCTION SUPPORT FOR MULTIPLICATION OF LARGE OPERANDS
    • 用于实施大规模操作的指导性支持的处理器和方法
    • US20100325188A1
    • 2010-12-23
    • US12488372
    • 2009-06-19
    • Christopher H. OlsonJeffrey S. BrooksRobert T. GollaPaul J. Jordan
    • Christopher H. OlsonJeffrey S. BrooksRobert T. GollaPaul J. Jordan
    • G06F7/52
    • G06F7/4876G06F2207/382
    • A processor including instruction support for implementing large-operand multiplication may issue, for execution, programmer-selectable instructions from a defined instruction set architecture (ISA). The processor may include an instruction execution unit comprising a hardware multiplier datapath circuit, where the hardware multiplier datapath circuit is configured to multiply operands having a maximum number of bits M. In response to receiving a single instance of a large-operand multiplication instruction defined within the ISA, wherein at least one of the operands of the large-operand multiplication instruction includes more than the maximum number of bits M, the instruction execution unit is configured to multiply operands of the large-operand multiplication instruction within the hardware multiplier datapath circuit to determine a result of the large-operand multiplication instruction without execution of programmer-selected instructions within the ISA other than the large-operand multiplication instruction.
    • 包括用于实现大操作数乘法的指令支持的处理器可以从定义的指令集架构(ISA)发出用于执行编程器可选择指令的执行。 处理器可以包括指令执行单元,其包括硬件乘法器数据路径电路,其中硬件乘法器数据路径电路被配置为对具有最大位数M的操作数进行乘法。响应于接收到在其中定义的大操作数乘法指令的单个实例 所述ISA,其中所述大操作数乘法指令的操作数中的至少一个包括多于所述最大位数M,所述指令执行单元被配置为将所述大操作数乘法指令在所述硬件乘法器数据通路电路内的操作数乘以 确定大操作数乘法指令的结果,而不在大操作数乘法指令之外执行ISA内的编程器选择指令。
    • 6. 发明授权
    • Processor and method for implementing instruction support for multiplication of large operands
    • 用于实现大操作数乘法的指令支持的处理器和方法
    • US08438208B2
    • 2013-05-07
    • US12488372
    • 2009-06-19
    • Christopher H. OlsonJeffrey S. BrooksRobert T. GollaPaul J. Jordan
    • Christopher H. OlsonJeffrey S. BrooksRobert T. GollaPaul J. Jordan
    • G06F7/52G06F7/38
    • G06F7/4876G06F2207/382
    • A processor including instruction support for implementing large-operand multiplication may issue, for execution, programmer-selectable instructions from a defined instruction set architecture (ISA). The processor may include an instruction execution unit comprising a hardware multiplier datapath circuit, where the hardware multiplier datapath circuit is configured to multiply operands having a maximum number of bits M. In response to receiving a single instance of a large-operand multiplication instruction defined within the ISA, wherein at least one of the operands of the large-operand multiplication instruction includes more than the maximum number of bits M, the instruction execution unit is configured to multiply operands of the large-operand multiplication instruction within the hardware multiplier datapath circuit to determine a result of the large-operand multiplication instruction without execution of programmer-selected instructions within the ISA other than the large-operand multiplication instruction.
    • 包括用于实现大操作数乘法的指令支持的处理器可以从定义的指令集架构(ISA)发出用于执行编程器可选择指令的执行。 处理器可以包括指令执行单元,其包括硬件乘法器数据路径电路,其中硬件乘法器数据路径电路被配置为对具有最大位数M的操作数进行乘法。响应于接收到在其中定义的大操作数乘法指令的单个实例 所述ISA,其中所述大操作数乘法指令的操作数中的至少一个包括多于所述最大位数M,所述指令执行单元被配置为将所述大操作数乘法指令在所述硬件乘法器数据通路电路内的操作数乘以 确定大操作数乘法指令的结果,而不在大操作数乘法指令之外执行ISA内的编程器选择指令。
    • 7. 发明授权
    • Handling multi-cycle integer operations for a multi-threaded processor
    • 处理多线程处理器的多循环整数运算
    • US08195919B1
    • 2012-06-05
    • US11927177
    • 2007-10-29
    • Christopher H. OlsonRobert T. GollaManish ShahJeffrey S. Brooks
    • Christopher H. OlsonRobert T. GollaManish ShahJeffrey S. Brooks
    • G06F13/00
    • G06F12/0842G06F9/3001G06F9/30043G06F9/3824G06F12/0855
    • Determining an effective address of a memory with a three-operand add operation in single execution cycle of a multithreaded processor that can access both segmented memory and non-segmented memory. During that cycle, the processor determines whether a memory segment base is zero. If the segment base is zero, the processor can access a memory location at the effective address without adding the segment base. If the segment base is not zero, such as when executing legacy code, the processor consumes another cycle to add the segment base to the effective address. Similarly, the processor consumes another cycle if the effective address or the linear address is misaligned. An integer execution unit that performs the three-operand add using a carry-save adder coupled to a carry look-ahead adder. If the segment base is not zero, the effective address is fed back through the integer execution unit to add the segment base.
    • 在可以访问分段存储器和非分段存储器的多线程处理器的单个执行周期中确定具有三操作数添加操作的存储器的有效地址。 在该周期期间,处理器确定存储器段基数是否为零。 如果分段基数为零,则处理器可以在有效地址的情况下访问存储器位置,而不添加分段基。 如果段基数不为零,例如执行遗留代码时,处理器消耗另一个周期,将段基数添加到有效地址。 类似地,如果有效地址或线性地址不对齐,则处理器消耗另一个周期。 整数执行单元,其使用耦合到进位先行加法器的进位保存加法器来执行三运算加法。 如果段基数不为零,则通过整数执行单元反馈有效地址以添加段基。
    • 10. 发明申请
    • SINGLE CYCLE DATA MOVEMENT BETWEEN GENERAL PURPOSE AND FLOATING-POINT REGISTERS
    • 一般用途和浮点注册机之间的单循环数据移动
    • US20100306510A1
    • 2010-12-02
    • US12476636
    • 2009-06-02
    • Christopher OlsonRobert T. GollaJeffrey S. Brooks
    • Christopher OlsonRobert T. GollaJeffrey S. Brooks
    • G06F9/302G06F9/30
    • G06F9/30032G06F9/30014G06F9/3013G06F9/384G06F9/3851G06F9/3885
    • Systems and methods for providing single cycle movement of data between a floating-point register file (FRF) and a general purpose or integer register file (RF) of a microprocessor system are provided. The system may include an integer execution unit operative to execute instructions with single cycle latency, a floating-point execution unit, a working register file (WRF), an FRF, and an IRF. To achieve the single cycle movement functionality, the integer execution unit may physically own the WRF, IRF, and FRF, and may monitor and control any dependencies between them. Thus, since the integer execution unit has direct read access to both the IRF and the FRF, data may be moved between the two register files using the single cycle operation of the integer execution unit, without the need to store and load the data from memory.
    • 提供了一种用于在微处理器系统的浮点寄存器文件(FRF)和通用或整数寄存器文件(RF)之间提供单周期数据移动的系统和方法。 系统可以包括可执行具有单周期延迟的指令的整数执行单元,浮点执行单元,工作寄存器文件(WRF),FRF和IRF。 为了实现单循环移动功能,整数执行单元可以物理拥有WRF,IRF和FRF,并且可以监视和控制它们之间的任何依赖关系。 因此,由于整数执行单元具有对IRF和FRF两者的直接读取访问,所以可以使用整数执行单元的单周期操作在两个寄存器文件之间移动数据,而不需要从存储器存储和加载数据 。