    • 7. Invention patent application
    • COOPERATIVE THREAD ARRAY REDUCTION AND SCAN OPERATIONS
    • US20110078417A1
    • 2011-03-31
    • US12890227
    • 2010-09-24
    • Brian Fahs, Ming Y. Siu, Brett W. Coon, John R. Nickolls, Lars Nyland
    • G06F9/38
    • G06F9/522, G06F8/458, G06F9/3004, G06F9/30087, G06F9/30145, G06F9/3851
    • One embodiment of the present invention sets forth a technique for performing aggregation operations across multiple threads that execute independently. Aggregation is specified as part of a barrier synchronization or barrier arrival instruction, where in addition to performing the barrier synchronization or arrival, the instruction aggregates (using reduction or scan operations) values supplied by each thread. When a thread executes the barrier aggregation instruction the thread contributes to a scan or reduction result, and waits to execute any more instructions until after all of the threads have executed the barrier aggregation instruction. A reduction result is communicated to each thread after all of the threads have executed the barrier aggregation instruction and a scan result is communicated to each thread as the barrier aggregation instruction is executed by the thread.
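The barrier-aggregation behavior described in the abstract above can be illustrated with a small software model. This is only a sketch using Python threads, not the hardware instruction the application claims: the class `BarrierAggregator`, the method `arrive_and_aggregate`, and the helper `run_demo` are hypothetical names, addition is assumed as the aggregation operator, and the scan is assumed to follow arrival order. Each thread receives its exclusive scan value as soon as it arrives, while the reduction value is read only after every thread has reached the barrier.

```python
import threading


class BarrierAggregator:
    """Toy, single-use model of a barrier that also sums one value per thread."""

    def __init__(self, num_threads):
        self._barrier = threading.Barrier(num_threads)
        self._lock = threading.Lock()
        self._running = 0  # sum of the values contributed so far

    def arrive_and_aggregate(self, value):
        # On arrival the thread contributes its value. The exclusive scan
        # result (sum over threads that arrived earlier) is known right away.
        with self._lock:
            scan = self._running
            self._running += value
        # Block until every participating thread has contributed.
        self._barrier.wait()
        # Only now is the reduction complete, so it is read after the barrier.
        total = self._running
        return scan, total


def run_demo(values):
    """Run one thread per value and return each thread's (scan, total) pair."""
    agg = BarrierAggregator(len(values))
    results = [None] * len(values)

    def worker(i):
        results[i] = agg.arrive_and_aggregate(values[i])

    threads = [threading.Thread(target=worker, args=(i,))
               for i in range(len(values))]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    # With every thread contributing 1, the scan hands out distinct slots
    # and the reduction reports how many threads took part.
    for scan, total in run_demo([1] * 8):
        print(f"slot {scan} of {total}")
```

When every thread contributes 1, the scan values form a set of distinct slot indices and the reduction is the participant count, which is a common way to let cooperating threads claim non-overlapping output positions. Which thread gets which slot depends on arrival order and varies between runs.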
    • 8. Invention patent application
    • FAST MAPPING TABLE REGISTER FILE ALLOCATION ALGORITHM FOR SIMT PROCESSORS
    • US20110072438A1
    • 2011-03-24
    • US12875944
    • 2010-09-03
    • Michael Fiyak, Ming Y. Siu
    • G06F9/46
    • G06F9/3009, G06F9/3012, G06F9/30123, G06F9/3851
    • One embodiment of the present invention sets forth a technique for allocating register file entries included in a register file to a thread group. A request to allocate a number of register file entries to the thread group is received. A required number of mapping table entries included in a register file mapping table (RFMT) is determined based on the request, where each mapping table entry included in the RFMT is associated with a different plurality of register file entries included in the register file. The RFMT is parsed to locate an available mapping table entry in the RFMT for each of the required mapping table entries. For each available mapping table entry, a register file pointer is associated with an address that corresponds to a first register file entry in the plurality of register file entries associated with the available mapping table entry.
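The allocation steps in the abstract above (size the request in mapping table entries, scan the RFMT for available entries, then point each one at the first register of its block) can be sketched as follows. This is a minimal software illustration, not the claimed hardware design: the class `RegisterFileMappingTable`, its methods, the fixed `regs_per_entry` block size, and the rule that entry `i` covers registers starting at `i * regs_per_entry` are all assumptions made for the example.

```python
class RegisterFileMappingTable:
    """Toy RFMT: each table entry stands for one fixed-size block of registers."""

    def __init__(self, num_entries, regs_per_entry):
        self.regs_per_entry = regs_per_entry
        # owner[i] is None while mapping table entry i is available.
        self.owner = [None] * num_entries

    def allocate(self, thread_group, num_registers):
        """Return base register addresses for the group, or None if it won't fit."""
        # Step 1: how many mapping table entries does the request need?
        needed = (num_registers + self.regs_per_entry - 1) // self.regs_per_entry
        # Step 2: walk the table and collect available entries.
        free = [i for i, o in enumerate(self.owner) if o is None][:needed]
        if len(free) < needed:
            return None  # not enough free blocks; nothing is claimed
        # Step 3: claim each entry and record the address of the first
        # register in the block that the entry represents.
        pointers = []
        for i in free:
            self.owner[i] = thread_group
            pointers.append(i * self.regs_per_entry)
        return pointers

    def release(self, thread_group):
        """Mark every entry held by the thread group as available again."""
        self.owner = [None if o == thread_group else o for o in self.owner]


if __name__ == "__main__":
    rfmt = RegisterFileMappingTable(num_entries=4, regs_per_entry=8)
    print(rfmt.allocate("A", 10))  # needs 2 blocks
    print(rfmt.allocate("B", 8))   # needs 1 block
    rfmt.release("A")
    print(rfmt.allocate("C", 20))  # needs 3 blocks, which need not be contiguous
```

Because the table maps a thread group to any set of free blocks, an allocation does not require contiguous registers. In the usage above, group C ends up with blocks on both sides of the block still held by group B.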