会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • Optimized Code Generation Targeting a High Locality Software Cache
    • 针对高位置软件缓存的优化代码生成
    • US20100088673A1
    • 2010-04-08
    • US12246602
    • 2008-10-07
    • Tong ChenAlexandre E. EichenbergerMarc Gonzalez TalladaJohn K. O'BrienKathryn M. O'BrienZehra N. SuraTao Zhang
    • Tong ChenAlexandre E. EichenbergerMarc Gonzalez TalladaJohn K. O'BrienKathryn M. O'BrienZehra N. SuraTao Zhang
    • G06F9/44
    • G06F8/4442
    • Mechanisms for optimized code generation targeting a high locality software cache are provided. Original computer code is parsed to identify memory references in the original computer code. Memory references are classified as either regular memory references or irregular memory references. Regular memory references are controlled by a high locality cache mechanism. Original computer code is transformed, by a compiler, to generate transformed computer code in which the regular memory references are grouped into one or more memory reference streams, each memory reference stream having a leading memory reference, a trailing memory reference, and one or more middle memory references. Transforming of the original computer code comprises inserting, into the original computer code, instructions to execute initialization, lookup, and cleanup operations associated with the leading memory reference and trailing memory reference in a different manner from initialization, lookup, and cleanup operations for the one or more middle memory references.
    • 提供了针对高位置软件缓存的优化代码生成机制。 解析原始计算机代码以识别原始计算机代码中的内存引用。 内存引用被分类为常规内存引用或不规则内存引用。 常规内存引用由高位置缓存机制控制。 原始计算机代码由编译器转换以生成转换的计算机代码,其中常规存储器引用被分组成一个或多个存储器参考流,每个存储器参考流具有前导存储器引用,尾随存储器引用和一个或多个 中间内存引用。 原始计算机代码的转换包括将原始计算机代码中的指令以不同于初始化,查找和清除操作的方式与前导存储器引用和尾随存储器引用相关联的执行初始化,查找和清除操作的指令进行插入 或更多的中间内存引用。
    • 8. 发明授权
    • Optimized code generation targeting a high locality software cache
    • 针对高位置软件缓存的优化代码生成
    • US08561044B2
    • 2013-10-15
    • US12246602
    • 2008-10-07
    • Tong ChenAlexandre E. EichenbergerMarc Gonzalez TalladaJohn K. O'BrienKathryn M. O'BrienZehra N. SuraTao Zhang
    • Tong ChenAlexandre E. EichenbergerMarc Gonzalez TalladaJohn K. O'BrienKathryn M. O'BrienZehra N. SuraTao Zhang
    • G06F9/44
    • G06F8/4442
    • Mechanisms for optimized code generation targeting a high locality software cache are provided. Original computer code is parsed to identify memory references in the original computer code. Memory references are classified as either regular memory references or irregular memory references. Regular memory references are controlled by a high locality cache mechanism. Original computer code is transformed, by a compiler, to generate transformed computer code in which the regular memory references are grouped into one or more memory reference streams, each memory reference stream having a leading memory reference, a trailing memory reference, and one or more middle memory references. Transforming of the original computer code comprises inserting, into the original computer code, instructions to execute initialization, lookup, and cleanup operations associated with the leading memory reference and trailing memory reference in a different manner from initialization, lookup, and cleanup operations for the one or more middle memory references.
    • 提供了针对高位置软件缓存的优化代码生成机制。 解析原始计算机代码以识别原始计算机代码中的内存引用。 内存引用被分类为常规内存引用或不规则内存引用。 常规内存引用由高位置缓存机制控制。 原始计算机代码由编译器转换以生成转换的计算机代码,其中常规存储器引用被分组成一个或多个存储器参考流,每个存储器参考流具有前导存储器引用,尾随存储器引用和一个或多个 中间内存引用。 原始计算机代码的转换包括将原始计算机代码中的指令以不同于初始化,查找和清除操作的方式与前导存储器引用和尾随存储器引用相关联的执行初始化,查找和清除操作的指令进行插入 或更多的中间内存引用。
    • 9. 发明申请
    • PARALLELIZATION OF IRREGULAR REDUCTIONS VIA PARALLEL BUILDING AND EXPLOITATION OF CONFLICT-FREE UNITS OF WORK AT RUNTIME
    • 通过平行建筑和平稳利用无冲突的工作单位在运行期间的平行化
    • US20110088020A1
    • 2011-04-14
    • US12576717
    • 2009-10-09
    • Alexandre E. EichenbergerYangchun LuoJohn K. O'BrienXiaotong Zhuang
    • Alexandre E. EichenbergerYangchun LuoJohn K. O'BrienXiaotong Zhuang
    • G06F9/45
    • G06F8/456
    • An optimizing compiler device, a method, a computer program product which are capable of performing parallelization of irregular reductions. The method for performing parallelization of irregular reductions includes receiving, at a compiler, a program and selecting, at compile time, at least one unit of work (UW) from the program, each UW configured to operate on at least one reduction operation, where at least one reduction operation in the UW operates on a reduction variable whose address is determinable when running the program at a run-time. At run time, for each successive current UW, a list of reduction operations accessed by that unit of work is recorded. Further, it is determined at run time whether reduction operations accessed by a current UW conflict with any reduction operations recorded as having been accessed by prior selected units of work, and assigning the unit of work as a conflict free unit of work (CFUW) when no conflicts are found. Finally, there is scheduled, for parallel run-time operation, at least two or more processing threads to process a respective the at least two or more assigned CFUWs.
    • 优化编译器装置,方法,计算机程序产品,其能够执行不规则减少的并行化。 用于执行不规则减少的并行化的方法包括在编译器处接收程序并且在编译时选择来自程序的至少一个工作单元(UW),每个UW被配置为在至少一个简化操作上操作,其中 UW中的至少一个减少操作对于在运行时运行程序时地址是可确定的减法变量进行操作。 在运行时,对于每个连续的当前UW,记录由该工作单元访问的减少操作的列表。 此外,在运行时确定由目前的UW访问的减少操作是否与任何记录为由先前选择的工作单元访问的任何缩减操作相冲突,并且将工作单元分配为无冲突的工作单元(CFUW),当 没有发现冲突。 最后,对于并行运行时间操作,计划至少两个或更多个处理线程来处理相应的所述至少两个或更多个分配的CFUW。