会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明授权
    • Rewriting branch instructions using branch stubs
    • 使用分支存根重写分支指令
    • US08522225B2
    • 2013-08-27
    • US12823204
    • 2010-06-25
    • Tong ChenBrian FlachsBrad W. MichaelMark R. NutterJohn K. P. O'BrienKathryn M. O'BrienTao Zhang
    • Tong ChenBrian FlachsBrad W. MichaelMark R. NutterJohn K. P. O'BrienKathryn M. O'BrienTao Zhang
    • G06F9/45
    • G06F8/4436G06F8/433G06F8/4442
    • Mechanisms are provided for rewriting branch instructions in a portion of code. The mechanisms receive a portion of source code having an original branch instruction. The mechanisms generate a branch stub for the original branch instruction. The branch stub stores information about the original branch instruction including an original target address of the original branch instruction. Moreover, the mechanisms rewrite the original branch instruction so that a target of the rewritten branch instruction references the branch stub. In addition, the mechanisms output compiled code including the rewritten branch instruction and the branch stub for execution by a computing device. The branch stub is utilized by the computing device at runtime to determine if execution of the rewritten branch instruction can be redirected directly to a target instruction corresponding to the original target address in an instruction cache of the computing device without intervention by an instruction cache runtime system.
    • 提供了用于在一部分代码中重写分支指令的机制。 该机制接收一部分具有原始分支指令的源代码。 机制为原始分支指令生成分支存根。 分支存根存储关于原始分支指令的信息,包括原始分支指令的原始目标地址。 此外,机制重写原始分支指令,使得重写的分支指令的目标引用分支存根。 此外,机制输出编译代码,包括重写的分支指令和分支存根,以供计算设备执行。 计算设备在运行时利用分支存根来确定重写的分支指令的执行是否可以被直接重定向到与计算设备的指令高速缓存中的原始目标地址相对应的目标指令,而无需指令高速缓存运行时系统的干预 。
    • 5. 发明申请
    • System and Method for Domain Stretching for an Advanced Dual-Representation Polyhedral Loop Transformation Framework
    • 用于高级双表示多面体环转换框架的域拉伸的系统和方法
    • US20090307673A1
    • 2009-12-10
    • US11861503
    • 2007-09-26
    • Alexandre E. EichenbergerJohn K. P. O'BrienKathryn M. O'BrienNicolas T. Vasilache
    • Alexandre E. EichenbergerJohn K. P. O'BrienKathryn M. O'BrienNicolas T. Vasilache
    • G06F9/45
    • G06F8/452
    • A system and method for domain stretching for an advanced dual-representation polyhedral loop transformation framework are provided. The mechanisms of the illustrative embodiments address the weaknesses of the known polyhedral loop transformation based approaches by providing mechanisms for performing code generation transformations on individual statement instances in an intermediate representation generated by the polyhedral loop transformation optimization of the source code. These code generation transformations have the important property that they do not change program order of the statements in the intermediate representation. This property allows the result of the code generation transformations to be provided back to the polyhedral loop transformation mechanisms in a program statement view, via a new re-entrance path of the illustrative embodiments, for additional optimization. In addition, mechanisms are provided for stretching the domains of statements in a program loop view of the source code to thereby normalize the domains.
    • 提供了一种用于高级双重表示多面体环转换框架的域拉伸的系统和方法。 说明性实施例的机制通过提供用于在通过源代码的多面体环转换优化生成的中间表示中对各个语句实例执行代码生成变换的机制来解决已知的基于多面体循环变换的方法的弱点。 这些代码生成转换具有重要的属性,它们不改变中间表示中的语句的程序顺序。 该属性允许通过示例性实施例的新的重新导入路径将代码生成转换的结果提供给程序语句视图中的多面体循环变换机制,用于附加优化。 另外,提供了用于在源代码的程序循环视图中拉伸语句的域的机制,从而使域标准化。
    • 6. 发明授权
    • Optimized division of work among processors in a heterogeneous processing system
    • 在异构处理系统中优化处理器之间的分工
    • US08997071B2
    • 2015-03-31
    • US13608043
    • 2012-09-10
    • Tong ChenJohn K. P. O'BrienZehra N. Sura
    • Tong ChenJohn K. P. O'BrienZehra N. Sura
    • G06F9/45
    • G06F8/451
    • A compiler implemented by a computer performs optimized division of work across heterogeneous processors. The compiler divides source code into code sections and characterizes each of the code sections based on pre-defined criteria. Each of the code sections is characterized as at least one of: allocate to a main processor, allocate to a processing element, allocate to one of a parameterized main processor and a parameterized processing element, and indeterminate. The compiler analyzes side-effects and costs of executing the code sections on allocated processors, and transforms the code sections based on results of the analyzing. The transforming includes re-characterizing the code sections for alternate execution in a runtime environment.
    • 由计算机实现的编译器在异构处理器之间执行优化的分工。 编译器将源代码分为代码段,并根据预定义的标准来表征每个代码段。 每个代码段被表征为以下至少一个:分配给主处理器,分配给处理元件,分配给参数化主处理器和参数化处理元件中的一个,并且不确定。 编译器分析在分配的处理器上执行代码段的副作用和成本,并根据分析结果转换代码段。 转换包括重新表征代码段,以便在运行时环境中进行备用执行。
    • 8. 发明授权
    • Building approximate data dependences with a moving window
    • 使用移动窗口构建近似数据依赖关系
    • US08667260B2
    • 2014-03-04
    • US12717985
    • 2010-03-05
    • Alexandre E. EichenbergerJohn K. P. O'BrienKathryn M. O'BrienKai-Ting A. WangXiaotong Zhuang
    • Alexandre E. EichenbergerJohn K. P. O'BrienKathryn M. O'BrienKai-Ting A. WangXiaotong Zhuang
    • G06F9/44
    • G06F9/32
    • Mechanisms for building approximate data dependences using a moving look-back window are provided. The mechanisms track dependence information for memory accesses over iterations of execution of a portion of code. The mechanisms receive a memory access of an iteration of the portion of code, the memory access having an address for access the memory and an access type indicating at least one of a read or a write access type. An entry in a moving look-back window data structure is generated corresponding to a memory location accessed by the memory access. The entry comprises at least an identification of the address, the access type, and an iteration number corresponding to the iteration of the memory access. The moving look-back window data structure is utilized to determine dependence information for memory accesses over a plurality of iterations of the portion of code.
    • 提供了使用移动后视窗构建近似数据依赖关系的机制。 机制跟踪代码的一部分执行迭代的存储器访问的依赖信息。 机构接收代码部分的迭代的存储器访问,存储器访问具有用于访问存储器的地址和指示读取或写入访问类型中的至少一个的访问类型。 对应于由存储器访问访问的存储器位置产生移动后视窗数据结构中的条目。 该条目至少包括对应于存储器访问的迭代的地址的标识,访问类型和迭代号。 移动后视窗数据结构用于确定代码部分的多个迭代中的存储器访问的依赖信息。