Quick Patent Search: fast search of global patents, free patent database for commercial use - IPRDB

1. Invention application

US20180321956A1 MULTI-VERSION ASYNCHRONOUS DYNAMIC SOFTWARE UPDATE SYSTEM AND METHOD FOR APPLICATIONS WITH MULTIPLE THREADS Pending (published)
Publication No.: US20180321956A1
Publication date: 2018-11-08
Application No.: US15584785
Filing date: 2017-05-02
Applicants: Kai-Ting Amy Wang, Peng Wu, Brice Dobry, Haichuan Wang
Inventors: Kai-Ting Amy Wang, Peng Wu, Brice Dobry, Haichuan Wang
IPC classification: G06F9/445, G06F9/38, G06F9/44
CPC classification: G06F9/44536, G06F8/434, G06F8/656, G06F8/71, G06F9/223, G06F9/3893, G06F9/44521, G06F9/44526, G06F9/44552, G06F9/449, G06F9/52
Abstract: A method and system for using multiple versions of a software component, includes storing, in memory, a first function table that points to executable code in the memory for functions from a first version of the software component, and storing, in the memory, a second function table that points to executable code in the memory for functions from a second version of the software component, referencing the first function table, when running a first application thread, to execute the functions from the first version of the software component; and referencing the second function table, when running a second application thread that is active concurrently with the first application thread, to execute the functions from the second version of the software component.
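
The per-version function tables described in this abstract can be pictured with a minimal Python sketch. It is an illustration only, not the patented implementation: the function names `greet_v1` and `greet_v2`, the table layout, and the `demo` helper are all hypothetical, and plain dictionaries stand in for tables that point at executable code.

```python
import threading

# Hypothetical software component with one function, loaded in two versions.
def greet_v1(name):
    return f"hello {name}"

def greet_v2(name):
    return f"hi {name}!"

# One function table per version; both tables stay in memory at the same time.
TABLE_V1 = {"greet": greet_v1}
TABLE_V2 = {"greet": greet_v2}

def run_thread(table, name, results, key, barrier):
    # The barrier makes sure both threads are active concurrently.
    barrier.wait()
    # Each thread resolves calls through the table it was given.
    results[key] = table["greet"](name)

def demo():
    results = {}
    barrier = threading.Barrier(2)
    t1 = threading.Thread(target=run_thread, args=(TABLE_V1, "a", results, "t1", barrier))
    t2 = threading.Thread(target=run_thread, args=(TABLE_V2, "a", results, "t2", barrier))
    t1.start()
    t2.start()
    t1.join()
    t2.join()
    return results
```

Calling `demo()` runs the same call site against both versions at once; which table a thread receives is the only thing that selects the version.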

2. Granted invention patent

US08136105B2 Method to exploit superword-level parallelism using semi-isomorphic packing Lapsed
Publication No.: US08136105B2
Publication date: 2012-03-13
Application No.: US11536990
Filing date: 2006-09-29
Applicants: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu, Peng Zhao
Inventors: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu, Peng Zhao
IPC classification: G06F9/45
CPC classification: G06F8/456
Abstract: A computer program product is provided for extracting SIMD parallelism. The computer program product includes instructions for providing a stream of input code comprising basic blocks; identifying pairs of statements that are semi-isomorphic with respect to each other within a basic block; iteratively combining into packs, pairs of statements that are semi-isomorphic with respect to each other, and combining packs into combined packs; collecting packs whose statements can be scheduled together for processing; and generating SIMD instructions for each pack to provide for extracting the SIMD parallelism.
摘要翻译：提供了一种用于提取SIMD并行性的计算机程序产品。计算机程序产品包括用于提供包括基本块的输入代码流的指令; 识别在基本块内相对于彼此半同构的语句对; 迭代地组合成包，相对于半同构的语句对，以及将包合并成组合包; 收集包，其陈述可以一起安排处理; 并为每个包生成SIMD指令以提供SIMD并行性。

3. Invention application

US20110289303A1 SETJMP/LONGJMP FOR SPECULATIVE EXECUTION FRAMEWORKS Lapsed
Publication No.: US20110289303A1
Publication date: 2011-11-24
Application No.: US13026702
Filing date: 2011-02-14
Applicants: Raul Esteban Silvera, Kai-Ting Amy Wang, Peng Wu, Mark Wayne Yamashita, Xiaotong Zhuang
Inventors: Raul Esteban Silvera, Kai-Ting Amy Wang, Peng Wu, Mark Wayne Yamashita, Xiaotong Zhuang
IPC classification: G06F9/312
CPC classification: G06F9/3842, G06F9/3004, G06F9/30054, G06F9/30087, G06F9/4484
Abstract: A process for check pointing in speculative execution frameworks, identifies calls to a set of setjmp/longjmp instructions to form identified calls to setjmp/longjmp, determines a control flow path between a call to a setjmp and a longjmp pair of instructions in the identified calls to setjmp/longjmp and replaces calls to the setjmp/longjmp pair of instructions with calls to an improved_setjmp and improved_longjmp instruction pair. The process creates a context data structure in memory, computes a non-volatile save/restore set and replaces the call to improved_setjmp of the setjmp/longjmp pair of instructions with instructions to save all required non-volatile and special purpose registers and replaces a call to improved_longjmp of the setjmp/longjmp pair of instructions with instructions to restore all required non-volatile and special purpose registers and to branch to an instruction immediately following a block of code containing the call to improved_setjmp.
摘要翻译：用于检查指向推测执行框架的过程，识别对一组setjmp / longjmp指令的调用以形成对setjmp / longjmp的标识的调用，确定在所识别的呼叫中对setjmp的调用和longjmp指令之间的控制流路径到setjmp / longjmp，并且通过调用一个improved_setjmp和improved_longjmp指令对来替换对setjmp / longjmp指令对的调用。该过程在内存中创建一个上下文数据结构，计算一个非易失性存储/恢复集，并用setjmp / longjmp指令对来替换一个改进_setjmp的调用，其中包含所有需要的非易失性和特殊用途寄存器的指令，并替换一个调用到具有用于恢复所有需要的非易失性和特殊目的寄存器的指令的setjmp / longjmp指令指令的改进_longjmp，并且分支到紧跟在包含对converted_setjmp的调用的代码块之后的指令。

4. Granted invention patent

US07478377B2 SIMD code generation in the presence of optimized misaligned data reorganization Lapsed
Publication No.: US07478377B2
Publication date: 2009-01-13
Application No.: US10918996
Filing date: 2004-08-16
Applicants: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu
Inventors: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu
IPC classification: G06F9/45
CPC classification: G06F8/4452, G06F8/447
Abstract: Generating loop code to execute on Single-Instruction Multiple-Datapath (SIMD) architectures, where the loop operates on datatypes having different lengths, is disclosed. Further, a preferred embodiment of the present invention includes a novel technique to efficiently realign or shift arbitrary streams to an arbitrary offset, regardless whether the alignments or offsets are known at the compile time or not. This technique enables the application of advanced alignment optimizations to runtime alignment. This allows sequential loop code operating on datatypes of disparate length to be transformed (“simdized”) into optimized SIMD code through a fully automated process.
摘要翻译：公开了在单指令多数据路径（SIMD）架构中生成循环码，其循环对具有不同长度的数据类型进行操作。此外，本发明的优选实施例包括一种用于有效地将任意流重新对准或将任意流移动到任意偏移的新技术，无论在编译时是否知道对准或偏移。这种技术使得可以将高级对齐优化应用于运行时对齐。这允许对具有不同长度的数据类型的顺序循环代码通过完全自动化的过程进行转换（“模拟化”）成优化的SIMD代码。

5. Granted invention patent

US07395531B2 Framework for efficient code generation using loop peeling for SIMD loop code with multiple misaligned statements Lapsed
Publication No.: US07395531B2
Publication date: 2008-07-01
Application No.: US10918879
Filing date: 2004-08-16
Applicants: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu
Inventors: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu
IPC classification: G06F9/45, G06F15/00
CPC classification: G06F8/447, G06F8/4441
Abstract: A system and method is provided for vectorizing misaligned references in compiled code for SIMD architectures that support only aligned loads and stores. In this framework, a loop is first simdized as if the memory unit imposes no alignment constraints. The compiler then inserts data reorganization operations to satisfy the actual alignment requirements of the hardware. Finally, the code generation algorithm generates SIMD codes based on the data reorganization graph, addressing realistic issues such as runtime alignments, unknown loop bounds, residual iteration counts, and multiple statements with arbitrary alignment combinations. Loop peeling is used to reduce the computational overhead associated with misaligned data. A loop prologue and epilogue are peeled from individual iterations in the simdized loop, and vector-splicing instructions are applied to the peeled iterations, while the steady-state loop body incurs no additional computational overhead.
摘要翻译：提供了一种系统和方法，用于在仅支持对齐的负载和存储的SIMD架构的编译代码中向量化未对齐的引用。在这个框架中，循环首先被模拟，就好像内存单元没有对齐约束。编译器然后插入数据重组操作以满足硬件的实际对齐要求。最后，代码生成算法基于数据重组图生成SIMD代码，解决诸如运行时对齐，未知循环边界，残差迭代计数以及具有任意对齐组合的多个语句之类的现实问题。循环剥离用于减少与未对齐数据相关的计算开销。循环序言和结语在模拟循环中从单独迭代中去除，向量拼接指令被应用于剥离的迭代，而稳态循环体不引起额外的计算开销。

6. Granted invention patent

US07367026B2 Framework for integrated intra- and inter-loop aggregation of contiguous memory accesses for SIMD vectorization Lapsed
Publication No.: US07367026B2
Publication date: 2008-04-29
Application No.: US10919115
Filing date: 2004-08-16
Applicants: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu
Inventors: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu
IPC classification: G06F9/45, G06F15/00, G06F7/52
CPC classification: G06F8/4452, G06F8/445
Abstract: A method, computer program product, and information handling system for generating loop code to execute on Single-Instruction Multiple-Datapath (SIMD) architectures, where the loop contains multiple non-stride-one memory accesses that operate over a contiguous stream of memory is disclosed. A preferred embodiment identifies groups of isomorphic statements within a loop body where the isomorphic statements operate over a contiguous stream of memory over the iteration of the loop. Those identified statements are then converted into virtual-length vector operations. Next, the hardware's available vector length is used to determine a number of virtual-length vectors to aggregate into a single vector operation for each iteration of the loop. Finally, the aggregated, vectorized loop code is converted into SIMD operations.
摘要翻译：一种用于生成在单指令多数据路径（SIMD）架构上执行的循环码的方法，计算机程序产品和信息处理系统，其中循环包含在连续的存储器流上操作的多个非步进存储器访问披露优选实施例识别在循环体内同构语句的组，其中同构语句在循环的迭代上在连续的存储器流上操作。然后将那些识别的语句转换为虚拟长度向量操作。接下来，使用硬件的可用向量长度来确定多个虚拟长度向量以聚合到单个向量操作中，用于循环的每次迭代。最后，聚合的向量化循环码被转换成SIMD操作。

7. Granted invention patent

US11275671B2 Systems, methods and media for dynamically shaped tensors using liquid types In force
Publication No.: US11275671B2
Publication date: 2022-03-15
Application No.: US16939771
Filing date: 2020-07-27
Applicants: Ende Jin, Kai-Ting Amy Wang
Inventors: Ende Jin, Kai-Ting Amy Wang
IPC classification: G06F9/44, G06F11/36, G06F8/41, G06N20/00, G06N5/04, G06F8/30
Abstract: Systems, methods, and processor readable media are described for verifying software. A liquid type system is used by a programming language to allow source code to define tensor variables with dimensionality and/or shape dynamically defined at runtime. The dimensionality and shape of a tensor variable invoked in the source code, as well as the data type of the constituent elements of such a tensor variable, may be defined by a static type that may be verified at compile time.
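
The sketch below is not a liquid type system. It swaps in a much simpler technique, symbolic shape checking, to show how a shape with dimensions unknown until run time can still be checked before any tensor data exists. Shapes are tuples whose entries are either integers or symbolic dimension names; `matmul_shape` and the names `"batch"` and `"classes"` are hypothetical.

```python
def matmul_shape(lhs, rhs):
    """Check a matrix multiply on shapes alone and return the result shape.

    Each shape is a 2-tuple whose entries are ints (known sizes) or
    strs (symbolic sizes that are only fixed at run time).
    """
    if len(lhs) != 2 or len(rhs) != 2:
        raise TypeError("matmul expects two rank-2 shapes")
    if lhs[1] != rhs[0]:
        # Inner dimensions must be the same size or the same symbol.
        raise TypeError(f"inner dimensions differ: {lhs[1]!r} vs {rhs[0]!r}")
    return (lhs[0], rhs[1])
```

`matmul_shape(("batch", 128), (128, "classes"))` returns `("batch", "classes")` without knowing the batch size, while a mismatched inner dimension raises `TypeError` before any computation runs.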

8. Granted invention patent

US10430208B2 Multi-version asynchronous dynamic software update system and method for applications with multiple threads In force
Publication No.: US10430208B2
Publication date: 2019-10-01
Application No.: US15584785
Filing date: 2017-05-02
Applicants: Kai-Ting Amy Wang, Peng Wu, Brice Dobry, Haichuan Wang
Inventors: Kai-Ting Amy Wang, Peng Wu, Brice Dobry, Haichuan Wang
IPC classification: G06F9/445, G06F8/656, G06F9/448, G06F9/38, G06F8/41, G06F8/71, G06F9/22
Abstract: A method and system for using multiple versions of a software component, includes storing, in memory, a first function table that points to executable code in the memory for functions from a first version of the software component, and storing, in the memory, a second function table that points to executable code in the memory for functions from a second version of the software component, referencing the first function table, when running a first application thread, to execute the functions from the first version of the software component; and referencing the second function table, when running a second application thread that is active concurrently with the first application thread, to execute the functions from the second version of the software component.

9. Granted invention patent

US08549501B2 Framework for generating mixed-mode operations in loop-level simdization In force
Publication No.: US08549501B2
Publication date: 2013-10-01
Application No.: US10919005
Filing date: 2004-08-16
Applicants: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu
Inventors: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu
IPC classification: G06F9/45
CPC classification: G06F8/4452
Abstract: Generating mixed-mode operations in the compilation of program code for processors having vector or SIMD processing units is disclosed. In a preferred embodiment of the present invention, program instructions making up the body of a loop are abstracted into virtual vector instructions. These virtual vector instructions are treated, for initial code optimization purposes, as vector instructions (i.e., instructions written for the vector unit). The virtual vector instructions are eventually expanded into native code for the target processor, at which time a determination is made for each virtual vector instruction as to whether to expand the virtual vector instruction into native vector instructions, into native scalar instructions, into calls to pre-defined library functions, or into a combination of these. A cost model is used to determine the optimal choice of expansion based on hardware/software constraints, performance costs/benefits, and other criteria.
摘要翻译：公开了在具有向量或SIMD处理单元的处理器的程序代码的编译中产生混合模式操作。在本发明的优选实施例中，构成循环体的程序指令被抽象为虚拟向量指令。对于初始代码优化目的，将这些虚拟向量指令作为向量指令（即向量单元写入的指令）进行处理。虚拟向量指令最终被扩展为目标处理器的本地代码，此时，对于每个虚拟向量指令，确定是否将虚拟向量指令扩展为本地向量指令，进入本地标量指令，调用到前一个定义的库函数，或这些的组合。使用成本模型来确定基于硬件/软件约束，性能成本/效益和其他标准的最佳扩展选择。

10. Granted invention patent

US08245208B2 SIMD code generation for loops with mixed data lengths In force
Publication No.: US08245208B2
Publication date: 2012-08-14
Application No.: US12328730
Filing date: 2008-12-04
Applicants: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu
Inventors: Alexandre E. Eichenberger, Kai-Ting Amy Wang, Peng Wu
IPC classification: G06F9/45
CPC classification: G06F8/4452
Abstract: Generating loop code to execute on Single-Instruction Multiple-Datapath (SIMD) architectures, where the loop operates on datatypes having different lengths, is disclosed. Further, a preferred embodiment of the present invention includes a novel technique to efficiently realign or shift arbitrary streams to an arbitrary offset, regardless whether the alignments or offsets are known at the compile time or not. This technique enables the application of advanced alignment optimizations to runtime alignment. Length conversion operations, for packing and unpacking data values, are included in the alignment handling framework. These operations are formally defined in terms of standard SIMD instructions that are readily available on various SIMD platforms. This allows sequential loop code operating on datatypes of disparate length to be transformed (“simdized”) into optimized SIMD code through a fully automated process.
摘要翻译：公开了在单指令多数据路径（SIMD）架构中生成循环码，其循环对具有不同长度的数据类型进行操作。此外，本发明的优选实施例包括一种用于有效地将任意流重新对准或将任意流移动到任意偏移的新技术，无论在编译时是否已知对准或偏移。这种技术使得可以将高级对齐优化应用于运行时对齐。用于打包和解包数据值的长度转换操作包含在对齐处理框架中。这些操作根据在各种SIMD平台上容易获得的标准SIMD指令正式定义。这允许对具有不同长度的数据类型的顺序循环代码通过完全自动化的过程进行转换（“模拟化”）成优化的SIMD代码。
