会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 4. 发明申请
    • SYSTEM FOR ORGANIZING, FUNCTIONALITY INDEXING AND CONSTRUCTING OF A SOURCE CODE SEARCH ENGINE AND METHOD THEREOF
    • 用于源代码搜索引擎的组织,功能性索引和构建的系统及其方法
    • WO2017134665A1
    • 2017-08-10
    • PCT/IL2017/050126
    • 2017-02-02
    • COCYCLES
    • MIZRAHI, RanBAR-ILAN, Laly
    • G06F17/30
    • G06F17/30864
    • Systems and methods are disclosed for organizing, functionality representing and constructing a search engine that identifies program source code unit behavior associated with a plurality of repositories. The systems and method include crawling a set of data entities in a repository system, each of the data entities representing one or more of a source code units and/or subsets of the source code units; parsing said set of data entities into abstract syntax trees (ASTs) architecture; modeling said set of data entities into a code graph (CG) architecture such that each one or more of a source code units and/or subsets of the source code units are set as vertexes and connections between said each one or more of a source code units and/or subsets of the source code units are set as edges; establishing type ontology (TO) architecture of said set of data entities by processing said set of data and assigning meta-data tags to each one or more of a source code units and/or subsets of the source code units, said tags representing classification attributes; generating semantic ID based on linguistic, structural and contextual analyses of said set of data entities, said semantic ID corresponding to source code functionality of said one or more of a source code units and/or subsets of the source code units, said linguistic analysis employing linguistic clues, said structural linguistic analysis employing structural clues, and said contextual analysis employing contextual clues; and organizing and storing said set of data entities in functionality representation index (FRI) architecture.
    • 公开了系统和方法,用于组织表示和构建搜索引擎的功能,该搜索引擎识别与多个存储库相关联的节目源代码单元行为。 该系统和方法包括在存储库系统中爬取一组数据实体,每个数据实体表示源代码单元和/或源代码单元的子集中的一个或多个; 将所述一组数据实体解析为抽象语法树(AST)体系结构; 将所述一组数据实体建模成代码图(CG)体系结构,使得所述源代码单元的源代码单元和/或子集中的每一个或多个被设置为所述源代码的所述每一个或多个源代码之间的顶点和连接 源代码单元的单元和/或子集被设置为边; 通过处理所述数据集合并将元数据标签分配给源代码单元和/或源代码单元的子集中的每一个或多个,来建立所述一组数据实体的类型本体(TO)体系结构,所述标签表示分类属性 ; 基于对所述数据实体集合的语言,结构和上下文分析来生成语义ID,所述语义ID对应于源代码单元和/或源代码单元的子集中的一个或多个的源代码功能,所述语言分析采用 语言线索,表示使用结构线索的结构语言学分析,以及使用语境线索的语境分析; 并且将所述一组数据实体组织并存储在功能表示索引(FRI)体系结构中。