Quick Patent Search - Fast search of global patents, free commercial-use patent database - IPRDB

1. Invention application

US20160127398A1 Apparatus and Method for Efficient Identification of Code Similarity (in force)
Publication (announcement) No.: US20160127398A1
Publication (announcement) date: 2016-05-05
Application No.: US14925301
Filing date: 2015-10-28
Applicant: The Johns Hopkins University
Inventor: Jonathan D. Cohen
IPC classification: H04L29/06, G06F17/30
CPC classification: G06F17/3053, G06F8/71, G06F17/30477, G06F17/30864, G06F17/30876, H04L9/3231, H04L63/1416, H04L63/145
Abstract: A method for identifying similarity between query samples and stored samples in an efficiently maintained reference library may include receiving a first threshold and a second threshold, receiving a plurality of binary reference samples, and processing each reference sample of the plurality of reference samples. The processing may include operations of assigning each reference sample a respective unique identifier, producing a reference sample fingerprint for each reference sample, and registering each respective unique identifier to reference sample fingerprint pair in a reference library. The registering may include scoring the reference sample fingerprint with each previously stored fingerprint in the reference library to produce a first matching score, if the first matching score meets or exceeds the first threshold for a previously stored fingerprint, determining the reference sample fingerprint to be a duplicate of the previously stored fingerprint and recording only a unique identifier associated with the reference sample fingerprint in the reference library where the unique identifier is marked as a duplicate of the previously stored fingerprint, and otherwise, if the first matching score is less than the first threshold, storing a corresponding reference sample unique identifier to reference sample fingerprint pair in the reference library. The method may further include receiving a binary query sample and processing the binary query sample via operations including producing a query sample fingerprint from the binary query sample, scoring the query sample fingerprint with each previously stored fingerprint in the reference library to produce a second matching score, and for each previously stored fingerprint for which the second matching score meets or exceeds the second threshold, reporting a corresponding reference sample unique identifier associated with the previously stored fingerprint and the second matching score.
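The registration and query steps in this abstract can be sketched in Python as follows. This is a minimal illustration, not the patented implementation: the abstract does not say how fingerprints are produced or scored, so `fingerprint` (a set of byte 4-grams) and `score` (Jaccard similarity) are stand-in assumptions, and `ReferenceLibrary`, `dedup_threshold`, and `query_threshold` are hypothetical names for the library and the first and second thresholds.

```python
from dataclasses import dataclass, field


def fingerprint(sample: bytes, n: int = 4) -> frozenset:
    """Stand-in fingerprint (an assumption): the set of byte n-grams in the sample."""
    return frozenset(sample[i:i + n] for i in range(max(len(sample) - n + 1, 1)))


def score(a: frozenset, b: frozenset) -> float:
    """Stand-in matching score (an assumption): Jaccard similarity in [0, 1]."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


@dataclass
class ReferenceLibrary:
    dedup_threshold: float   # plays the role of the "first threshold"
    query_threshold: float   # plays the role of the "second threshold"
    fingerprints: dict = field(default_factory=dict)  # uid -> stored fingerprint
    duplicates: dict = field(default_factory=dict)    # uid -> uid it duplicates
    _next_uid: int = 0

    def register(self, sample: bytes) -> int:
        """Assign a unique id; store the fingerprint only if it is not a duplicate."""
        uid = self._next_uid
        self._next_uid += 1
        fp = fingerprint(sample)
        for stored_uid, stored_fp in self.fingerprints.items():
            if score(fp, stored_fp) >= self.dedup_threshold:
                # Duplicate: record only the id, marked against the stored entry.
                self.duplicates[uid] = stored_uid
                return uid
        self.fingerprints[uid] = fp
        return uid

    def query(self, sample: bytes) -> list:
        """Return (uid, score) for every stored fingerprint at or above the query threshold."""
        fp = fingerprint(sample)
        hits = []
        for uid, stored_fp in self.fingerprints.items():
            s = score(fp, stored_fp)
            if s >= self.query_threshold:
                hits.append((uid, s))
        return hits
```

Because duplicates are recorded by identifier only, each query is scored against one fingerprint per group of near-identical samples, which is presumably what keeps the library "efficiently maintained" as it grows.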

2. Granted invention

US09111095B2 Apparatus and method for identifying similarity via dynamic decimation of token sequence n-grams (in force)
Publication (announcement) No.: US09111095B2
Publication (announcement) date: 2015-08-18
Application No.: US14248622
Filing date: 2014-04-09
Applicant: The Johns Hopkins University
Inventor: Jonathan D. Cohen
IPC classification: G06F11/00, G06F12/04, G06F12/16, G06F7/04, H04N7/16, G06F21/56
CPC classification: G06F21/562, G06F17/277, G06F21/564
Abstract: An apparatus for identifying related code variants or text samples includes processing circuitry configured to execute instructions for receiving query binary code, processing the query binary code to generate one or more query code fingerprints comprising compressed representations of respective functional components of the query binary code, generating token sequence n-grams of the fingerprints, hashing the n-grams, partitioning samples by length to compare selected samples based on length, and identifying similarity via dynamic decimation of token sequence n-grams.
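The pipeline named in this abstract (token n-grams, hashing, length partitioning, decimation) can be illustrated with a small Python sketch. The abstract does not define "dynamic decimation", so the scheme below is an assumption: keep only the n-gram hashes whose low `level` bits are zero, and choose `level` from the sample size so that roughly `target` hashes survive. CRC32 is a stand-in hash, and all function names and the `target` parameter are hypothetical.

```python
import zlib


def ngram_hashes(tokens, n=3):
    """Hash every token n-gram to a 32-bit integer (CRC32 is a stand-in hash)."""
    return [zlib.crc32(" ".join(tokens[i:i + n]).encode())
            for i in range(len(tokens) - n + 1)]


def decimation_level(num_hashes, target=64):
    """Smallest k such that num_hashes / 2**k is at most `target`."""
    k = 0
    while (num_hashes >> k) > target:
        k += 1
    return k


def decimate(hashes, level):
    """Keep only hashes whose low `level` bits are all zero (about 1 in 2**level)."""
    mask = (1 << level) - 1
    return {h for h in hashes if h & mask == 0}


def similarity(tokens_a, tokens_b, n=3, target=64):
    """Jaccard similarity of the two decimated n-gram hash sets."""
    ha, hb = ngram_hashes(tokens_a, n), ngram_hashes(tokens_b, n)
    # Decimate both samples at the coarser of the two levels so the sets are comparable.
    level = max(decimation_level(len(ha), target), decimation_level(len(hb), target))
    sa, sb = decimate(ha, level), decimate(hb, level)
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


def length_bucket(tokens):
    """Partition samples by length: bucket index grows with log2 of the token count."""
    return len(tokens).bit_length()


def comparable(tokens_a, tokens_b):
    """Only compare samples whose length buckets are equal or adjacent."""
    return abs(length_bucket(tokens_a) - length_bucket(tokens_b)) <= 1
```

Using a power-of-two mask means the hashes kept at a coarse level are always a subset of those kept at a finer level, so two samples of different sizes can still be compared at the coarser level without rehashing.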

3. Invention application

US20150220593A1 APPARATUS AND METHOD FOR ALIGNING TOKEN SEQUENCES WITH BLOCK PERMUTATIONS (under examination, published)
Publication (announcement) No.: US20150220593A1
Publication (announcement) date: 2015-08-06
Application No.: US14614431
Filing date: 2015-02-05
Applicant: The Johns Hopkins University
Inventor: Jonathan D. Cohen
IPC classification: G06F17/30, H04L29/06, H04L29/08
CPC classification: G06F17/30386, G06F17/3033, G06F21/562, G06F21/566
Abstract: A method of determining matching between at least a first sample comprising a sequence of tokens A and a second sample comprising a sequence of tokens B may include, for monotonically decreasing values of n, performing operations including recording a subset SA of n-grams of A in a hash table LA, such that a value of each n-gram determines an index in LA and a location of each respective n-gram in A is recorded as the value in LA, recording a subset SB of n-grams of B in a hash table LB, such that a value of each n-gram determines an index in LB and a location of each respective n-gram in B is recorded as the value in LB, for each location L that is occupied in both LA and LB, examining a region in A centered on LA(L) and a region in B centered on LB(L), and reporting a largest matching region aligning LA(L) with LB(L) that does not include already-matched tokens in A or B and marking the largest matching region as matched.
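The loop described in this abstract can be sketched in Python as below. It is a simplified reading under stated assumptions, not the patented method: a Python `dict` keyed by the n-gram stands in for the hash tables LA and LB, the recorded "subset" is simply the last unmatched occurrence of each distinct n-gram, and the names `align`, `_table`, and `n_values` are hypothetical.

```python
def _table(tokens, used, n):
    """Map each n-gram made only of unmatched tokens to its (last) start position."""
    table = {}
    for i in range(len(tokens) - n + 1):
        if not any(used[i:i + n]):
            table[tuple(tokens[i:i + n])] = i
    return table


def align(a, b, n_values=(8, 4, 2, 1)):
    """Return matched blocks as (start_in_a, start_in_b, length) tuples."""
    used_a, used_b = [False] * len(a), [False] * len(b)
    blocks = []

    def free_match(x, y):
        # True when a[x] and b[y] exist, are equal, and are not yet matched.
        return (0 <= x < len(a) and 0 <= y < len(b)
                and not used_a[x] and not used_b[y] and a[x] == b[y])

    for n in n_values:  # monotonically decreasing n
        la, lb = _table(a, used_a, n), _table(b, used_b, n)
        for ngram, i in la.items():
            j = lb.get(ngram)
            if j is None or not free_match(i, j):
                continue
            # Grow the region around the anchor in both directions.
            start_i, start_j = i, j
            while free_match(start_i - 1, start_j - 1):
                start_i, start_j = start_i - 1, start_j - 1
            end_i, end_j = i + 1, j + 1
            while free_match(end_i, end_j):
                end_i, end_j = end_i + 1, end_j + 1
            length = end_i - start_i
            if length < n:
                continue  # anchor partly consumed this pass; smaller n will retry
            for k in range(length):
                used_a[start_i + k] = used_b[start_j + k] = True
            blocks.append((start_i, start_j, length))
    return blocks
```

For example, `align("A B C D E F G H".split(), "E F G H A B C D".split())` finds the two swapped four-token blocks, which a purely order-preserving alignment would not report together. Starting with large n anchors the long, distinctive blocks first, and the smaller values of n then fill in what remains.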

4. Invention application

US20140068768A1 Apparatus and Method for Identifying Related Code Variants in Binaries (in force)
Publication (announcement) No.: US20140068768A1
Publication (announcement) date: 2014-03-06
Application No.: US13784245
Filing date: 2013-03-04
Applicant: THE JOHNS HOPKINS UNIVERSITY
Inventors: Margaret F. Lospinuso, David M. Patrone, David P. Silberberg, Jonathan D. Cohen, Ryan W. Gardner, Laura J. Glendenning, Sakunthala Harshavardhana, Robert T. Hider, C. Durward McDonell, III, Dennis S. Patrone, Nathan S. Reller, Benjamin R. Salazar
IPC classification: G06F21/56
CPC classification: G06F21/562, G06F21/561
Abstract: An apparatus for identifying related code variants may include processing circuitry configured to execute instructions for receiving query binary code, processing the query binary code to generate one or more query code fingerprints comprising compressed representations of respective functional components of the query binary code, comparing the one or more query code fingerprints to at least some reference code fingerprints stored in a database to determine a similarity measure between the one or more query code fingerprints and at least some of the reference code fingerprints, and preparing at least one report based on the similarity measure.
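As a rough illustration of the compare-and-report flow in this abstract, the Python sketch below fingerprints each function of a binary and measures how many of the query's function fingerprints also appear in each reference entry. Everything concrete here is an assumption: the abstract does not say how functional components are compressed or how similarity is measured, so a truncated SHA-256 of the opcode sequence and a simple overlap fraction are stand-ins, and the names `function_fingerprint`, `report`, and `min_similarity` are hypothetical.

```python
import hashlib


def function_fingerprint(opcodes):
    """Compress one function's opcode sequence to a short hex digest (a stand-in)."""
    return hashlib.sha256(" ".join(opcodes).encode()).hexdigest()[:16]


def fingerprints(functions):
    """Fingerprint every function of a binary; `functions` is a list of opcode lists."""
    return {function_fingerprint(ops) for ops in functions}


def similarity(query_fps, reference_fps):
    """Fraction of the query's function fingerprints also present in the reference."""
    return len(query_fps & reference_fps) / len(query_fps) if query_fps else 0.0


def report(query_functions, database, min_similarity=0.5):
    """List (reference name, similarity) pairs at or above the cutoff, best first."""
    q = fingerprints(query_functions)
    rows = [(name, similarity(q, ref)) for name, ref in database.items()]
    return sorted((r for r in rows if r[1] >= min_similarity), key=lambda r: -r[1])
```

An exact digest only matches functions whose opcode sequences are identical, so this sketch detects variants that share whole functions; a real system would presumably use a fingerprint that tolerates small edits within a function.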

5. Granted invention

US10152518B2 Apparatus and method for efficient identification of code similarity (in force)
Publication (announcement) No.: US10152518B2
Publication (announcement) date: 2018-12-11
Application No.: US14926274
Filing date: 2015-10-29
Applicant: The Johns Hopkins University
Inventor: Jonathan D. Cohen
IPC classification: H04L29/06, G06F17/30, H04L9/32, G06F8/71
Abstract: A method for identifying similarity between query samples and stored samples in an efficiently maintained reference library may include receiving a binary query sample and processing the binary query sample via operations including producing a query sample fingerprint from the binary query sample, scoring the query sample fingerprint with each previously stored fingerprint in the reference library to produce a matching score, and for each previously stored fingerprint for which the matching score meets or exceeds a predetermined threshold, reporting a corresponding reference sample unique identifier associated with the previously stored fingerprint and the matching score. Each previously stored fingerprint in the reference library has been determined, prior to storage, as not being duplicative of another fingerprint in the reference library.

6. Granted invention

US09910985B2 Apparatus and method for identifying similarity via dynamic decimation of token sequence N-grams (in force)
Publication (announcement) No.: US09910985B2
Publication (announcement) date: 2018-03-06
Application No.: US14754869
Filing date: 2015-06-30
Applicant: The Johns Hopkins University
Inventor: Jonathan D. Cohen
IPC classification: G06F12/14, G06F21/56, G06F17/27
CPC classification: G06F21/562, G06F17/277, G06F21/564
Abstract: An apparatus for identifying related code variants or text samples includes processing circuitry configured to execute instructions for receiving query binary code, processing the query binary code to generate one or more query code fingerprints comprising compressed representations of respective functional components of the query binary code, generating token sequence n-grams of the fingerprints, hashing the n-grams, partitioning samples by length to compare selected samples based on length, and identifying similarity via dynamic decimation of token sequence n-grams.

7. Granted invention

US09003529B2 Apparatus and method for identifying related code variants in binaries (in force)
Publication (announcement) No.: US09003529B2
Publication (announcement) date: 2015-04-07
Application No.: US13784245
Filing date: 2013-03-04
Applicant: The Johns Hopkins University
Inventors: Margaret F. Lospinuso, David M. Patrone, David P. Silberberg, Jonathan D. Cohen, Ryan W. Gardner, Laura J. Glendenning, Sakunthala Harshavardhana, Robert T. Hider, C. Durward McDonell, III, Dennis S. Patrone, Nathan S. Reller, Benjamin R. Salazar
IPC classification: G06F21/56
CPC classification: G06F21/562, G06F21/561
Abstract: An apparatus for identifying related code variants may include processing circuitry configured to execute instructions for receiving query binary code, processing the query binary code to generate one or more query code fingerprints comprising compressed representations of respective functional components of the query binary code, comparing the one or more query code fingerprints to at least some reference code fingerprints stored in a database to determine a similarity measure between the one or more query code fingerprints and at least some of the reference code fingerprints, and preparing at least one report based on the similarity measure.

8. Granted invention

US10318523B2 Apparatus and method for aligning token sequences with block permutations (in force)
Publication (announcement) No.: US10318523B2
Publication (announcement) date: 2019-06-11
Application No.: US14614431
Filing date: 2015-02-05
Applicant: The Johns Hopkins University
Inventor: Jonathan D. Cohen
IPC classification: G06F16/24, G06F16/22, G06F21/56
Abstract: A method of determining matching between at least a first sample comprising a sequence of tokens A and a second sample comprising a sequence of tokens B may include, for monotonically decreasing values of n, performing operations including recording a subset SA of n-grams of A in a hash table LA, such that a value of each n-gram determines an index in LA and a location of each respective n-gram in A is recorded as the value in LA, recording a subset SB of n-grams of B in a hash table LB, such that a value of each n-gram determines an index in LB and a location of each respective n-gram in B is recorded as the value in LB, for each location L that is occupied in both LA and LB, examining a region in A centered on LA(L) and a region in B centered on LB(L), and reporting a largest matching region aligning LA(L) with LB(L) that does not include already-matched tokens in A or B and marking the largest matching region as matched.

9. Granted invention

US09805099B2 Apparatus and method for efficient identification of code similarity (in force)
Publication (announcement) No.: US09805099B2
Publication (announcement) date: 2017-10-31
Application No.: US14925301
Filing date: 2015-10-28
Applicant: The Johns Hopkins University
Inventor: Jonathan D. Cohen
IPC classification: G06F21/00, G06F17/30, H04L29/06, H04L9/32, G06F9/44
CPC classification: G06F17/3053, G06F8/71, G06F17/30477, G06F17/30864, G06F17/30876, H04L9/3231, H04L63/1416, H04L63/145
Abstract: A method for identifying similarity between query samples and stored samples in an efficiently maintained reference library may include receiving a first threshold and a second threshold, receiving a plurality of binary reference samples, and processing each reference sample of the plurality of reference samples. The processing may include operations of assigning each reference sample a respective unique identifier, producing a reference sample fingerprint for each reference sample, and registering each respective unique identifier to reference sample fingerprint pair in a reference library. The registering may include scoring the reference sample fingerprint with each previously stored fingerprint in the reference library to produce a first matching score, if the first matching score meets or exceeds the first threshold for a previously stored fingerprint, determining the reference sample fingerprint to be a duplicate of the previously stored fingerprint and recording only a unique identifier associated with the reference sample fingerprint in the reference library where the unique identifier is marked as a duplicate of the previously stored fingerprint, and otherwise, if the first matching score is less than the first threshold, storing a corresponding reference sample unique identifier to reference sample fingerprint pair in the reference library. The method may further include receiving a binary query sample and processing the binary query sample via operations including producing a query sample fingerprint from the binary query sample, scoring the query sample fingerprint with each previously stored fingerprint in the reference library to produce a second matching score, and for each previously stored fingerprint for which the second matching score meets or exceeds the second threshold, reporting a corresponding reference sample unique identifier associated with the previously stored fingerprint and the second matching score.

10. Invention application

US20160124966A1 Apparatus and Method for Efficient Identification of Code Similarity (under examination, published)
Publication (announcement) No.: US20160124966A1
Publication (announcement) date: 2016-05-05
Application No.: US14926274
Filing date: 2015-10-29
Applicant: The Johns Hopkins University
Inventor: Jonathan D. Cohen
IPC classification: G06F17/30, G06F9/44
CPC classification: G06F17/3053, G06F8/71, G06F17/30477, G06F17/30864, G06F17/30876, H04L9/3231, H04L63/1416, H04L63/145
Abstract: A method for identifying similarity between query samples and stored samples in an efficiently maintained reference library may include receiving a binary query sample and processing the binary query sample via operations including producing a query sample fingerprint from the binary query sample, scoring the query sample fingerprint with each previously stored fingerprint in the reference library to produce a matching score, and for each previously stored fingerprint for which the matching score meets or exceeds a predetermined threshold, reporting a corresponding reference sample unique identifier associated with the previously stored fingerprint and the matching score. Each previously stored fingerprint in the reference library has been determined, prior to storage, as not being duplicative of another fingerprint in the reference library.
