会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Compression of small strings
    • 压缩小字符串
    • US08924446B2
    • 2014-12-30
    • US13339562
    • 2011-12-29
    • Matthew ThomasBenoit Perroud
    • Matthew ThomasBenoit Perroud
    • H03M7/30
    • H03M7/40H03M7/4031
    • A method for compressing a set of small strings may include calculating n-gram frequencies for a plurality of n-grams over the set of small strings, selecting a subset of n-grams from the plurality of n-grams based on the calculated n-gram frequencies, defining a mapping table that maps each n-gram of the subset of n-grams to a unique code, and compressing the set of small strings by replacing n-grams within each small string in the set of small strings with corresponding unique codes from the mapping table. The method may use linear optimization to select a subset of n-grams that achieves a maximum space saving amount over the set of small strings for inclusion in the mapping table. The unique codes may be variable-length one or two byte codes. The set of small strings may be domain names.
    • 用于压缩一组小串的方法可以包括:在所述一组小串上计算多个n克的n克频率,基于所计算的n-gram,从所述多个n克中选择n克的子集, 定义映射表,其将n-gram子集的每个n-gram映射到唯一的代码,并且通过用小的字符串组合中的每个小字符串中的n-gram替换相应的唯一的 来自映射表的代码。 该方法可以使用线性优化来选择在该小组中的最小空间节省量的n克的子集以包含在映射表中。 唯一代码可以是可变长度的一个或两个字节代码。 一组小字符串可能是域名。
    • 2. 发明申请
    • COMPRESSION OF SMALL STRINGS
    • 小路的压缩
    • US20130173676A1
    • 2013-07-04
    • US13339562
    • 2011-12-29
    • Matthew ThomasBenoit Perroud
    • Matthew ThomasBenoit Perroud
    • G06F17/10
    • H03M7/40H03M7/4031
    • A method for compressing a set of small strings may include calculating n-gram frequencies for a plurality of n-grams over the set of small strings, selecting a subset of n-grams from the plurality of n-grams based on the calculated n-gram frequencies, defining a mapping table that maps each n-gram of the subset of n-grams to a unique code, and compressing the set of small strings by replacing n-grams within each small string in the set of small strings with corresponding unique codes from the mapping table. The method may use linear optimization to select a subset of n-grams that achieves a maximum space saving amount over the set of small strings for inclusion in the mapping table. The unique codes may be variable-length one or two byte codes. The set of small strings may be domain names.
    • 用于压缩一组小串的方法可以包括:在所述一组小串上计算多个n克的n克频率,基于所计算的n-gram,从所述多个n克中选择n克的子集, 定义映射表,其将n-gram子集的每个n-gram映射到唯一的代码,并且通过用小的字符串组合中的每个小字符串中的n-gram替换相应的唯一的 来自映射表的代码。 该方法可以使用线性优化来选择在该小组中的最小空间节省量的n克的子集以包含在映射表中。 唯一代码可以是可变长度的一个或两个字节代码。 一组小字符串可能是域名。