会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Dimensional reduction mechanisms for representing massive communication network graphs for structural queries
    • 用于表示结构性查询的大量通信网络图的尺寸缩减机制
    • US08659604B2
    • 2014-02-25
    • US12568719
    • 2009-09-29
    • Charu C. Aggarwal
    • Charu C. Aggarwal
    • G06T11/20
    • G06F17/30572G06T11/206
    • Mechanisms are provided for transforming an original graph data set into a representative form having a smaller number of dimensions that the original graph data set. The mechanisms generate a graph transformation basis structure based on an input graph data structure. The mechanisms further transform an original graph data set based on an intersection of the graph transformation basis structure and the input graph data structure to thereby generate a transformed graph data set data structure. The transformed graph data set data structure has a reduced dimensionality from that of the input graph data structure but represents characteristics of the original graph data set. Moreover, the mechanisms perform an application specific operation on the transformed graph data set data structure to generate an output of a closest similarity record in the transformed graph data set to a target component.
    • 提供了用于将原始图形数据集变换为具有较小维数原始图形数据集的代表形式的机制。 该机制基于输入图形数据结构生成图形变换基础结构。 这些机制基于图形变换基础结构和输入图形数据结构的交点进一步变换原始图形数据集,从而生成变换图形数据集数据结构。 变换后的图形数据集数据结构具有与输入图形数据结构的维度相当的维度,但表示原始图形数据集的特征。 此外,机构对变换的图形数据集数据结构执行应用程序特定的操作,以在转换的图形数据集中产生最接近的相似度记录的输出到目标分量。
    • 2. 发明授权
    • System and method for distributed privacy preserving data mining
    • 分布式隐私保护数据挖掘的系统和方法
    • US08650213B2
    • 2014-02-11
    • US11752708
    • 2007-05-23
    • Charu C. AggarwalPhilip Shi-Lung Yu
    • Charu C. AggarwalPhilip Shi-Lung Yu
    • G06F7/00
    • G06F17/30539G06F21/6245G06F2216/03Y10S707/99931Y10S707/99932
    • Distributed privacy preserving data mining techniques are provided. A first entity of a plurality of entities in a distributed computing environment exchanges summary information with a second entity of the plurality of entities via a privacy-preserving data sharing protocol such that the privacy of the summary information is preserved, the summary information associated with an entity relating to data stored at the entity. The first entity may then mine data based on at least the summary information obtained from the second entity via the privacy-preserving data sharing protocol. The first entity may obtain, from the second entity via the privacy-preserving data sharing protocol, information relating to the number of transactions in which a particular itemset occurs and/or information relating to the number of transactions in which a particular rule is satisfied.
    • 提供分布式隐私保护数据挖掘技术。 分布式计算环境中的多个实体的第一实体通过隐私保护数据共享协议与多个实体的第二实体交换摘要信息,使得保留摘要信息的隐私,与 与实体存储的数据相关的实体。 然后,第一实体可以至少基于通过隐私保护数据共享协议从第二实体获得的摘要信息来挖掘数据。 第一实体可以通过隐私保护数据共享协议从第二实体获得与特定项目集出现的交易数量有关的信息和/或与其中满足特定规则的交易数量有关的信息。
    • 3. 发明授权
    • System and method for classifying data streams with very large cardinality
    • 用于分类具有非常大基数的数据流的系统和方法
    • US08311959B2
    • 2012-11-13
    • US13400863
    • 2012-02-21
    • Charu C AggarwalPhilip S Yu
    • Charu C AggarwalPhilip S Yu
    • G06F15/18
    • G06N99/005G06K9/6267
    • An object and attributes that describe that object are identified. The attributes are grouped into attribute patterns, and classification classes are identified. For each identified class a sketch table containing a plurality of parallel hash tables is created. For the object to be classified, each attribute pattern is processed using the all of the hash functions for each sketch table, resulting in a plurality of values under each sketch table for a single attribute pattern. The lowest value is selected for each sketch table. The distribution of values across all sketch tables is evaluated for each attribute pattern, producing a discriminatory power for each attribute pattern. Attribute patterns having a discriminatory power above a given threshold are selected and added to the associated sketch table values. The sketch table with the largest overall sum is identified, and the associated class is assigned to the object belonging to the attribute patterns.
    • 识别描述该对象的对象和属性。 这些属性被分组成属性模式,并且识别分类类。 对于每个识别的类,创建包含多个并行哈希表的草图表。 对于要分类的对象,使用每个草图表的所有散列函数处理每个属性模式,从而在单个属性模式的每个草图表下产生多个值。 为每个草图表选择最低值。 对每个属性模式评估所有草图表中的值的分布,为每个属性模式产生歧视性的权力。 选择具有高于给定阈值的辨别力的属性模式并将其添加到关联的草图表值。 识别具有最大总和的草图表,并将关联的类分配给属于属性模式的对象。
    • 4. 发明申请
    • Similarity Searching in Large Disk-Based Networks
    • 在大型基于磁盘的网络中进行相似性搜索
    • US20120269200A1
    • 2012-10-25
    • US13091244
    • 2011-04-21
    • Charu C. Aggarwal
    • Charu C. Aggarwal
    • H04L12/56
    • H04L47/00
    • Techniques for determining a shortest path in a disk-based network are provided. The techniques include creating a compressed representation of an underlying disk resident network graph, wherein creating a compressed representation of an underlying disk resident network graph comprises determining one or more dense regions in the disk resident graph and compacting the one or more dense regions into one or more compressed nodes, associating one or more node penalties with the one or more compressed nodes, wherein the one or more node penalties reflect a distance of a sub-path within a compressed node, and performing a query on the underlying disk resident network graph using the compressed representation and one or more node penalties to determine a shortest path in the disk-based network to reduce the number of accesses to a physical disk.
    • 提供用于确定基于磁盘的网络中的最短路径的技术。 所述技术包括创建底层磁盘驻留网络图的压缩表示,其中创建底层磁盘驻留网络图的压缩表示包括确定磁盘驻留图中的一个或多个密集区域并将一个或多个密集区域压缩为一个或多个密集区域 更多的压缩节点将一个或多个节点惩罚与一个或多个压缩节点相关联,其中所述一个或多个节点惩罚反映了压缩节点内的子路径的距离,并且使用 压缩表示和一个或多个节点惩罚,以确定基于磁盘的网络中的最短路径,以减少对物理磁盘的访问次数。
    • 5. 发明申请
    • System and Method for Finding Important Nodes in a Network
    • 在网络中查找重要节点的系统和方法
    • US20120218908A1
    • 2012-08-30
    • US13036083
    • 2011-02-28
    • Charu C. Aggarwal
    • Charu C. Aggarwal
    • H04L12/26
    • H04L51/32H04L51/14
    • Techniques for optimizing steady state flow of a network are provided. The techniques include determining a first set of two or more nodes in a network, computing a steady-state flow probability of the first set of two or more nodes, and iteratively interchanging nodes from a second set of two or more nodes into the first set of two or more nodes to determine an optimum total steady state flow of the network, wherein determining an optimum total steady-state flow of the network comprises iteratively interchanging nodes until no additional improvements in steady-state flow over the computed steady-state flow probability can be obtained.
    • 提供了优化网络稳态流的技术。 这些技术包括确定网络中的两个或更多个节点的第一组,计算第二组两个或多个节点的稳态流概率,以及将节点从第二组两个或多个节点迭代地交换到第一组中 以确定网络的最佳总稳态流,其中确定网络的最佳总稳态流包括迭代交换节点,直到在所计算的稳态流概率上没有对稳态流的额外改进 可以获得。
    • 6. 发明申请
    • Dimensional Reduction Mechanisms for Representing Massive Communication Network Graphs for Structural Queries
    • 用于表示大量通信网络图的结构查询的维度缩减机制
    • US20110074786A1
    • 2011-03-31
    • US12568719
    • 2009-09-29
    • Charu C. Aggarwal
    • Charu C. Aggarwal
    • G06T11/20
    • G06F17/30572G06T11/206
    • Mechanisms are provided for transforming an original graph data set into a representative form having a smaller number of dimensions that the original graph data set. The mechanisms generate a graph transformation basis structure based on an input graph data structure. The mechanisms further transform an original graph data set based on an intersection of the graph transformation basis structure and the input graph data structure to thereby generate a transformed graph data set data structure. The transformed graph data set data structure has a reduced dimensionality from that of the input graph data structure but represents characteristics of the original graph data set. Moreover, the mechanisms perform an application specific operation on the transformed graph data set data structure to generate an output of a closest similarity record in the transformed graph data set to a target component.
    • 提供了用于将原始图形数据集变换为具有较小维数原始图形数据集的代表形式的机制。 该机制基于输入图形数据结构生成图形变换基础结构。 这些机制基于图形变换基础结构和输入图形数据结构的交点进一步变换原始图形数据集,从而生成变换图形数据集数据结构。 变换后的图形数据集数据结构具有与输入图形数据结构的维度相当的维度,但表示原始图形数据集的特征。 此外,机构对变换的图形数据集数据结构执行应用程序特定的操作,以在转换的图形数据集中产生最接近的相似度记录的输出到目标分量。
    • 9. 发明授权
    • Methods and apparatus for generating decision trees with discriminants and employing same in data classification
    • 用于生成具有歧视性的决策树并在数据分类中采用相同的方法和装置
    • US07716154B2
    • 2010-05-11
    • US11841221
    • 2007-08-20
    • Charu C. AggarwalPhilip Shi-Lung Yu
    • Charu C. AggarwalPhilip Shi-Lung Yu
    • G06N5/00
    • G06K9/6282G06F17/3061G06F2216/03Y10S707/99936
    • Methods and apparatus are provided for generating a decision trees using linear discriminant analysis and implementing such a decision tree in the classification (also referred to as categorization) of data. The data is preferably in the form of multidimensional objects, e.g., data records including feature variables and class variables in a decision tree generation mode, and data records including only feature variables in a decision tree traversal mode. Such an inventive approach, for example, creates more effective supervised classification systems. In general, the present invention comprises splitting a decision tree, recursively, such that the greatest amount of separation among the class values of the training data is achieved. This is accomplished by finding effective combinations of variables in order to recursively split the training data and create the decision tree. The decision tree is then used to classify input testing data.
    • 提供了用于使用线性判别分析生成决策树并且在分类(也称为分类))中实现这样的决策树的方法和装置。 数据优选地以多维对象的形式,例如包括决策树生成模式中的特征变量和类变量的数据记录,以及仅包括决策树遍历模式中的特征变量的数据记录。 例如,这种创造性的方法创建更有效的监督分类系统。 通常,本发明包括分解决策树,递归地分割,使得实现训练数据的类值之间的最大分离量。 这是通过找到变量的有效组合来实现的,以便递归地分割训练数据并创建决策树。 然后使用决策树对输入测试数据进行分类。