会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 6. 发明授权
    • Method for building space-splitting decision tree
    • 建立空间分裂决策树的方法
    • US06871201B2
    • 2005-03-22
    • US09918952
    • 2001-07-31
    • Philip Shi-lung YuHaixun Wang
    • Philip Shi-lung YuHaixun Wang
    • G06F7/00G06F17/30G06K9/62
    • G06F17/30705G06K9/6282Y10S707/99935Y10S707/99937Y10S707/99943
    • A method is provided for data classification that achieves improved interpretability and accuracy while preserving the efficiency and scalability of univariate decision trees. To build a compact decision tree, the method searches for clusters in subspaces to enable multivariate splitting based on weighted distances to such a cluster. To classify an instance more accurately, the method performs a nearest neighbor (NN) search among the potential nearest leaf nodes of the instance. The similarity measure used in the NN search is based on Euclidean distances defined in different subspaces for different leaf nodes. Since instances are scored by their similarity to a certain class, this approach provides an effective means for target selection that is not supported well by conventional decision trees.
    • 提供了一种用于数据分类的方法,其实现了改进的可解释性和准确性,同时保持了单变量决策树的效率和可扩展性。 为了构建一个紧凑的决策树,该方法将搜索子空间中的群集,以便根据这种群集的加权距离来启用多变量分割。 为了更精确地对实例进行分类,该方法在实例的最靠近的叶节点之间执行最近邻(NN)搜索。 NN搜索中使用的相似性度量是基于不同叶节点不同子空间中定义的欧几里德距离。 由于实例与某一类别的相似性得分,所以这种方法为常规决策树不能很好地支持目标选择提供了有效的手段。
    • 8. 发明申请
    • METHOD AND APPARATUS FOR STRUCTURAL DATA CLASSIFICATION
    • 用于结构数据分类的方法和装置
    • US20090319457A1
    • 2009-12-24
    • US12141251
    • 2008-06-18
    • Hong ChengWei FanXifeng YanPhilip Shi-lung Yu
    • Hong ChengWei FanXifeng YanPhilip Shi-lung Yu
    • G06N5/02
    • G06N99/005
    • Techniques for classifying structural data with skewed distribution are disclosed. By way of example, a method classifying structural input data comprises a computer system performing the following steps. Multiple classifiers are constructed, wherein each classifier is constructed on a subset of training data, using one or more selected composite features from the subset of training data. A consensus among the multiple classifiers is computed in accordance with a voting scheme such that at least a portion of the structural input data is assigned to a particular class in accordance with the computed consensus. Such techniques for structured data classification are capable of handling skewed class distribution and partial feature coverage issues.
    • 公开了分布具有偏斜分布的结构数据的技术。 作为示例,分类结构输入数据的方法包括执行以下步骤的计算机系统。 构建多个分类器,其中使用来自训练数据的子集的一个或多个选定的复合特征,在训练数据的子集上构建每个分类器。 根据投票方案计算多个分类器之间的共识,使得至少一部分结构输入数据根据所计算的一致性被分配给特定类别。 这种用于结构化数据分类的技术能够处理倾斜的类分布和部分特征覆盖问题。