会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 36. 发明申请
    • DETECTING QUASI-IDENTIFIERS IN DATASETS
    • 在数据库中检测准标识符
    • US20160342636A1
    • 2016-11-24
    • US14719663
    • 2015-05-22
    • International Business Machines Corporation
    • Stefano BraghinAris Gkoulalas-DivanisMichael Wurst
    • G06F17/30G06F9/50
    • G06F17/30321G06F9/5005G06F9/5055
    • Quasi-identifiers (QIDs) are detected in a dataset using a set of computing tasks. The dataset has a plurality of records and a set of attributes. An index is generated for the dataset. The index has an indicator for each attribute value of each record in the dataset. Each indicator specifies all the records in the dataset having the same value for the attribute. Each task is assigned an attribute combination and a subset of the plurality of records in the dataset and is passed to a thread for execution on computing resources. The executing task inspects the set of records specified by the index indicator for each attribute value in the attribute combination to produce a result. The result of at least one task identifies a unique record for the associated attribute combination. The attribute combination producing the unique record is a QID.
    • 使用一组计算任务在数据集中检测准标识符(QID)。 数据集具有多个记录和一组属性。 为数据集生成索引。 索引对数据集中每个记录的每个属性值都有一个指示符。 每个指标指定数据集中具有相同值属性的所有记录。 为每个任务分配了数据集中的多个记录的属性组合和子集,并被传递给一个线程以在计算资源上执行。 执行任务检查由属性组合中的每个属性值由索引指示符指定的记录集合以产生结果。 至少一个任务的结果识别关联的属性组合的唯一记录。 产生唯一记录的属性组合是QID。
    • 37. 发明授权
    • Iterative active feature extraction
    • 迭代主动特征提取
    • US09292798B2
    • 2016-03-22
    • US13723699
    • 2012-12-21
    • International Business Machines Corporation
    • Christoph LingenfelderPascal PompeyOlivier VerscheureMichael Wurst
    • G06N5/02G06N99/00G06N5/04
    • G06N99/005G06N5/02G06N5/025G06N5/043
    • Techniques for iterative feature extraction using domain knowledge are provided. In one aspect, a method for feature extraction is provided. The method includes the following steps. At least one query to predict at least one future value of a given value series based on a statistical model is received. At least two predictions of the future value are produced fulfilling at least the properties of 1) each being as probable as possible given the statistical model and 2) being mutually divert (in terms of numerical distance measure). A user is queried to select one of the predictions. The user may be queried for textual annotations for the predictions. The annotations may be used to identify additional covariates to create an extended set of covariates. The extended set of covariates may be used to improve the accuracy of the statistical model.
    • 提供了使用域知识进行迭代特征提取的技术。 在一方面,提供了一种用于特征提取的方法。 该方法包括以下步骤。 接收至少一个基于统计模型来预测给定值序列的至少一个未来值的查询。 至少产生两个未来价值的预测,至少满足1)的性质,每一个在统计模型中可能是可能的,2)相互转移(在数值距离测量方面)。 查询用户以选择其中一个预测。 可以查询用户的预测文本注释。 注释可用于识别额外的协变量以创建扩展的一组协变量。 扩展的协变量组可以用于提高统计模型的准确性。
    • 38. 发明申请
    • RECOMMENDING AND PRICING DATASETS
    • 推荐和定价数据
    • US20150142511A1
    • 2015-05-21
    • US14313312
    • 2014-06-24
    • International Business Machines Corporation
    • Aris Gkoulalas-DivanisMichael Wurst
    • G06Q30/02G06N99/00
    • G06Q30/0201G06N7/005G06N20/00G06Q30/0203G06Q30/0206
    • A computer processor provides a set of datasets, including at least a first dataset, with each dataset of the set of datasets respectively being configured to allow the dataset to be presented according to multiple variations, with each variation being defined by a selection of at least one transformation. The computer processor receives customer feedback information relating to at least a first variation of the first dataset. The computer processor trains a first machine learning algorithm, based, at least in part, upon the customer feedback information. The computer processor performs, by the first machine learning algorithm, a marketing act. The marketing act includes at least one of the following: (i) defining a new variation of the first dataset, (ii) defining a new transformation for defining variations of the first dataset, (iii) recommending a predefined variation of the first dataset, and (iv) pricing a predefined variation of the first dataset.
    • 计算机处理器提供一组数据集,包括至少第一数据集,数据集集合中的每个数据集分别被配置为允许根据多个变化呈现数据集,每个变化由至少一个选择定义 一个转变。 计算机处理器接收与第一数据集的至少第一变体相关的客户反馈信息。 计算机处理器至少部分地基于客户反馈信息来训练第一机器学习算法。 计算机处理器通过第一机器学习算法执行营销行为。 营销行为包括以下中的至少一个:(i)定义第一数据集的新变体,(ii)定义用于定义第一数据集的变化的新变换,(iii)推荐第一数据集的预定义变体, 和(iv)对第一数据集的预定变体进行定价。