专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

31. 发明授权

US11269834B2 Detecting quasi-identifiers in datasets 有权
公开(公告)号：US11269834B2
公开(公告)日：2022-03-08
申请号：US16446674
申请日：2019-06-20
申请人： International Business Machines Corporation
发明人： Stefano Braghin , Aris Gkoulalas-Divanis , Michael Wurst
IPC分类号： G06F16/00 , G06F16/22 , G06F9/50
摘要： Quasi-identifiers (QIDs) are detected in a dataset using a set of computing tasks. The dataset has a plurality of records and a set of attributes. An index is generated for the dataset. The index has an indicator for each attribute value of each record in the dataset. Each indicator specifies all the records in the dataset having the same value for the attribute. Each task is assigned an attribute combination and a subset of the plurality of records in the dataset and is passed to a thread for execution on computing resources. The executing task inspects the set of records specified by the index indicator for each attribute value in the attribute combination to produce a result. The result of at least one task identifies a unique record for the associated attribute combination. The attribute combination producing the unique record is a QID.

32. 发明授权

US10250956B2 Adaptive sampling of smart meter data 有权
公开(公告)号：US10250956B2
公开(公告)日：2019-04-02
申请号：US15844538
申请日：2017-12-16
申请人： International Business Machines Corporation
发明人： Carlos A. Alzate Perez , Francesco Fusco , Pascal Pompey , Mathieu Sinn , Michael Wurst
IPC分类号： G01D4/00 , H04Q9/00 , H04W4/70 , H04L12/24 , H04L29/08
摘要： In an approach for adaptive sampling of smart meter data, a computer retrieves one or more balancing constraints associated with one or more smart meter sensors. The computer retrieves meter sensor data from the one or more smart meter sensors according to the one or more balancing constraints. The computer determines a subsample of the meter sensor data based, at least in part, on one or more similar consumption patterns of meter sensor data, and then transmits the subsample of the meter sensor data to an optimization engine for use in solving an optimization problem.

33. 发明授权

US09934239B2 Restricting sensitive query results in information management platforms 有权
公开(公告)号：US09934239B2
公开(公告)日：2018-04-03
申请号：US14684478
申请日：2015-04-13
申请人： International Business Machines Corporation
发明人： Aris Gkoulalas-Divanis , Michael Wurst
IPC分类号： G06F7/00 , G06F17/30 , G06F21/62 , H04L29/06
CPC分类号： G06F17/30165 , G06F17/30864 , G06F21/6218 , G06F21/6245 , H04L63/105
摘要： As information becomes more accessible to the public, the ability to predict and estimate sensitive data from the data already available to the general public becomes easier. The existing privacy-preserving data mining approaches only consider the information the user is querying and do not consider the information the user already has, and how the user can use that information in combination with the query information to create sensitive data that the user should not have access to. Some embodiments of the present invention provide a query analysis (QA) program that solves the aforementioned problem by taking into account data that a user may already have, whether it is private data or data that is available to the public, and then using that data, along with the data that would be returned in the query, to determine if sensitive data could be recreated.

34. 发明申请

US20160366495A1 ADAPTIVE SAMPLING OF SMART METER DATA 有权
公开(公告)号：US20160366495A1
公开(公告)日：2016-12-15
申请号：US15247226
申请日：2016-08-25
申请人： International Business Machines Corporation
发明人： Carlos A. Alzate Perez , Francesco Fusco , Pascal Pompey , Mathieu Sinn , Michael Wurst
IPC分类号： H04Q9/00 , H04L12/24
CPC分类号： H04Q9/00 , G01D4/004 , H04L41/145 , H04L67/1095 , H04L67/125 , H04L67/22 , H04Q2209/60 , H04W4/70 , Y02B90/242 , Y02B90/246 , Y02B90/247 , Y04S20/322 , Y04S20/42 , Y04S20/50
摘要： In an approach for adaptive sampling of smart meter data, a computer retrieves one or more balancing constraints associated with one or more smart meter sensors. The computer retrieves meter sensor data from the one or more smart meter sensors according to the one or more balancing constraints. The computer determines a subsample of the meter sensor data, and then transmits the subsample of the meter sensor data to an optimization engine for use in solving an optimization problem.

35. 发明申请

US20160350377A1 ESTIMATING THE COST OF DATA-MINING SERVICES 审中-公开
标题翻译：估算数据采矿服务的成本
公开(公告)号：US20160350377A1
公开(公告)日：2016-12-01
申请号：US15149216
申请日：2016-05-09
申请人： International Business Machines Corporation
发明人： Jakub Marecek , Dimitrios Mavroeidis , Pascal Pompey , Michael Wurst
IPC分类号： G06F17/30
摘要： The cost of data-mining is estimated where data-mining services are delivered via a distributed computing system environment. System requirements are estimated for a particular data-mining task for an input data set having specified properties. Estimating system requirements includes applying a partial learning tool to operate on sample data from the input data set.
摘要翻译：在数据挖掘服务通过分布式计算系统环境传送的情况下，估计数据挖掘的成本。对于具有指定属性的输入数据集的特定数据挖掘任务，估计系统要求。估计系统要求包括应用部分学习工具对来自输入数据集的样本数据进行操作。

36. 发明申请

US20160342636A1 DETECTING QUASI-IDENTIFIERS IN DATASETS 有权
标题翻译：在数据库中检测准标识符
公开(公告)号：US20160342636A1
公开(公告)日：2016-11-24
申请号：US14719663
申请日：2015-05-22
申请人： International Business Machines Corporation
发明人： Stefano Braghin , Aris Gkoulalas-Divanis , Michael Wurst
IPC分类号： G06F17/30 , G06F9/50
CPC分类号： G06F17/30321 , G06F9/5005 , G06F9/5055
摘要： Quasi-identifiers (QIDs) are detected in a dataset using a set of computing tasks. The dataset has a plurality of records and a set of attributes. An index is generated for the dataset. The index has an indicator for each attribute value of each record in the dataset. Each indicator specifies all the records in the dataset having the same value for the attribute. Each task is assigned an attribute combination and a subset of the plurality of records in the dataset and is passed to a thread for execution on computing resources. The executing task inspects the set of records specified by the index indicator for each attribute value in the attribute combination to produce a result. The result of at least one task identifies a unique record for the associated attribute combination. The attribute combination producing the unique record is a QID.
摘要翻译：使用一组计算任务在数据集中检测准标识符（QID）。数据集具有多个记录和一组属性。为数据集生成索引。索引对数据集中每个记录的每个属性值都有一个指示符。每个指标指定数据集中具有相同值属性的所有记录。为每个任务分配了数据集中的多个记录的属性组合和子集，并被传递给一个线程以在计算资源上执行。执行任务检查由属性组合中的每个属性值由索引指示符指定的记录集合以产生结果。至少一个任务的结果识别关联的属性组合的唯一记录。产生唯一记录的属性组合是QID。

37. 发明授权

US09292798B2 Iterative active feature extraction 有权
标题翻译：迭代主动特征提取
公开(公告)号：US09292798B2
公开(公告)日：2016-03-22
申请号：US13723699
申请日：2012-12-21
申请人： International Business Machines Corporation
发明人： Christoph Lingenfelder , Pascal Pompey , Olivier Verscheure , Michael Wurst
IPC分类号： G06N5/02 , G06N99/00 , G06N5/04
CPC分类号： G06N99/005 , G06N5/02 , G06N5/025 , G06N5/043
摘要： Techniques for iterative feature extraction using domain knowledge are provided. In one aspect, a method for feature extraction is provided. The method includes the following steps. At least one query to predict at least one future value of a given value series based on a statistical model is received. At least two predictions of the future value are produced fulfilling at least the properties of 1) each being as probable as possible given the statistical model and 2) being mutually divert (in terms of numerical distance measure). A user is queried to select one of the predictions. The user may be queried for textual annotations for the predictions. The annotations may be used to identify additional covariates to create an extended set of covariates. The extended set of covariates may be used to improve the accuracy of the statistical model.
摘要翻译：提供了使用域知识进行迭代特征提取的技术。在一方面，提供了一种用于特征提取的方法。该方法包括以下步骤。接收至少一个基于统计模型来预测给定值序列的至少一个未来值的查询。至少产生两个未来价值的预测，至少满足1）的性质，每一个在统计模型中可能是可能的，2）相互转移（在数值距离测量方面）。查询用户以选择其中一个预测。可以查询用户的预测文本注释。注释可用于识别额外的协变量以创建扩展的一组协变量。扩展的协变量组可以用于提高统计模型的准确性。

38. 发明申请

US20150142511A1 RECOMMENDING AND PRICING DATASETS 审中-公开
标题翻译：推荐和定价数据
公开(公告)号：US20150142511A1
公开(公告)日：2015-05-21
申请号：US14313312
申请日：2014-06-24
申请人： International Business Machines Corporation
发明人： Aris Gkoulalas-Divanis , Michael Wurst
IPC分类号： G06Q30/02 , G06N99/00
CPC分类号： G06Q30/0201 , G06N7/005 , G06N20/00 , G06Q30/0203 , G06Q30/0206
摘要： A computer processor provides a set of datasets, including at least a first dataset, with each dataset of the set of datasets respectively being configured to allow the dataset to be presented according to multiple variations, with each variation being defined by a selection of at least one transformation. The computer processor receives customer feedback information relating to at least a first variation of the first dataset. The computer processor trains a first machine learning algorithm, based, at least in part, upon the customer feedback information. The computer processor performs, by the first machine learning algorithm, a marketing act. The marketing act includes at least one of the following: (i) defining a new variation of the first dataset, (ii) defining a new transformation for defining variations of the first dataset, (iii) recommending a predefined variation of the first dataset, and (iv) pricing a predefined variation of the first dataset.
摘要翻译：计算机处理器提供一组数据集，包括至少第一数据集，数据集集合中的每个数据集分别被配置为允许根据多个变化呈现数据集，每个变化由至少一个选择定义一个转变。计算机处理器接收与第一数据集的至少第一变体相关的客户反馈信息。计算机处理器至少部分地基于客户反馈信息来训练第一机器学习算法。计算机处理器通过第一机器学习算法执行营销行为。营销行为包括以下中的至少一个：（i）定义第一数据集的新变体，（ii）定义用于定义第一数据集的变化的新变换，（iii）推荐第一数据集的预定义变体，和（iv）对第一数据集的预定变体进行定价。

39. 发明授权

US09015183B2 Accelerating time series data base queries using dictionary based representations 有权
公开(公告)号：US09015183B2
公开(公告)日：2015-04-21
申请号：US13685263
申请日：2012-11-26
申请人： International Business Machines Corporation
发明人： Pascal Pompey , Olivier Verscheure , Michael Wurst
IPC分类号： G06F17/30
CPC分类号： G06F17/30427 , G06F17/30477
摘要： A method for accelerating time series data base queries includes segmenting an original time series of signal values into non-overlapping chunks, where a time-scale for each of the chunks is much less than the time scale of the entire time series, representing time series signal values in each chunk as a weighted superposition of atoms that are members of a shape dictionary to create a compressed time series, storing the original time series and the compressed time series into a database, determining whether a query is answerable using the compressed time series or the original time series, and whether answering the query using the compressed time series is faster. If answering the query is faster using the compressed representation, the query is executed on weight coefficients of the compressed time series to produce a query result, and the query result is translated back into an uncompressed representation.

40. 发明申请

US20140188563A1 CUSTOMER DEMOGRAPHIC DATA CHANGE DETECTION BASED ON MONITORED UTILITY CONSUMPTION 审中-公开
公开(公告)号：US20140188563A1
公开(公告)日：2014-07-03
申请号：US13728868
申请日：2012-12-27
申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION
发明人： Patrick Dantressangle , Eberhard Hechler , Martin A. Oberhofer , Michael Wurst
IPC分类号： G06Q30/02
CPC分类号： G06Q30/0201 , G06Q50/06
摘要： In general, the present disclosure describes techniques for detecting changes in demographic data of a customer based on energy consumption data of the customer. For example, a customer data management system receives energy consumption data of a customer and detects, based at least in part on the received energy consumption data of the customer, a change in demographic data associated with the customer. The customer data management system then outputs, based at least in part on the detecting, at least one demographic change report associated with the demographic data.

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式