会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明授权
    • Enabling advanced analytics with large data sets
    • US11562002B2
    • 2023-01-24
    • US16256519
    • 2019-01-24
    • BUSINESS OBJECTS SOFTWARE LTD.
    • Paul PallathRouzbeh Razavi
    • G06F16/28G06F16/2458
    • The present disclosure describes methods, systems, and computer program products for enabling advanced analytics with large datasets. One computer-implemented method includes receiving, by operation of a computer system, a dataset of multiple data records, each of the plurality of data records comprising one or more features and a target variable; selecting key features among the one or more features based at least on relevance measures of the one or more features with respect to the target variable; dividing the dataset into multiple subsets; for each of the multiple subsets, identifying a number of clusters and respective centroids of the number of clusters based on the key features; identifying a number of final centroids based on the respective centroids of the number of clusters for the each of the number of subsets, the number of final centroids being respective centroids of a number of final clusters; and for each data record in the multiple subsets, assigning the data record to one of the number of final clusters based on distances between the data record and the number of final centroids.