会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • System and method for synthesizing data
    • US11501205B2
    • 2022-11-15
    • US16536538
    • 2019-08-09
    • Cigna Intellectual Property, Inc.
    • David FogartyJing Lin
    • G06N5/04G06N20/00
    • Systems and methods for constructing sets of synthetic data. A single data record is identified from a first set of data. The first set of data comprises a first plurality of data records, each of the data records including multiple items of data describing an entity. Using pattern recognition, the single data record is processed to identify a group of records from within the first set that have corresponding characteristics equivalent to the single data record. The identified group of records comprises a target set of variables and the group of records from the first set that are not identified comprises a control set of variables. The target set of variables and the control set of variables are processed, using probability estimation and optimization constraints, to determine a score for each of the records in the first set. The score describes how similar each of the records in the first set is to the single data record. The records associated with a percentage of the highest scores are identified. The data associated with the single data record is replaced with data associated with the identified records identified, item-by-item.
    • 3. 发明申请
    • SYSTEM AND METHOD FOR SYNTHESIZING DATA
    • US20230052823A1
    • 2023-02-16
    • US17967147
    • 2022-10-17
    • Cigna Intellectual Property, Inc.
    • David J. FogartyJing Lin
    • G06N20/00G06N5/04
    • Systems and methods for constructing sets of synthetic data. A single data record is identified from a first set of data. The first set of data comprises a first plurality of data records, each of the data records including multiple items of data describing an entity. Using pattern recognition, the single data record is processed to identify a group of records from within the first set that have corresponding characteristics equivalent to the single data record. The identified group of records comprises a target set of variables and the group of records from the first set that are not identified comprises a control set of variables. The target set of variables and the control set of variables are processed, using probability estimation and optimization constraints, to determine a score for each of the records in the first set. The score describes how similar each of the records in the first set is to the single data record. The records associated with a percentage of the highest scores are identified. The data associated with the single data record is replaced with data associated with the identified records identified, item-by-item.
    • 4. 发明授权
    • System and method for combining data sets
    • US09881031B1
    • 2018-01-30
    • US14627198
    • 2015-02-20
    • CIGNA Intellectual Property, Inc.
    • Jing LinDavid FogartyChit Ming YipWanyu Liao
    • G06F15/18G06F17/30G06N99/00G06N5/04
    • G06F17/30289G06N5/04G06N99/005
    • Embodiments of the invention involve receiving a first set of data describing one or more first observations and a second set of data describing one or more second observations. The first set of data comprises at least two types of data and the second set of data comprises at least two types of data. At least one of the two types of data in the first data set are common with at least one of the two types of data in the second data set. The common types of data comprise common data to the first and second sets of data. The types of data that are not common comprise exclusive data for each of the first and second sets of data. A first multiple regression model is developed for the first data set. The common data for the first data set are set as independent variables and the exclusive data for the first data set are set as dependent variables. A second multiple regression model is developed for the second data set. The common data for the second data set are set as independent variables and the exclusive data for the second data set are set as dependent variables. Prediction results of the first and second multiple regression models are received. Based on the prediction results, at least some of the one or more first observations and the one or more second observations are classified as reasonable observations, which are well-predicted observations. At least some of the one or more first observations and the one or more second observations are classified as outlier observations, which are not classified as well-predicted observations. The outlier observations are removed. The reasonable observations are assigned into intervals for each of the types of data. Based on the assignment, the observations are merged to create a third data set.