会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • TEXT MINING SYSTEM, A TEXT MINING METHOD AND A RECORDING MEDIUM
    • 文本挖掘系统,文本挖掘方法和记录介质
    • US20120310950A1
    • 2012-12-06
    • US13518573
    • 2010-12-15
    • Kai IshikawaShinichi AndoAkihiro Tamura
    • Kai IshikawaShinichi AndoAkihiro Tamura
    • G06F17/30
    • G06Q10/10
    • In case plural pieces of data are analyzed, parts of these pieces of data including a difference which should be compared and analyzed with priority are analyzed exhaustively, with suppressing a cost of analyzing.A text mining system includes an analysis target data pair search unit which judges whether there is a commonality in expressions among pieces of text data, the pieces of text data being included in plural pieces of analysis target data, respectively, an analysis viewpoint generation unit which generates an analysis viewpoint which is a condition to extract an expression from each of the pieces of analysis target data with a commonality in such a way that characteristic expression lists, each of which is a set of characteristic expressions satisfying a predetermined condition in text data included in the pieces of analysis target data, are different among the pieces of analysis target data, a positive example set identification unit which identifies a positive example set including an expression matching the generated analysis viewpoint in each of the pieces of analysis target data, a characteristic quantity calculation unit which calculates a characteristic quantity showing a degree of characterizing the positive example set for each of expressions in each of the pieces of analysis target data, and a characteristic expression ranking unit which extracts expressions each having the calculated characteristic quantity being equal to or greater than a predetermined threshold as characteristic expressions and provides ranks for the extracted characteristic expressions in descending order of the calculated characteristic quantity, wherein the analysis target data pair search unit extracts the analysis viewpoint for the pieces of analysis target data among which a difference in ranks provided for each of the characteristic expressions is equal to or greater than a predetermined threshold.
    • 在分析多个数据的情况下,通过分析来分析包括优先进行比较和分析的差异的这些数据数据的部分,并抑制分析成本。 文本挖掘系统包括分析对象数据对搜索单元,分析对象数据对搜索单元分别判断文本数据中的表达式是否共通,文本数据分别包括在多个分析目标数据中,分析视点生成单元, 生成分析视点,该分析视点是以具有共同性的每个分析目标数据中提取表达式的条件,使得特征表达列表中的每一个是包括文本​​数据中满足预定条件的一组特征表达式 在分析目标数据中,在分析目标数据中不同,正样本集合识别单元,其识别在每个分析目标数据中包括与生成的分析视点相匹配的表达式的肯定示例集合,特征 数量计算单元,其计算表示度o的特征量 f表征每个分析目标数据中的每个表达式的正示例集合,以及特征表达式排序单元,其提取具有等于或大于预定阈值的计算特征量的表达式,并提供等级 对于所计算的特征量的降序的提取的特征表达式,其中分析对象数据对搜索单元提取分析目标数据的分析视点,其中为每个特征表达式提供的等级的差等于或等于 大于预定阈值。
    • 2. 发明申请
    • TEXT MINING SYSTEM, TEXT MINING METHOD AND RECORDING MEDIUM
    • 文本挖掘系统,文本挖掘方法和记录介质
    • US20120254071A1
    • 2012-10-04
    • US13516641
    • 2010-12-15
    • Kai IshikawaShinichi AndoAkihiro Tamura
    • Kai IshikawaShinichi AndoAkihiro Tamura
    • G06Q40/00
    • G06F16/34
    • Disclosed are a text mining system, text mining method, and recording medium for suppressing increase in cost of analysis for an analyst even if, when analyzing a plurality of data to be analyzed, the data are to be integrally analyzed. The text mining system comprises a data set generation unit for generating a data set to be analyzed that includes data to be analyzed that include text data; and a data set search unit for searching for a data set to be analyzed of which the feature representation coverage exceeds a value given beforehand, or the cost of analysis does not exceed a value given beforehand from data sets to be analyzed generated by the data set generation unit; wherein the feature representation coverage is the ratio of the number of feature representations included in a feature representation list which is a group of feature representations, which are representations satisfying predetermined conditions from text data within the data set to be analyzed, to the number of feature representations in all data to be analyzed; and the cost of analysis is defined on the basis of the number of feature representations included in the data set to be analyzed.
    • 公开了一种文本挖掘系统,文本挖掘方法和用于抑制分析者分析成本增加的记录介质,即使当分析要分析的多个数据时,数据将被整体分析。 文本挖掘系统包括:数据集生成单元,用于生成包括文本数据的待分析数据的待分析数据集; 以及数据集搜索单元,用于搜索特征表示覆盖超过预先给出的值的分析数据集,或分析成本不超过由数据集生成的要分析的数据集预先给出的值 发电机组; 其中特征表示覆盖是包括在作为一组特征表示的特征表示列表中的特征表示的数量的比例,其是满足预定条件的表示,从要分析的数据集中的文本数据到特征的数量 所有要分析的数据的表示; 并且基于包括在要分析的数据集中的特征表示的数量来定义分析成本。
    • 5. 发明申请
    • TEXT MINING METHOD, TEXT MINING DEVICE AND TEXT MINING PROGRAM
    • 文本挖掘方法,文本挖掘设备和文本挖掘程序
    • US20120284016A1
    • 2012-11-08
    • US13511504
    • 2010-12-07
    • Akihiro TamuraKai IshikawaShinichi Ando
    • Akihiro TamuraKai IshikawaShinichi Ando
    • G06F17/27
    • G06F17/3061
    • Disclosed are a text mining method, device, and program capable of performing text mining with a specific topic as an object with high precision. An element identification unit calculates a feature degree, which is an index for indicating a degree that within a text set of interest, which is a set of text that is to be analyzed, an element of the text appears. An output unit identifies distinctive elements within the text set of interest on the basis of the calculated feature degree and outputs the identified elements. The element identification unit corrects the feature degree on the basis of a topic relatedness degree, which is a value indicating a degree related to a topic of analysis, which is a topic for which each text portion of the text being analyzed has been partitioned into predetermined units that are to be analyzed.
    • 公开了一种能够以特定主题作为对象以高精度执行文本挖掘的文本挖掘方法,设备和程序。 元素识别单元计算特征度,该特征度是用于指示感兴趣的文本集合内的程度的索引,其是要被分析的一组文本,该文本的元素出现。 输出单元基于所计算的特征度来识别所述文本集合内的不同元素,并输出所识别的元素。 元素识别单元基于主题相关度来校正特征度,该主题相关度是指示与分析主题有关的程度的值,该分析题目是被分析的文本的每个文本部分已被划分为预定的 要分析的单位。
    • 6. 发明申请
    • INFORMATION ANALYSIS APPARATUS, INFORMATION ANALYSIS METHOD, AND COMPUTER READABLE STORAGE MEDIUM
    • 信息分析设备,信息分析方法和计算机可读存储介质
    • US20120096029A1
    • 2012-04-19
    • US13380735
    • 2010-05-28
    • Akihiro TamuraKai IshikawaShinichi Ando
    • Akihiro TamuraKai IshikawaShinichi Ando
    • G06F17/30
    • G06F17/2765
    • An information analysis device (30) comprises a relevant portion identification unit (31) that compares analyzed target text with topic-related text that is written about the same event as the analyzed target text and includes information related to a specific topic, and that specifies a portion of the analyzed target text related to the topic-related text; a potential topic word extraction unit (32) that extracts a word of the specific portion; and a statistical model generation unit (33) that generates a statistical model that estimates a degree of appearance of a word on a specific topic of the analyzed target text. The statistical model generation unit (33) generates a statistical model such that degrees of appearance in a specific topic of the topic-related text word and of the extracted word are higher than those of other words.
    • 信息分析装置(30)包括相关部分识别单元(31),其将分析的目标文本与与分析的目标文本相关的事件写入的主题相关文本进行比较,并且包括与特定主题相关的信息,并且指定 分析的目标文本的一部分与主题相关的文本相关; 提取特定部分的单词的潜在主题词提取单元(32); 以及统计模型生成单元(33),其生成估计所分析的目标文本的特定主题上的单词的外观程度的统计模型。 统计模型生成单元(33)生成统计模型,使得主题相关文本单词和提取单词的特定主题中的外观程度高于其他单词的程度。
    • 8. 发明授权
    • Text mining system for analysis target data, a text mining method for analysis target data and a recording medium for recording analysis target data
    • 用于分析目标数据的文本挖掘系统,用于分析目标数据的文本挖掘方法和用于记录分析目标数据的记录介质
    • US08805853B2
    • 2014-08-12
    • US13518573
    • 2010-12-15
    • Kai IshikawaShinichi AndoAkihiro Tamura
    • Kai IshikawaShinichi AndoAkihiro Tamura
    • G06F7/00G06F17/30
    • G06Q10/10
    • A text mining system including an analysis target search unit which judges whether a commonality in expressions among text data exists, an analysis viewpoint generation unit which generates an analysis viewpoint to extract an expression from the target data, a positive example set identification unit which identifies a positive example set including an expression matching the generated analysis viewpoint in the target data, a characteristic quantity calculation unit which calculates a characteristic quantity showing a degree of characterizing the positive example set of expressions in the target data, and a characteristic expression ranking unit which extracts expressions having the calculated characteristic quantity equal to or greater than a predetermined threshold as characteristic expressions and ranks the extracted characteristic expressions, and the target search unit extracts the analysis viewpoint among which a difference in ranks provided for the characteristic expressions is equal to or greater than a predetermined threshold.
    • 一种文本挖掘系统,包括分析对象搜索单元,其判断是否存在文本数据中的表达式中的共同性,分析视点生成单元,生成从目标数据中提取表达式的分析视点;正示例集识别单元, 包括与目标数据中生成的分析视点相匹配的表达式的正示例集合;特征量计算单元,计算表示目标数据中的表达式的正示例集合的特征度的特征量;以及特征表达式排序单元,其提取 具有等于​​或大于预定阈值的计算特征量的表达式作为特征表达式并对所提取的特征表达式进行排序,并且目标搜索单元提取其中为特征表达式提供的等级的差异为 等于或大于预定阈值。
    • 9. 发明授权
    • Text mining apparatus, text mining method, and computer-readable recording medium
    • 文本挖掘装置,文本挖掘方法和计算机可读记录介质
    • US08380741B2
    • 2013-02-19
    • US13060587
    • 2009-08-28
    • Kai IshikawaAkihiro TamuraShinichi Ando
    • Kai IshikawaAkihiro TamuraShinichi Ando
    • G06F17/30
    • G06F17/277G10L15/26
    • A text mining apparatus, a text mining method, and a program are provided that enable the influence that computer processing errors have on mining results to be reduced during text mining performed on a plurality of text data pieces including a text data piece generated by computer processing. A text mining apparatus 1 to be used includes an inherent portion extraction unit 6 that, for each of a plurality of text data pieces including a text data piece generated by computer processing, extracts an inherent portion of the text data piece relative to another of the text data pieces, an inherent confidence setting unit 7 that, for each inherent portion of each of the text data pieces, sets inherent confidence indicating confidence of the inherent portion, using the confidence that has been set for each of the text data pieces, and a mining processing unit 8 that performs text mining on each inherent portion of each of the text data pieces, using the inherent confidence.
    • 提供了一种文本挖掘装置,文本挖掘方法和程序,其能够在对包括通过计算机处理产生的文本数据片段的多个文本数据片段执行的文本挖掘期间减少计算机处理错误对挖掘结果的影响 。 要使用的文本挖掘装置1包括:固有部分提取单元6,对于包括通过计算机处理产生的文本数据片的多个文本数据片段中的每一个,提取文本数据片段中的另一个的固有部分 文本数据片段,固有置信度设置单元7,对于每个文本数据片段的每个固有部分,使用为每个文本数据片段设置的置信度来设置表示固有部分的置信度的固有置信度,以及 采用处理单元8,其使用固有置信度对每个文本数据的每个固有部分执行文本挖掘。