专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明申请

US20120310950A1 TEXT MINING SYSTEM, A TEXT MINING METHOD AND A RECORDING MEDIUM 有权
标题翻译：文本挖掘系统，文本挖掘方法和记录介质
公开(公告)号：US20120310950A1
公开(公告)日：2012-12-06
申请号：US13518573
申请日：2010-12-15
申请人： Kai Ishikawa , Shinichi Ando , Akihiro Tamura
发明人： Kai Ishikawa , Shinichi Ando , Akihiro Tamura
IPC分类号： G06F17/30
CPC分类号： G06Q10/10
摘要： In case plural pieces of data are analyzed, parts of these pieces of data including a difference which should be compared and analyzed with priority are analyzed exhaustively, with suppressing a cost of analyzing.A text mining system includes an analysis target data pair search unit which judges whether there is a commonality in expressions among pieces of text data, the pieces of text data being included in plural pieces of analysis target data, respectively, an analysis viewpoint generation unit which generates an analysis viewpoint which is a condition to extract an expression from each of the pieces of analysis target data with a commonality in such a way that characteristic expression lists, each of which is a set of characteristic expressions satisfying a predetermined condition in text data included in the pieces of analysis target data, are different among the pieces of analysis target data, a positive example set identification unit which identifies a positive example set including an expression matching the generated analysis viewpoint in each of the pieces of analysis target data, a characteristic quantity calculation unit which calculates a characteristic quantity showing a degree of characterizing the positive example set for each of expressions in each of the pieces of analysis target data, and a characteristic expression ranking unit which extracts expressions each having the calculated characteristic quantity being equal to or greater than a predetermined threshold as characteristic expressions and provides ranks for the extracted characteristic expressions in descending order of the calculated characteristic quantity, wherein the analysis target data pair search unit extracts the analysis viewpoint for the pieces of analysis target data among which a difference in ranks provided for each of the characteristic expressions is equal to or greater than a predetermined threshold.
摘要翻译：在分析多个数据的情况下，通过分析来分析包括优先进行比较和分析的差异的这些数据数据的部分，并抑制分析成本。文本挖掘系统包括分析对象数据对搜索单元，分析对象数据对搜索单元分别判断文本数据中的表达式是否共通，文本数据分别包括在多个分析目标数据中，分析视点生成单元，生成分析视点，该分析视点是以具有共同性的每个分析目标数据中提取表达式的条件，使得特征表达列表中的每一个是包括文本数据中满足预定条件的一组特征表达式在分析目标数据中，在分析目标数据中不同，正样本集合识别单元，其识别在每个分析目标数据中包括与生成的分析视点相匹配的表达式的肯定示例集合，特征数量计算单元，其计算表示度o的特征量 f表征每个分析目标数据中的每个表达式的正示例集合，以及特征表达式排序单元，其提取具有等于或大于预定阈值的计算特征量的表达式，并提供等级对于所计算的特征量的降序的提取的特征表达式，其中分析对象数据对搜索单元提取分析目标数据的分析视点，其中为每个特征表达式提供的等级的差等于或等于大于预定阈值。

2. 发明申请

US20120254071A1 TEXT MINING SYSTEM, TEXT MINING METHOD AND RECORDING MEDIUM 审中-公开
标题翻译：文本挖掘系统，文本挖掘方法和记录介质
公开(公告)号：US20120254071A1
公开(公告)日：2012-10-04
申请号：US13516641
申请日：2010-12-15
申请人： Kai Ishikawa , Shinichi Ando , Akihiro Tamura
发明人： Kai Ishikawa , Shinichi Ando , Akihiro Tamura
IPC分类号： G06Q40/00
CPC分类号： G06F16/34
摘要： Disclosed are a text mining system, text mining method, and recording medium for suppressing increase in cost of analysis for an analyst even if, when analyzing a plurality of data to be analyzed, the data are to be integrally analyzed. The text mining system comprises a data set generation unit for generating a data set to be analyzed that includes data to be analyzed that include text data; and a data set search unit for searching for a data set to be analyzed of which the feature representation coverage exceeds a value given beforehand, or the cost of analysis does not exceed a value given beforehand from data sets to be analyzed generated by the data set generation unit; wherein the feature representation coverage is the ratio of the number of feature representations included in a feature representation list which is a group of feature representations, which are representations satisfying predetermined conditions from text data within the data set to be analyzed, to the number of feature representations in all data to be analyzed; and the cost of analysis is defined on the basis of the number of feature representations included in the data set to be analyzed.
摘要翻译：公开了一种文本挖掘系统，文本挖掘方法和用于抑制分析者分析成本增加的记录介质，即使当分析要分析的多个数据时，数据将被整体分析。文本挖掘系统包括：数据集生成单元，用于生成包括文本数据的待分析数据的待分析数据集; 以及数据集搜索单元，用于搜索特征表示覆盖超过预先给出的值的分析数据集，或分析成本不超过由数据集生成的要分析的数据集预先给出的值发电机组; 其中特征表示覆盖是包括在作为一组特征表示的特征表示列表中的特征表示的数量的比例，其是满足预定条件的表示，从要分析的数据集中的文本数据到特征的数量所有要分析的数据的表示; 并且基于包括在要分析的数据集中的特征表示的数量来定义分析成本。

3. 发明授权

US08751531B2 Text mining apparatus, text mining method, and computer-readable recording medium 有权
标题翻译：文本挖掘装置，文本挖掘方法和计算机可读记录介质
公开(公告)号：US08751531B2
公开(公告)日：2014-06-10
申请号：US13060608
申请日：2009-08-28
申请人： Kai Ishikawa , Akihiro Tamura , Shinichi Ando
发明人： Kai Ishikawa , Akihiro Tamura , Shinichi Ando
IPC分类号： G06F17/30 , G06F17/27
CPC分类号： G06F17/277 , G10L15/26
摘要： A text mining apparatus, a text mining method, and a program are provided that accurately discriminate inherent portions of each of a plurality of text data pieces including a text data piece generated by computer processing.A text mining apparatus 1 to be used performs text mining using, as targets, a plurality of text data pieces including a text data piece generated by computer processing. Confidence is set for each of the text data pieces. The text mining apparatus 1 includes an inherent portion extraction unit 6 that extracts an inherent portion of each text data piece relative to another of the text data pieces, using the confidence set for each of the text data pieces.
摘要翻译：提供文本挖掘装置，文本挖掘方法和程序，其准确地区分包括通过计算机处理生成的文本数据片的多个文本数据的每一个的固有部分。要使用的文本挖掘装置1使用包括通过计算机处理产生的文本数据片的多个文本数据作为目标执行文本挖掘。为每个文本数据设置置信度。文本挖掘装置1包括固有部分提取单元6，其使用针对每个文本数据片段的置信度来提取相对于另一个文本数据片段的每个文本数据片段的固有部分。

4. 发明授权

US08886519B2 Text processing apparatus, text processing method, and computer-readable recording medium 有权
标题翻译：文本处理装置，文本处理方法和计算机可读记录介质
公开(公告)号：US08886519B2
公开(公告)日：2014-11-11
申请号：US13142302
申请日：2009-12-21
申请人： Akihiro Tamura , Kai Ishikawa , Shinichi Ando
发明人： Akihiro Tamura , Kai Ishikawa , Shinichi Ando
IPC分类号： G06F17/27 , G06F17/28
CPC分类号： G06F17/2827 , G06F17/2775
摘要： A text processing apparatus is provided with a segment determination unit 36 and a descriptive content determination unit 33. The segment determination unit 36 determines, with respect to a homogeneous segment that is similar to segments constituting a first text which is set as an analysis target (analysis target text) and that is included in another first text, whether the content thereof is included in a second text. The descriptive content determination unit 33 determines whether each segment constituting the analysis target text should be described in a corresponding second text, based on the determination result.
摘要翻译：文本处理装置具有段确定单元36和描述内容确定单元33.段确定单元36关于类似于构成作为分析目标的第一文本的段的均匀段确定（分析目标文本），并且其被包括在另一第一文本中，其内容是否包括在第二文本中。描述内容确定单元33基于确定结果来确定构成分析目标文本的每个片段是否应当以对应的第二文本进行描述。

5. 发明申请

US20120284016A1 TEXT MINING METHOD, TEXT MINING DEVICE AND TEXT MINING PROGRAM 有权
标题翻译：文本挖掘方法，文本挖掘设备和文本挖掘程序
公开(公告)号：US20120284016A1
公开(公告)日：2012-11-08
申请号：US13511504
申请日：2010-12-07
申请人： Akihiro Tamura , Kai Ishikawa , Shinichi Ando
发明人： Akihiro Tamura , Kai Ishikawa , Shinichi Ando
IPC分类号： G06F17/27
CPC分类号： G06F17/3061
摘要： Disclosed are a text mining method, device, and program capable of performing text mining with a specific topic as an object with high precision. An element identification unit calculates a feature degree, which is an index for indicating a degree that within a text set of interest, which is a set of text that is to be analyzed, an element of the text appears. An output unit identifies distinctive elements within the text set of interest on the basis of the calculated feature degree and outputs the identified elements. The element identification unit corrects the feature degree on the basis of a topic relatedness degree, which is a value indicating a degree related to a topic of analysis, which is a topic for which each text portion of the text being analyzed has been partitioned into predetermined units that are to be analyzed.
摘要翻译：公开了一种能够以特定主题作为对象以高精度执行文本挖掘的文本挖掘方法，设备和程序。元素识别单元计算特征度，该特征度是用于指示感兴趣的文本集合内的程度的索引，其是要被分析的一组文本，该文本的元素出现。输出单元基于所计算的特征度来识别所述文本集合内的不同元素，并输出所识别的元素。元素识别单元基于主题相关度来校正特征度，该主题相关度是指示与分析主题有关的程度的值，该分析题目是被分析的文本的每个文本部分已被划分为预定的要分析的单位。

6. 发明申请

US20120096029A1 INFORMATION ANALYSIS APPARATUS, INFORMATION ANALYSIS METHOD, AND COMPUTER READABLE STORAGE MEDIUM 审中-公开
标题翻译：信息分析设备，信息分析方法和计算机可读存储介质
公开(公告)号：US20120096029A1
公开(公告)日：2012-04-19
申请号：US13380735
申请日：2010-05-28
申请人： Akihiro Tamura , Kai Ishikawa , Shinichi Ando
发明人： Akihiro Tamura , Kai Ishikawa , Shinichi Ando
IPC分类号： G06F17/30
CPC分类号： G06F17/2765
摘要： An information analysis device (30) comprises a relevant portion identification unit (31) that compares analyzed target text with topic-related text that is written about the same event as the analyzed target text and includes information related to a specific topic, and that specifies a portion of the analyzed target text related to the topic-related text; a potential topic word extraction unit (32) that extracts a word of the specific portion; and a statistical model generation unit (33) that generates a statistical model that estimates a degree of appearance of a word on a specific topic of the analyzed target text. The statistical model generation unit (33) generates a statistical model such that degrees of appearance in a specific topic of the topic-related text word and of the extracted word are higher than those of other words.
摘要翻译：信息分析装置（30）包括相关部分识别单元（31），其将分析的目标文本与与分析的目标文本相关的事件写入的主题相关文本进行比较，并且包括与特定主题相关的信息，并且指定分析的目标文本的一部分与主题相关的文本相关; 提取特定部分的单词的潜在主题词提取单元（32）; 以及统计模型生成单元（33），其生成估计所分析的目标文本的特定主题上的单词的外观程度的统计模型。统计模型生成单元（33）生成统计模型，使得主题相关文本单词和提取单词的特定主题中的外观程度高于其他单词的程度。

7. 发明申请

US20110161368A1 TEXT MINING APPARATUS, TEXT MINING METHOD, AND COMPUTER-READABLE RECORDING MEDIUM 有权
标题翻译：文本采矿设备，文本挖掘方法和计算机可读记录介质
公开(公告)号：US20110161368A1
公开(公告)日：2011-06-30
申请号：US13060608
申请日：2009-08-28
申请人： Kai Ishikawa , Akihiro Tamura , Shinichi Ando
发明人： Kai Ishikawa , Akihiro Tamura , Shinichi Ando
IPC分类号： G06F17/30
CPC分类号： G06F17/277 , G10L15/26
摘要： A text mining apparatus, a text mining method, and a program are provided that accurately discriminate inherent portions of each of a plurality of text data pieces including a text data piece generated by computer processing.A text mining apparatus 1 to be used performs text mining using, as targets, a plurality of text data pieces including a text data piece generated by computer processing. Confidence is set for each of the text data pieces. The text mining apparatus 1 includes an inherent portion extraction unit 6 that extracts an inherent portion of each text data piece relative to another of the text data pieces, using the confidence set for each of the text data pieces.
摘要翻译：提供文本挖掘装置，文本挖掘方法和程序，其准确地区分包括通过计算机处理生成的文本数据片的多个文本数据的每一个的固有部分。要使用的文本挖掘装置1使用包括通过计算机处理产生的文本数据片的多个文本数据作为目标执行文本挖掘。为每个文本数据设置置信度。文本挖掘装置1包括固有部分提取单元6，其使用针对每个文本数据片段的置信度来提取相对于另一个文本数据片段的每个文本数据片段的固有部分。

8. 发明授权

US08805853B2 Text mining system for analysis target data, a text mining method for analysis target data and a recording medium for recording analysis target data 有权
标题翻译：用于分析目标数据的文本挖掘系统，用于分析目标数据的文本挖掘方法和用于记录分析目标数据的记录介质
公开(公告)号：US08805853B2
公开(公告)日：2014-08-12
申请号：US13518573
申请日：2010-12-15
申请人： Kai Ishikawa , Shinichi Ando , Akihiro Tamura
发明人： Kai Ishikawa , Shinichi Ando , Akihiro Tamura
IPC分类号： G06F7/00 , G06F17/30
CPC分类号： G06Q10/10
摘要： A text mining system including an analysis target search unit which judges whether a commonality in expressions among text data exists, an analysis viewpoint generation unit which generates an analysis viewpoint to extract an expression from the target data, a positive example set identification unit which identifies a positive example set including an expression matching the generated analysis viewpoint in the target data, a characteristic quantity calculation unit which calculates a characteristic quantity showing a degree of characterizing the positive example set of expressions in the target data, and a characteristic expression ranking unit which extracts expressions having the calculated characteristic quantity equal to or greater than a predetermined threshold as characteristic expressions and ranks the extracted characteristic expressions, and the target search unit extracts the analysis viewpoint among which a difference in ranks provided for the characteristic expressions is equal to or greater than a predetermined threshold.
摘要翻译：一种文本挖掘系统，包括分析对象搜索单元，其判断是否存在文本数据中的表达式中的共同性，分析视点生成单元，生成从目标数据中提取表达式的分析视点;正示例集识别单元，包括与目标数据中生成的分析视点相匹配的表达式的正示例集合;特征量计算单元，计算表示目标数据中的表达式的正示例集合的特征度的特征量;以及特征表达式排序单元，其提取具有等于或大于预定阈值的计算特征量的表达式作为特征表达式并对所提取的特征表达式进行排序，并且目标搜索单元提取其中为特征表达式提供的等级的差异为等于或大于预定阈值。

9. 发明授权

US08380741B2 Text mining apparatus, text mining method, and computer-readable recording medium 有权
标题翻译：文本挖掘装置，文本挖掘方法和计算机可读记录介质
公开(公告)号：US08380741B2
公开(公告)日：2013-02-19
申请号：US13060587
申请日：2009-08-28
申请人： Kai Ishikawa , Akihiro Tamura , Shinichi Ando
发明人： Kai Ishikawa , Akihiro Tamura , Shinichi Ando
IPC分类号： G06F17/30
CPC分类号： G06F17/277 , G10L15/26
摘要： A text mining apparatus, a text mining method, and a program are provided that enable the influence that computer processing errors have on mining results to be reduced during text mining performed on a plurality of text data pieces including a text data piece generated by computer processing. A text mining apparatus 1 to be used includes an inherent portion extraction unit 6 that, for each of a plurality of text data pieces including a text data piece generated by computer processing, extracts an inherent portion of the text data piece relative to another of the text data pieces, an inherent confidence setting unit 7 that, for each inherent portion of each of the text data pieces, sets inherent confidence indicating confidence of the inherent portion, using the confidence that has been set for each of the text data pieces, and a mining processing unit 8 that performs text mining on each inherent portion of each of the text data pieces, using the inherent confidence.
摘要翻译：提供了一种文本挖掘装置，文本挖掘方法和程序，其能够在对包括通过计算机处理产生的文本数据片段的多个文本数据片段执行的文本挖掘期间减少计算机处理错误对挖掘结果的影响。要使用的文本挖掘装置1包括：固有部分提取单元6，对于包括通过计算机处理产生的文本数据片的多个文本数据片段中的每一个，提取文本数据片段中的另一个的固有部分文本数据片段，固有置信度设置单元7，对于每个文本数据片段的每个固有部分，使用为每个文本数据片段设置的置信度来设置表示固有部分的置信度的固有置信度，以及采用处理单元8，其使用固有置信度对每个文本数据的每个固有部分执行文本挖掘。

10. 发明申请

US20110282653A1 TEXT PROCESSING APPARATUS, TEXT PROCESSING METHOD, AND COMPUTER-READABLE RECORDING MEDIUM 有权
标题翻译：文字处理设备，文本处理方法和计算机可读记录介质
公开(公告)号：US20110282653A1
公开(公告)日：2011-11-17
申请号：US13142302
申请日：2009-12-21
申请人： Akihiro Tamura , Kai Ishikawa , Shinichi Ando
发明人： Akihiro Tamura , Kai Ishikawa , Shinichi Ando
IPC分类号： G06F17/27
CPC分类号： G06F17/2827 , G06F17/2775
摘要： A text processing apparatus is provided with a segment determination unit 36 and a descriptive content determination unit 33. The segment determination unit 36 determines, with respect to a homogeneous segment that is similar to segments constituting a first text which is set as an analysis target (analysis target text) and that is included in another first text, whether the content thereof is included in a second text. The descriptive content determination unit 33 determines whether each segment constituting the analysis target text should be described in a corresponding second text, based on the determination result.
摘要翻译：文本处理装置具有段确定单元36和描述内容确定单元33.段确定单元36关于类似于构成作为分析目标的第一文本的段的均匀段确定（分析目标文本），并且其被包括在另一第一文本中，其内容是否包括在第二文本中。描述内容确定单元33基于确定结果来确定构成分析目标文本的每个片段是否应当以对应的第二文本进行描述。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式