NLP -- BaiDu

Author： harrytsz
发布时间：August 26, 2019
1309 views
No comments
8999 words
Categories： NLP

新建 AipNlp:

AipNlp 是自然语言处理的 Python SDK 客户端，为使用自然语言处理的开发人员提供了一系列的交互方法。参考如下代码新建一个 AipNlp:

from aip import AipNlp
""" 你的 APPID AK SK """
APP_ID = '##########'                                  #'你的 APP ID'
API_KEY = '##########'                #'你的 Api key'
SECRET_KEY = '##########'   #'你的 Secret key'

client = AipNlp(APP_ID, API_KEY, SECRET_KEY)

配置AipNlp:

如果用户需要配置 AipNlp 的网络请求参数（一般不需要配置），可以在构造 AipNlp 之后调用接口设置参数，目前只支持以下参数：

接口	说明
setConnectionTimeoutInMillis	建立连接的超时时间（单位：毫秒）
setSocketTimeoutInMillis	通过打开的连接传输数据的超时时间（单位：毫秒）

接口说明：

词法分析：

词法分析接口向用户提供分词、词性标注、专名识别三大功能；能够识别出文本串中的基本词汇（分词），对这些词汇进行重组、标注组合后词汇的词性，并进一步识别出命名实体。

text = "百度是一家高科技公司"

""" 调用词法分析 """
client.lexer(text)

{'log_id': 3174179683102561622,
 'text': '百度是一家高科技公司',
 'items': [{'loc_details': [],
   'byte_offset': 0,
   'uri': '',
   'pos': '',
   'ne': 'ORG',
   'item': '百度',
   'basic_words': ['百度'],
   'byte_length': 4,
   'formal': ''},
  {'loc_details': [],
   'byte_offset': 4,
   'uri': '',
   'pos': 'v',
   'ne': '',
   'item': '是',
   'basic_words': ['是'],
   'byte_length': 2,
   'formal': ''},
  {'loc_details': [],
   'byte_offset': 6,
   'uri': '',
   'pos': 'm',
   'ne': '',
   'item': '一家',
   'basic_words': ['一', '家'],
   'byte_length': 4,
   'formal': ''},
  {'loc_details': [],
   'byte_offset': 10,
   'uri': '',
   'pos': 'n',
   'ne': '',
   'item': '高科技',
   'basic_words': ['高', '科技'],
   'byte_length': 6,
   'formal': ''},
  {'loc_details': [],
   'byte_offset': 16,
   'uri': '',
   'pos': 'n',
   'ne': '',
   'item': '公司',
   'basic_words': ['公司'],
   'byte_length': 4,
   'formal': ''}]}

词法分析（定制版）

text = "百度是一家高科技公司"

""" 调用词法分析（定制版）"""
client.lexerCustom(text)

{'log_id': 1030687273146384758,
 'items': [{'loc_details': [],
   'byte_offset': 0,
   'uri': '',
   'ne': 'ORG',
   'basic_words': ['百度'],
   'item': '百度',
   'pos': '',
   'byte_length': 4,
   'formal': ''},
  {'loc_details': [],
   'byte_offset': 4,
   'uri': '',
   'ne': '',
   'basic_words': ['是'],
   'item': '是',
   'pos': 'v',
   'byte_length': 2,
   'formal': ''},
  {'loc_details': [],
   'byte_offset': 6,
   'uri': '',
   'ne': '',
   'basic_words': ['一', '家'],
   'item': '一家',
   'pos': 'm',
   'byte_length': 4,
   'formal': ''},
  {'loc_details': [],
   'byte_offset': 10,
   'uri': '',
   'ne': '',
   'basic_words': ['高', '科技'],
   'item': '高科技',
   'pos': 'n',
   'byte_length': 6,
   'formal': ''},
  {'loc_details': [],
   'byte_offset': 16,
   'uri': '',
   'ne': '',
   'basic_words': ['公司'],
   'item': '公司',
   'pos': 'n',
   'byte_length': 4,
   'formal': ''}],
 'text': '百度是一家高科技公司'}

依存句法分析

依存句法分析接口可自动分析文本中的依存句法结构信息，哦拥句子中词与词之间的依存关系来表示词语的句法结构信息（如“主谓”、“动宾”、“定中”等结构关系），并用树状结构来表示整句的结构（如“主谓宾”、“定状补”等）。

text = "今天天气怎么样"

""" 调用依存句法分析 """
client.depParser(text)

""" 如果有可选参数 """
options = {}
options["mode"] = 1

""" 带参数调用依存句法分析 """
client.depParser(text, options)

{'log_id': 6738947376011839670,
 'text': '今天天气怎么样',
 'items': [{'postag': 't', 'head': 2, 'word': '今天', 'id': 1, 'deprel': 'ATT'},
  {'postag': 'n', 'head': 3, 'word': '天气', 'id': 2, 'deprel': 'SBV'},
  {'postag': 'r', 'head': 0, 'word': '怎么样', 'id': 3, 'deprel': 'HED'}]}

词向量表示

词向量表示接口提供中文词向量的查询功能。

word = "张飞"

""" 调用词向量表示 """
client.wordEmbedding(word)

{'log_id': 1696656248514338902,
 'word': '张飞',
 'vec': [-0.290384,
  -0.276273,
  0.302719,
  0.7209,
  -0.0765072,
  0.31901,
  0.270633,
  0.795086,
  -0.203823,
  -0.125412,
  0.45416,
  -0.172919,
  0.295541,
  -0.216173,
  ...]}

DNN 语言模型

中文 DNN 语言模型接口用于输出切词结果并给出每个词在句子中的概率值，判断一句话是否符合语言表达习惯。

text = "床前明月光"

""" 调用 DNN 语言模型 """
client.dnnlm(text)

{'log_id': 8461893498410162902,
 'text': '床前明月光',
 'items': [{'word': '床', 'prob': 3.85273e-05},
  {'word': '前', 'prob': 0.0289018},
  {'word': '明月', 'prob': 0.0284406},
  {'word': '光', 'prob': 0.808029}],
 'ppl': 79.0651}

词意相似度

输入两个词，得到两个词的相似度结果。

word1 = "北京"
word2 = "上海"

""" 调用词义相似度 """
client.wordSimEmbedding(word1, word2)

""" 如果有可选参数 """
options = {}
options["mode"] = 0

""" 带参数调用词义相似度 """
client.wordSimEmbedding(word1, word2, options)

{'log_id': 1841062063069490934,
 'error_code': 282004,
 'error_msg': 'invalid parameter(s)'}

短文本相似度

text1 = "浙富股份"
text2 = "万事通自考网"

""" 调用短文本相似度 """
client.simnet(text1, text2)

""" 如果有可选参数 """
options = {}
options["model"] = "CNN"

""" 带参数调用短文本相似度 """
client.simnet(text1, text2, options)

{'log_id': 8759613961966585046,
 'texts': {'text_2': '万事通自考网', 'text_1': '浙富股份'},
 'score': 0.0549339}

评论观点抽取

评论观点抽取接口用来提取一条评论句子的关注点和评论观点，并输出评论观点标签以及评论观点极性。

text = "三星电脑电池不给力"

""" 调用评论观点抽取 """
client.commentTag(text)

""" 如果有可选参数 """
options = {}
options["type"] = 13

""" 带参数调用评论观点抽取 """
client.commentTag(text, options)

{'log_id': 8426923826378164630,
 'items': [{'sentiment': 0,
   'abstract': '三星电脑<span>电池不给力</span>',
   'prop': '电池',
   'begin_pos': 8,
   'end_pos': 18,
   'adj': '不给力'}]}

情感倾向分析

对包含主观观点信息的文本进行情感极性类别（积极、消极、中性）的判断，并给出相应的置信度。

text = "苹果是一家伟大公司"

""" 调用情感倾向分析 """
client.sentimentClassify(text)

{'log_id': 7415487462125078582,
 'text': '苹果是一家伟大公司',
 'items': [{'positive_prob': 0.691839,
   'confidence': 0.315198,
   'negative_prob': 0.308161,
   'sentiment': 2}]}

文章标签

文章标签服务能够针对网络各类媒体文章进行快速的内容理解，根据输入含有标题的文章，输出多个内容标签以及对应的置信度，用于个性化推荐、相似文章聚合、文本内容分析等场景。

title = "iphone手机出现“白苹果”原因及解决办法，用苹果手机的可以看下"
content = "如果下面的方法还是没有解决你的问题建议来我们门店看下成都市锦江区红星路三段99号银石广场24层01室。"

""" 调用文章标签 """
client.keyword(title, content)

{'log_id': 4313909132996888022,
 'items': [{'score': 0.99775, 'tag': 'iphone'},
  {'score': 0.862602, 'tag': '手机'},
  {'score': 0.845657, 'tag': '苹果'},
  {'score': 0.837886, 'tag': '苹果公司'},
  {'score': 0.811601, 'tag': '白苹果'},
  {'score': 0.797911, 'tag': '数码'}]}

文章分类

对文章按照内容类型进行自动分类，首批支持娱乐、体育、科技等26个主流内容类型，文本内容分析等应用提供基础技术支持。

title = "欧洲冠军杯足球赛"

content = "欧洲冠军联赛是欧洲足球协会联盟主办的年度足球比赛，代表欧洲俱乐部足球最高荣誉和水平，被认为是全世界最高素质、最具影响力以及最高水平的俱乐部赛事，亦是世界上奖金最高的足球赛事和体育赛事之一。"

""" 调用文章分类 """
client.topic(title, content)

{'log_id': 2207187729196380118,
 'item': {'lv2_tag_list': [{'score': 0.915631, 'tag': '足球'},
   {'score': 0.803507, 'tag': '国际足球'},
   {'score': 0.77813, 'tag': '英超'}],
  'lv1_tag_list': [{'score': 0.830915, 'tag': '体育'}]}}

文本纠错

识别输入文本中有错误的片段，提示错误并给出正确的文本结果。支持短文本、长文本、语音等内容的错误识别，纠错是搜索引擎、语音识别、内容审查等功能更好运行的基础模块之一。

text = "百度是一家仁工智能公司"

""" 调用文本纠错 """
client.ecnet(text)

{'log_id': 4819268271360271574,
 'item': {'vec_fragment': [{'ori_frag': '仁工',
    'begin_pos': 10,
    'correct_frag': '人工',
    'end_pos': 14}],
  'score': 0.529867,
  'correct_query': '百度是一家人工智能公司'},
 'text': '百度是一家仁工智能公司'}

对话情绪识别接口

针对用户日常沟通文本背后所蕴含情绪的一种直观检测，可自动识别出当前会话者所表现出的情绪类别及其置信度，可以帮助企业更全面地把握产品服务质量、监控客户服务质量。

text = "本来今天高高兴兴"

""" 调用对话情绪识别接口 """
client.emotion(text)
""" 如果有可选参数 """
options = {}
options["scene"] = "talk"

""" 带参数调用对话情绪识别接口 """
client.emotion(text, options)

{'log_id': 901856600521512694,
 'text': '本来今天高高兴兴',
 'items': [{'subitems': [{'prob': 0.501008, 'label': 'happy'}],
   'replies': ['你的笑声真欢乐'],
   'prob': 0.501008,
   'label': 'optimistic'},
  {'subitems': [], 'replies': [], 'prob': 0.49872, 'label': 'neutral'},
  {'subitems': [],
   'replies': [],
   'prob': 0.000272128,
   'label': 'pessimistic'}]}

新闻摘要接口

自动抽取新闻文本中的关键信息，进而生成指定长度的新闻摘要。

content = "麻省理工学院的研究团队为无人机在仓库中使用RFID技术进行库存查找等工作，创造了一种..."

maxSummaryLen = 300

""" 调用新闻摘要接口 """
client.newsSummary(content, maxSummaryLen);

""" 如果有可选参数 """
options = {}
options["title"] = "标题"

""" 带参数调用新闻摘要接口 """
client.newsSummary(content, maxSummaryLen, options)

{'error_code': 6, 'error_msg': 'No permission to access data'}

Last modification：June 20, 2021

如果觉得我的文章对你有用，请随意赞赏

NLP -- BaiDu

harrytsz • 2019 年 08 月 26 日

新建 AipNlp:AipNlp 是自然语言处理的 Python SDK 客户端，为使用自然语言处理的开发人员提供了一系列的交互方法。参考如下代码新建一个 AipNlp:<pre><code class="lang-python">from aip import AipNlp
&quot;&quot;&quot; 你的 APPID AK SK &quot;&quot;&quot;
APP_ID = &#039;##########&#039; #&#039;你的 APP ID&#039;
API_KEY = &#039;##########&#039; #&#039;你的 Api key&#039;
SECRET_KEY = &#039;##########&#039; #&#039;你的 Secret key&#039;

client = AipNlp(APP_ID, API_KEY, SECRET_KEY)
</code></pre>配置AipNlp:如果用户需要配置 AipNlp 的网络请求参数（一般不需要配置），可以在构造 AipNlp 之后调用接口设置参数，目前只支持以下参数：<table><thead><tr><th align="center">接口</th><th align="center">说明</th></tr></thead><tbody><tr><td align="center">setConnectionTimeoutInMillis</td><td align="center">建立连接的超时时间（单位：毫秒）</td></tr><tr><td align="center">setSocketTimeoutInMillis</td><td align="center">通过打开的连接传输数据的超时时间（单位：毫秒）</td></tr></tbody></table>接口说明：词法分析：词法分析接口向用户提供分词、词性标注、专名识别三大功能；能够识别出文本串中的基本词汇（分词），对这些词汇进行重组、标注组合后词汇的词性，并进一步识别出命名实体。<pre><code class="lang-python">text = &quot;百度是一家高科技公司&quot;

&quot;&quot;&quot; 调用词法分析 &quot;&quot;&quot;
client.lexer(text)</code></pre><pre><code>{&#039;log_id&#039;: 3174179683102561622,
 &#039;text&#039;: &#039;百度是一家高科技公司&#039;,
 &#039;items&#039;: [{&#039;loc_details&#039;: [],
 &#039;byte_offset&#039;: 0,
 &#039;uri&#039;: &#039;&#039;,
 &#039;pos&#039;: &#039;&#039;,
 &#039;ne&#039;: &#039;ORG&#039;,
 &#039;item&#039;: &#039;百度&#039;,
 &#039;basic_words&#039;: [&#039;百度&#039;],
 &#039;byte_length&#039;: 4,
 &#039;formal&#039;: &#039;&#039;},
 {&#039;loc_details&#039;: [],
 &#039;byte_offset&#039;: 4,
 &#039;uri&#039;: &#039;&#039;,
 &#039;pos&#039;: &#039;v&#039;,
 &#039;ne&#039;: &#039;&#039;,
 &#039;item&#039;: &#039;是&#039;,
 &#039;basic_words&#039;: [&#039;是&#039;],
 &#039;byte_length&#039;: 2,
 &#039;formal&#039;: &#039;&#039;},
 {&#039;loc_details&#039;: [],
 &#039;byte_offset&#039;: 6,
 &#039;uri&#039;: &#039;&#039;,
 &#039;pos&#039;: &#039;m&#039;,
 &#039;ne&#039;: &#039;&#039;,
 &#039;item&#039;: &#039;一家&#039;,
 &#039;basic_words&#039;: [&#039;一&#039;, &#039;家&#039;],
 &#039;byte_length&#039;: 4,
 &#039;formal&#039;: &#039;&#039;},
 {&#039;loc_details&#039;: [],
 &#039;byte_offset&#039;: 10,
 &#039;uri&#039;: &#039;&#039;,
 &#039;pos&#039;: &#039;n&#039;,
 &#039;ne&#039;: &#039;&#039;,
 &#039;item&#039;: &#039;高科技&#039;,
 &#039;basic_words&#039;: [&#039;高&#039;, &#039;科技&#039;],
 &#039;byte_length&#039;: 6,
 &#039;formal&#039;: &#039;&#039;},
 {&#039;loc_details&#039;: [],
 &#039;byte_offset&#039;: 16,
 &#039;uri&#039;: &#039;&#039;,
 &#039;pos&#039;: &#039;n&#039;,
 &#039;ne&#039;: &#039;&#039;,
 &#039;item&#039;: &#039;公司&#039;,
 &#039;basic_words&#039;: [&#039;公司&#039;],
 &#039;byte_length&#039;: 4,
 &#039;formal&#039;: &#039;&#039;}]}

</code></pre>词法分析（定制版）<pre><code class="lang-python">text = &quot;百度是一家高科技公司&quot;

&quot;&quot;&quot; 调用词法分析（定制版）&quot;&quot;&quot;
client.lexerCustom(text)</code></pre><pre><code>{&#039;log_id&#039;: 1030687273146384758,
 &#039;items&#039;: [{&#039;loc_details&#039;: [],
 &#039;byte_offset&#039;: 0,
 &#039;uri&#039;: &#039;&#039;,
 &#039;ne&#039;: &#039;ORG&#039;,
 &#039;basic_words&#039;: [&#039;百度&#039;],
 &#039;item&#039;: &#039;百度&#039;,
 &#039;pos&#039;: &#039;&#039;,
 &#039;byte_length&#039;: 4,
 &#039;formal&#039;: &#039;&#039;},
 {&#039;loc_details&#039;: [],
 &#039;byte_offset&#039;: 4,
 &#039;uri&#039;: &#039;&#039;,
 &#039;ne&#039;: &#039;&#039;,
 &#039;basic_words&#039;: [&#039;是&#039;],
 &#039;item&#039;: &#039;是&#039;,
 &#039;pos&#039;: &#039;v&#039;,
 &#039;byte_length&#039;: 2,
 &#039;formal&#039;: &#039;&#039;},
 {&#039;loc_details&#039;: [],
 &#039;byte_offset&#039;: 6,
 &#039;uri&#039;: &#039;&#039;,
 &#039;ne&#039;: &#039;&#039;,
 &#039;basic_words&#039;: [&#039;一&#039;, &#039;家&#039;],
 &#039;item&#039;: &#039;一家&#039;,
 &#039;pos&#039;: &#039;m&#039;,
 &#039;byte_length&#039;: 4,
 &#039;formal&#039;: &#039;&#039;},
 {&#039;loc_details&#039;: [],
 &#039;byte_offset&#039;: 10,
 &#039;uri&#039;: &#039;&#039;,
 &#039;ne&#039;: &#039;&#039;,
 &#039;basic_words&#039;: [&#039;高&#039;, &#039;科技&#039;],
 &#039;item&#039;: &#039;高科技&#039;,
 &#039;pos&#039;: &#039;n&#039;,
 &#039;byte_length&#039;: 6,
 &#039;formal&#039;: &#039;&#039;},
 {&#039;loc_details&#039;: [],
 &#039;byte_offset&#039;: 16,
 &#039;uri&#039;: &#039;&#039;,
 &#039;ne&#039;: &#039;&#039;,
 &#039;basic_words&#039;: [&#039;公司&#039;],
 &#039;item&#039;: &#039;公司&#039;,
 &#039;pos&#039;: &#039;n&#039;,
 &#039;byte_length&#039;: 4,
 &#039;formal&#039;: &#039;&#039;}],
 &#039;text&#039;: &#039;百度是一家高科技公司&#039;}

</code></pre>依存句法分析依存句法分析接口可自动分析文本中的依存句法结构信息，哦拥句子中词与词之间的依存关系来表示词语的句法结构信息（如“主谓”、“动宾”、“定中”等结构关系），并用树状结构来表示整句的结构（如“主谓宾”、“定状补”等）。<pre><code class="lang-python">text = &quot;今天天气怎么样&quot;

&quot;&quot;&quot; 调用依存句法分析 &quot;&quot;&quot;
client.depParser(text)

&quot;&quot;&quot; 如果有可选参数 &quot;&quot;&quot;
options = {}
options[&quot;mode&quot;] = 1

&quot;&quot;&quot; 带参数调用依存句法分析 &quot;&quot;&quot;
client.depParser(text, options)</code></pre><pre><code>{&#039;log_id&#039;: 6738947376011839670,
 &#039;text&#039;: &#039;今天天气怎么样&#039;,
 &#039;items&#039;: [{&#039;postag&#039;: &#039;t&#039;, &#039;head&#039;: 2, &#039;word&#039;: &#039;今天&#039;, &#039;id&#039;: 1, &#039;deprel&#039;: &#039;ATT&#039;},
 {&#039;postag&#039;: &#039;n&#039;, &#039;head&#039;: 3, &#039;word&#039;: &#039;天气&#039;, &#039;id&#039;: 2, &#039;deprel&#039;: &#039;SBV&#039;},
 {&#039;postag&#039;: &#039;r&#039;, &#039;head&#039;: 0, &#039;word&#039;: &#039;怎么样&#039;, &#039;id&#039;: 3, &#039;deprel&#039;: &#039;HED&#039;}]}

</code></pre>词向量表示词向量表示接口提供中文词向量的查询功能。<pre><code class="lang-python">word = &quot;张飞&quot;

&quot;&quot;&quot; 调用词向量表示 &quot;&quot;&quot;
client.wordEmbedding(word)</code></pre><pre><code>{&#039;log_id&#039;: 1696656248514338902,
 &#039;word&#039;: &#039;张飞&#039;,
 &#039;vec&#039;: [-0.290384,
 -0.276273,
 0.302719,
 0.7209,
 -0.0765072,
 0.31901,
 0.270633,
 0.795086,
 -0.203823,
 -0.125412,
 0.45416,
 -0.172919,
 0.295541,
 -0.216173,
 ...]}

</code></pre>DNN 语言模型中文 DNN 语言模型接口用于输出切词结果并给出每个词在句子中的概率值，判断一句话是否符合语言表达习惯。<pre><code class="lang-python">text = &quot;床前明月光&quot;

&quot;&quot;&quot; 调用 DNN 语言模型 &quot;&quot;&quot;
client.dnnlm(text)</code></pre><pre><code>{&#039;log_id&#039;: 8461893498410162902,
 &#039;text&#039;: &#039;床前明月光&#039;,
 &#039;items&#039;: [{&#039;word&#039;: &#039;床&#039;, &#039;prob&#039;: 3.85273e-05},
 {&#039;word&#039;: &#039;前&#039;, &#039;prob&#039;: 0.0289018},
 {&#039;word&#039;: &#039;明月&#039;, &#039;prob&#039;: 0.0284406},
 {&#039;word&#039;: &#039;光&#039;, &#039;prob&#039;: 0.808029}],
 &#039;ppl&#039;: 79.0651}

</code></pre>词意相似度输入两个词，得到两个词的相似度结果。<pre><code class="lang-python">word1 = &quot;北京&quot;
word2 = &quot;上海&quot;

&quot;&quot;&quot; 调用词义相似度 &quot;&quot;&quot;
client.wordSimEmbedding(word1, word2)

&quot;&quot;&quot; 如果有可选参数 &quot;&quot;&quot;
options = {}
options[&quot;mode&quot;] = 0

&quot;&quot;&quot; 带参数调用词义相似度 &quot;&quot;&quot;
client.wordSimEmbedding(word1, word2, options)</code></pre><pre><code>{&#039;log_id&#039;: 1841062063069490934,
 &#039;error_code&#039;: 282004,
 &#039;error_msg&#039;: &#039;invalid parameter(s)&#039;}

</code></pre>短文本相似度<pre><code class="lang-python">text1 = &quot;浙富股份&quot;
text2 = &quot;万事通自考网&quot;

&quot;&quot;&quot; 调用短文本相似度 &quot;&quot;&quot;
client.simnet(text1, text2)

&quot;&quot;&quot; 如果有可选参数 &quot;&quot;&quot;
options = {}
options[&quot;model&quot;] = &quot;CNN&quot;

&quot;&quot;&quot; 带参数调用短文本相似度 &quot;&quot;&quot;
client.simnet(text1, text2, options)</code></pre><pre><code>{&#039;log_id&#039;: 8759613961966585046,
 &#039;texts&#039;: {&#039;text_2&#039;: &#039;万事通自考网&#039;, &#039;text_1&#039;: &#039;浙富股份&#039;},
 &#039;score&#039;: 0.0549339}

</code></pre>评论观点抽取评论观点抽取接口用来提取一条评论句子的关注点和评论观点，并输出评论观点标签以及评论观点极性。<pre><code class="lang-python">text = &quot;三星电脑电池不给力&quot;

&quot;&quot;&quot; 调用评论观点抽取 &quot;&quot;&quot;
client.commentTag(text)

&quot;&quot;&quot; 如果有可选参数 &quot;&quot;&quot;
options = {}
options[&quot;type&quot;] = 13

&quot;&quot;&quot; 带参数调用评论观点抽取 &quot;&quot;&quot;
client.commentTag(text, options)</code></pre><pre><code>{&#039;log_id&#039;: 8426923826378164630,
 &#039;items&#039;: [{&#039;sentiment&#039;: 0,
 &#039;abstract&#039;: &#039;三星电脑&lt;span&gt;电池不给力&lt;/span&gt;&#039;,
 &#039;prop&#039;: &#039;电池&#039;,
 &#039;begin_pos&#039;: 8,
 &#039;end_pos&#039;: 18,
 &#039;adj&#039;: &#039;不给力&#039;}]}

</code></pre>情感倾向分析对包含主观观点信息的文本进行情感极性类别（积极、消极、中性）的判断，并给出相应的置信度。<pre><code class="lang-python">text = &quot;苹果是一家伟大公司&quot;

&quot;&quot;&quot; 调用情感倾向分析 &quot;&quot;&quot;
client.sentimentClassify(text)</code></pre><pre><code>{&#039;log_id&#039;: 7415487462125078582,
 &#039;text&#039;: &#039;苹果是一家伟大公司&#039;,
 &#039;items&#039;: [{&#039;positive_prob&#039;: 0.691839,
 &#039;confidence&#039;: 0.315198,
 &#039;negative_prob&#039;: 0.308161,
 &#039;sentiment&#039;: 2}]}

</code></pre>文章标签文章标签服务能够针对网络各类媒体文章进行快速的内容理解，根据输入含有标题的文章，输出多个内容标签以及对应的置信度，用于个性化推荐、相似文章聚合、文本内容分析等场景。<pre><code class="lang-python">title = &quot;iphone手机出现“白苹果”原因及解决办法，用苹果手机的可以看下&quot;
content = &quot;如果下面的方法还是没有解决你的问题建议来我们门店看下成都市锦江区红星路三段99号银石广场24层01室。&quot;

&quot;&quot;&quot; 调用文章标签 &quot;&quot;&quot;
client.keyword(title, content)</code></pre><pre><code>{&#039;log_id&#039;: 4313909132996888022,
 &#039;items&#039;: [{&#039;score&#039;: 0.99775, &#039;tag&#039;: &#039;iphone&#039;},
 {&#039;score&#039;: 0.862602, &#039;tag&#039;: &#039;手机&#039;},
 {&#039;score&#039;: 0.845657, &#039;tag&#039;: &#039;苹果&#039;},
 {&#039;score&#039;: 0.837886, &#039;tag&#039;: &#039;苹果公司&#039;},
 {&#039;score&#039;: 0.811601, &#039;tag&#039;: &#039;白苹果&#039;},
 {&#039;score&#039;: 0.797911, &#039;tag&#039;: &#039;数码&#039;}]}

</code></pre>文章分类对文章按照内容类型进行自动分类，首批支持娱乐、体育、科技等26个主流内容类型，文本内容分析等应用提供基础技术支持。<pre><code class="lang-python">title = &quot;欧洲冠军杯足球赛&quot;

content = &quot;欧洲冠军联赛是欧洲足球协会联盟主办的年度足球比赛，代表欧洲俱乐部足球最高荣誉和水平，被认为是全世界最高素质、最具影响力以及最高水平的俱乐部赛事，亦是世界上奖金最高的足球赛事和体育赛事之一。&quot;

&quot;&quot;&quot; 调用文章分类 &quot;&quot;&quot;
client.topic(title, content)</code></pre><pre><code>{&#039;log_id&#039;: 2207187729196380118,
 &#039;item&#039;: {&#039;lv2_tag_list&#039;: [{&#039;score&#039;: 0.915631, &#039;tag&#039;: &#039;足球&#039;},
 {&#039;score&#039;: 0.803507, &#039;tag&#039;: &#039;国际足球&#039;},
 {&#039;score&#039;: 0.77813, &#039;tag&#039;: &#039;英超&#039;}],
 &#039;lv1_tag_list&#039;: [{&#039;score&#039;: 0.830915, &#039;tag&#039;: &#039;体育&#039;}]}}

</code></pre>文本纠错识别输入文本中有错误的片段，提示错误并给出正确的文本结果。支持短文本、长文本、语音等内容的错误识别，纠错是搜索引擎、语音识别、内容审查等功能更好运行的基础模块之一。<pre><code class="lang-python">text = &quot;百度是一家仁工智能公司&quot;

&quot;&quot;&quot; 调用文本纠错 &quot;&quot;&quot;
client.ecnet(text)</code></pre><pre><code>{&#039;log_id&#039;: 4819268271360271574,
 &#039;item&#039;: {&#039;vec_fragment&#039;: [{&#039;ori_frag&#039;: &#039;仁工&#039;,
 &#039;begin_pos&#039;: 10,
 &#039;correct_frag&#039;: &#039;人工&#039;,
 &#039;end_pos&#039;: 14}],
 &#039;score&#039;: 0.529867,
 &#039;correct_query&#039;: &#039;百度是一家人工智能公司&#039;},
 &#039;text&#039;: &#039;百度是一家仁工智能公司&#039;}

</code></pre>对话情绪识别接口针对用户日常沟通文本背后所蕴含情绪的一种直观检测，可自动识别出当前会话者所表现出的情绪类别及其置信度，可以帮助企业更全面地把握产品服务质量、监控客户服务质量。<pre><code class="lang-python">text = &quot;本来今天高高兴兴&quot;

&quot;&quot;&quot; 调用对话情绪识别接口 &quot;&quot;&quot;
client.emotion(text)
&quot;&quot;&quot; 如果有可选参数 &quot;&quot;&quot;
options = {}
options[&quot;scene&quot;] = &quot;talk&quot;

&quot;&quot;&quot; 带参数调用对话情绪识别接口 &quot;&quot;&quot;
client.emotion(text, options)</code></pre><pre><code>{&#039;log_id&#039;: 901856600521512694,
 &#039;text&#039;: &#039;本来今天高高兴兴&#039;,
 &#039;items&#039;: [{&#039;subitems&#039;: [{&#039;prob&#039;: 0.501008, &#039;label&#039;: &#039;happy&#039;}],
 &#039;replies&#039;: [&#039;你的笑声真欢乐&#039;],
 &#039;prob&#039;: 0.501008,
 &#039;label&#039;: &#039;optimistic&#039;},
 {&#039;subitems&#039;: [], &#039;replies&#039;: [], &#039;prob&#039;: 0.49872, &#039;label&#039;: &#039;neutral&#039;},
 {&#039;subitems&#039;: [],
 &#039;replies&#039;: [],
 &#039;prob&#039;: 0.000272128,
 &#039;label&#039;: &#039;pessimistic&#039;}]}

</code></pre>新闻摘要接口自动抽取新闻文本中的关键信息，进而生成指定长度的新闻摘要。<pre><code class="lang-python">content = &quot;麻省理工学院的研究团队为无人机在仓库中使用RFID技术进行库存查找等工作，创造了一种...&quot;

maxSummaryLen = 300

&quot;&quot;&quot; 调用新闻摘要接口 &quot;&quot;&quot;
client.newsSummary(content, maxSummaryLen);

&quot;&quot;&quot; 如果有可选参数 &quot;&quot;&quot;
options = {}
options[&quot;title&quot;] = &quot;标题&quot;

&quot;&quot;&quot; 带参数调用新闻摘要接口 &quot;&quot;&quot;
client.newsSummary(content, maxSummaryLen, options)</code></pre><pre><code>{&#039;error_code&#039;: 6, &#039;error_msg&#039;: &#039;No permission to access data&#039;}
</code></pre>

NLP -- BaiDu

Leave a Comment Cancel reply
使用cookie技术保留您的个人信息以便您下次快速评论，继续评论表示您已同意该条款

神经网络是什么？如何直观理解它的能力极限？它是如何无限逼近真理？

01 | 技术架构：深度学习推荐系统的经典技术架构长啥样？

CodeForces -- Domino piling

概率图模型理论与应用

光学基础知识：焦点、弥散圆、景深、焦深

CUDA编程入门极简教程

JVM 内存结构 VS Java内存模型 VS Java对象模型

MCMC 蒙特卡罗方法 (一)

Python中jieba中文分词库的使用

C++ STL priority_queue容器适配器详解

NLP -- BaiDu

Leave a Comment Cancel reply 使用cookie技术保留您的个人信息以便您下次快速评论，继续评论表示您已同意该条款

NLP -- BaiDu

Leave a Comment Cancel reply
使用cookie技术保留您的个人信息以便您下次快速评论，继续评论表示您已同意该条款