首页 \ 问答 \ 使用Wikipedia API和Python 2.7从列表中提取特定用户注释(Extract specific users comments from a list using Wikipedia API and Python 2.7)

使用Wikipedia API和Python 2.7从列表中提取特定用户注释(Extract specific users comments from a list using Wikipedia API and Python 2.7)

我正在使用维基百科API - wikitools包从维基百科中提取一些数据。 我得到下面显示的格式的输出,现在我想提取几个页面的特定用户的修订的时间戳和注释。 假设我只想要TechBot发表的评论,然后我想我可以做类似的事情:

for revision in res["query"]["pages"]["7940378"]["revisions"]:
    if revision["user"] = "Techbot":
        do.something()

但问题是[“7940378”],因为这是一个唯一的页面ID,并将为每个页面更改,我不知道如何获取pageid。 还有另一种方法吗?

[{
"query": {
  "pages": {
    "7940378": {
      "ns": 0, 
      "pageid": 7940378, 
      "revisions": [
        {
          "comment": "robot  Modifying: [[az:T\u00fcrk Tarixi]]", 
          "timestamp": "2009-01-03T19:47:11Z", 
          "user": "TechBot"
        }, 
        {
          "comment": "", 
          "timestamp": "2009-02-14T02:07:49Z", 
          "anon": "", 
          "user": "88.231.237.130"
        }, 
        {
          "comment": "fixing recent deletion by merging it with the next paragraph", 
          "timestamp": "2009-04-03T14:49:27Z", 
          "user": "Soap"
        }, 
        {
          "comment": "robot  Modifying: [[az:T\u00fcrk tarixi]]", 
          "timestamp": "2009-04-09T14:35:19Z", 
          "user": "RibotBOT"
        }, 
        {
          "comment": "Repairing link to disambiguation page - [[Wikipedia:Disambiguation pages with links|You can help!]]", 
          "timestamp": "2009-06-12T23:55:55Z", 
          "user": "J04n"
        }
      ], 
      "title": "History of the Turkic peoples"
    }
  }
}, 
"continue": {
  "rvcontinue": "20090807172715|306635892", 
  "continue": "||"
}, 
"warnings": {
  "main": {
    "*": "Unrecognized parameter: 'user'"
  }
}
}]

I am using the wikipedia API - wikitools package to extract some data from Wikipedia. I get the output of the format shown below and now I want to extract the timestamp and the comment for revisions made of specific user for several pages. Let's say I just want the comments made by TechBot, then I figured that I can do something like:

for revision in res["query"]["pages"]["7940378"]["revisions"]:
    if revision["user"] = "Techbot":
        do.something()

But the problem is ["7940378"] because this is a unique page id and will change for every page and I dont know how to get the pageid. Is there another way of doing this?

[{
"query": {
  "pages": {
    "7940378": {
      "ns": 0, 
      "pageid": 7940378, 
      "revisions": [
        {
          "comment": "robot  Modifying: [[az:T\u00fcrk Tarixi]]", 
          "timestamp": "2009-01-03T19:47:11Z", 
          "user": "TechBot"
        }, 
        {
          "comment": "", 
          "timestamp": "2009-02-14T02:07:49Z", 
          "anon": "", 
          "user": "88.231.237.130"
        }, 
        {
          "comment": "fixing recent deletion by merging it with the next paragraph", 
          "timestamp": "2009-04-03T14:49:27Z", 
          "user": "Soap"
        }, 
        {
          "comment": "robot  Modifying: [[az:T\u00fcrk tarixi]]", 
          "timestamp": "2009-04-09T14:35:19Z", 
          "user": "RibotBOT"
        }, 
        {
          "comment": "Repairing link to disambiguation page - [[Wikipedia:Disambiguation pages with links|You can help!]]", 
          "timestamp": "2009-06-12T23:55:55Z", 
          "user": "J04n"
        }
      ], 
      "title": "History of the Turkic peoples"
    }
  }
}, 
"continue": {
  "rvcontinue": "20090807172715|306635892", 
  "continue": "||"
}, 
"warnings": {
  "main": {
    "*": "Unrecognized parameter: 'user'"
  }
}
}]

原文:https://stackoverflow.com/questions/36422266
更新时间:2023-01-21 13:01

最满意答案

以下查询将帮助你:)

INSERT INTO SubHeadings (SubHeadingName, HeadingID) 
SELECT subhdng.SubHeadingName, hdng.HeadingID 
FROM Headings hdng
INNER JOIN SubHeadings subhdng ON hdng.HeadingID = subhdng.HeadingID 
WHERE hdng.TopicID = 2

Following query will help you :)

INSERT INTO SubHeadings (SubHeadingName, HeadingID) 
SELECT subhdng.SubHeadingName, hdng.HeadingID 
FROM Headings hdng
INNER JOIN SubHeadings subhdng ON hdng.HeadingID = subhdng.HeadingID 
WHERE hdng.TopicID = 2

相关问答

更多

相关文章

更多

最新问答

更多
  • 获取MVC 4使用的DisplayMode后缀(Get the DisplayMode Suffix being used by MVC 4)
  • 如何通过引用返回对象?(How is returning an object by reference possible?)
  • 矩阵如何存储在内存中?(How are matrices stored in memory?)
  • 每个请求的Java新会话?(Java New Session For Each Request?)
  • css:浮动div中重叠的标题h1(css: overlapping headlines h1 in floated divs)
  • 无论图像如何,Caffe预测同一类(Caffe predicts same class regardless of image)
  • xcode语法颜色编码解释?(xcode syntax color coding explained?)
  • 在Access 2010 Runtime中使用Office 2000校对工具(Use Office 2000 proofing tools in Access 2010 Runtime)
  • 从单独的Web主机将图像传输到服务器上(Getting images onto server from separate web host)
  • 从旧版本复制文件并保留它们(旧/新版本)(Copy a file from old revision and keep both of them (old / new revision))
  • 西安哪有PLC可控制编程的培训
  • 在Entity Framework中选择基类(Select base class in Entity Framework)
  • 在Android中出现错误“数据集和渲染器应该不为null,并且应该具有相同数量的系列”(Error “Dataset and renderer should be not null and should have the same number of series” in Android)
  • 电脑二级VF有什么用
  • Datamapper Ruby如何添加Hook方法(Datamapper Ruby How to add Hook Method)
  • 金华英语角.
  • 手机软件如何制作
  • 用于Android webview中图像保存的上下文菜单(Context Menu for Image Saving in an Android webview)
  • 注意:未定义的偏移量:PHP(Notice: Undefined offset: PHP)
  • 如何读R中的大数据集[复制](How to read large dataset in R [duplicate])
  • Unity 5 Heighmap与地形宽度/地形长度的分辨率关系?(Unity 5 Heighmap Resolution relationship to terrain width / terrain length?)
  • 如何通知PipedOutputStream线程写入最后一个字节的PipedInputStream线程?(How to notify PipedInputStream thread that PipedOutputStream thread has written last byte?)
  • python的访问器方法有哪些
  • DeviceNetworkInformation:哪个是哪个?(DeviceNetworkInformation: Which is which?)
  • 在Ruby中对组合进行排序(Sorting a combination in Ruby)
  • 网站开发的流程?
  • 使用Zend Framework 2中的JOIN sql检索数据(Retrieve data using JOIN sql in Zend Framework 2)
  • 条带格式类型格式模式编号无法正常工作(Stripes format type format pattern number not working properly)
  • 透明度错误IE11(Transparency bug IE11)
  • linux的基本操作命令。。。