
Extract a specific user's comments from a list using the Wikipedia API and Python 2.7


I am using the Wikipedia API through the wikitools package to extract some data from Wikipedia. I get output in the format shown below, and I now want to extract the timestamp and the comment of every revision made by a specific user, across several pages. Say I only want the comments made by TechBot; I figured I could do something like:
for revision in res["query"]["pages"]["7940378"]["revisions"]:
    if revision["user"] == "TechBot":
        do.something()

The problem is ["7940378"]: it is a unique page ID that changes for every page, and I don't know how to get the page ID in advance. Is there another way of doing this?
[{
"query": {
  "pages": {
    "7940378": {
      "ns": 0, 
      "pageid": 7940378, 
      "revisions": [
        {
          "comment": "robot  Modifying: [[az:T\u00fcrk Tarixi]]", 
          "timestamp": "2009-01-03T19:47:11Z", 
          "user": "TechBot"
        }, 
        {
          "comment": "", 
          "timestamp": "2009-02-14T02:07:49Z", 
          "anon": "", 
          "user": "88.231.237.130"
        }, 
        {
          "comment": "fixing recent deletion by merging it with the next paragraph", 
          "timestamp": "2009-04-03T14:49:27Z", 
          "user": "Soap"
        }, 
        {
          "comment": "robot  Modifying: [[az:T\u00fcrk tarixi]]", 
          "timestamp": "2009-04-09T14:35:19Z", 
          "user": "RibotBOT"
        }, 
        {
          "comment": "Repairing link to disambiguation page - [[Wikipedia:Disambiguation pages with links|You can help!]]", 
          "timestamp": "2009-06-12T23:55:55Z", 
          "user": "J04n"
        }
      ], 
      "title": "History of the Turkic peoples"
    }
  }
}, 
"continue": {
  "rvcontinue": "20090807172715|306635892", 
  "continue": "||"
}, 
"warnings": {
  "main": {
    "*": "Unrecognized parameter: 'user'"
  }
}
}]
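
One possible way around the hard-coded page ID, sketched under the assumption that `res` is a plain dict shaped like the response above: iterate over the values of `res["query"]["pages"]` instead of indexing by key. The function name `comments_by_user` and the trimmed `sample` dict are hypothetical and only serve as illustration; they are not part of wikitools.

```python
def comments_by_user(response, user):
    """Collect (title, timestamp, comment) for every revision made by `user`.

    `response` is assumed to be a dict shaped like the API output above.
    """
    matches = []
    # Page IDs are the dict keys; iterating over the values avoids needing them.
    pages = response.get("query", {}).get("pages", {})
    for page in pages.values():
        for revision in page.get("revisions", []):
            # User names are compared exactly, so "Techbot" would not match "TechBot".
            if revision.get("user") == user:
                matches.append((
                    page.get("title"),
                    revision.get("timestamp"),
                    revision.get("comment", ""),
                ))
    return matches


# Hypothetical sample, trimmed from the response shown above.
sample = {
    "query": {
        "pages": {
            "7940378": {
                "pageid": 7940378,
                "title": "History of the Turkic peoples",
                "revisions": [
                    {"comment": "robot  Modifying: [[az:T\u00fcrk Tarixi]]",
                     "timestamp": "2009-01-03T19:47:11Z",
                     "user": "TechBot"},
                    {"comment": "fixing recent deletion by merging it with the next paragraph",
                     "timestamp": "2009-04-03T14:49:27Z",
                     "user": "Soap"},
                ],
            }
        }
    }
}

techbot_revisions = comments_by_user(sample, "TechBot")
```

Because the loop never names a page ID, the same function should work when the response holds several pages. Separately, the warning in the response says `user` is not a recognized parameter; the MediaWiki revisions module documents an `rvuser` parameter for filtering by user on the server side, which may be worth trying instead.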

Source: https://stackoverflow.com/questions/36422266

Updated: 2023-01-21 13:01

Best answer

The following query will help you :)
INSERT INTO SubHeadings (SubHeadingName, HeadingID) 
SELECT subhdng.SubHeadingName, hdng.HeadingID 
FROM Headings hdng
INNER JOIN SubHeadings subhdng ON hdng.HeadingID = subhdng.HeadingID 
WHERE hdng.TopicID = 2
