Search the contents in the HTML

The HTML contains two different tags, row0-1 and row1-1. I want to read the contents of the element with id row1Time that sits inside the row1-1 tag.
Here's an example of the HTML:
<li class="zc-ssl-pg" id="row0-1" style="">
    <span id="row1Time" class="zc-ssl-pg-time">4:00 PM</span>
    <li class="zc-ssl-pg" id="row1-1" style="">
    <span id="row1Time" class="zc-ssl-pg-time">3:00 PM</span>
 
Here's my PHP: 
    <?php
    $errmsg_arr = array();
    $errflag = false;
    $link;

    function db_connect()
    {
      define('DB_HOST', 'localhost');
      define('DB_USER', 'myusername');
      define('DB_PASSWORD', 'mypassword');
      define('DB_DATABASE', 'mydbname');

      $errmsg_arr = array();
      $errflag = false;
      $link = mysql_connect(DB_HOST, DB_USER, DB_PASSWORD);

      if(!$link) 
      {
        die('Failed to connect to server: ' . mysql_error());
      }

      $db = mysql_select_db(DB_DATABASE);
      if(!$db) 
      {
        die("Unable to select database");
      }
    }

    $links = $row['links'];
    include ('simple_html_dom.php');
    $html = file_get_html($links);
    //echo $row['links'];

    $base = $row['links'];

    $curl = curl_init();
    curl_setopt($curl, CURLOPT_SSL_VERIFYPEER, FALSE);
    curl_setopt($curl, CURLOPT_HEADER, false);
    curl_setopt($curl, CURLOPT_FOLLOWLOCATION, true);
    curl_setopt($curl, CURLOPT_URL, $base);
    curl_setopt($curl, CURLOPT_REFERER, $base);
    curl_setopt($curl, CURLOPT_RETURNTRANSFER, TRUE);
    $str = curl_exec($curl);
    curl_close($curl);

    // Create a DOM object
    $html = new simple_html_dom();
    // Load HTML from a string
    $html->load($str);

    //get all category links
    /*foreach($html_base->find('a') as $element) {
        echo "<pre>";
        print_r( $element->href );
        echo "</pre>";
    }*/

    //$html_base->clear();
    //unset($html_base);

    $time1 = $html->find('span[id=row1Time]', 0)->plaintext;
    echo '<span id="time1">'.$time1.'</span> - ';
?>
 
When I tried to parse the contents from the HTML using this: 
$time1 = $html->find('span[id=row1Time]', 0)->plaintext;
echo '<span id="time1">'.$time1.'</span> - ';
 
The output I get comes from the row1Time span inside the row0-1 tag:
4:00 PM
 
I want to read the row1Time span inside the row1-1 tag instead, so that the result is 3:00 PM. Can you please help me get that content using simple_html_dom?
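
The lookup described above, restricted to one row, can be sketched as follows. This is a minimal sketch that swaps in PHP's built-in DOMDocument and DOMXPath instead of simple_html_dom, so that it runs without any external file. The helper name findTimeInRow and the $markup variable are hypothetical and not part of the original code; the markup is the sample from the question.

```php
<?php
// Sample markup from the question, kept as-is (duplicate ids, unclosed <li>).
$markup = <<<'HTML'
<li class="zc-ssl-pg" id="row0-1" style="">
    <span id="row1Time" class="zc-ssl-pg-time">4:00 PM</span>
    <li class="zc-ssl-pg" id="row1-1" style="">
    <span id="row1Time" class="zc-ssl-pg-time">3:00 PM</span>
HTML;

/**
 * Hypothetical helper: return the text of the first <span id="$spanId">
 * found inside <li id="$rowId">, or null when nothing matches.
 * Assumes the ids contain no double quotes (they are placed in an XPath string).
 */
function findTimeInRow(string $markup, string $rowId, string $spanId): ?string
{
    // Silence libxml warnings about duplicate ids and unclosed tags.
    $previous = libxml_use_internal_errors(true);
    $doc = new DOMDocument();
    $doc->loadHTML($markup);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    // Scope the search: only spans that are descendants of the chosen <li>.
    $xpath = new DOMXPath($doc);
    $query = sprintf('//li[@id="%s"]//span[@id="%s"]', $rowId, $spanId);
    $nodes = $xpath->query($query);

    if ($nodes === false || $nodes->length === 0) {
        return null;
    }
    return trim($nodes->item(0)->textContent);
}

$time1 = findTimeInRow($markup, 'row1-1', 'row1Time');
echo $time1, PHP_EOL; // prints "3:00 PM"
```

The point of the sketch is that the selector names the parent row first, so the duplicate row1Time id no longer matters. With simple_html_dom, the equivalent is likely a descendant selector such as $html->find('li[id=row1-1] span[id=row1Time]', 0)->plaintext, though that has not been verified against this page.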

Source: https://stackoverflow.com/questions/22848741

Last updated: 2023-07-05 17:07

Best answer

I tried to reformat your code with SWI-Prolog: 
keys(Struct):-
    member(Struct,child(_,4,_,_)),
    member(Struct,child(_,5,_,_)),
    member(Struct,child(_,6,_,_),
           member(Struct,child(_,7,_,_),
              member(Struct,child(_,8,_,_)),
              member(Struct,child(_,_,"Banana",_),
...
 
It seems you're missing some parentheses. After the obvious correction, I get:
?- solve.
Solve:[child(Dima,6,Icecream,Thunderstorm),child(Kate,8,Pasta,Spiders),child(Misha,5,Chocolate,Ghosts),child(Sveta,7,Pizza,Dogs),child(Ura,4,Banana,Darkness)]
true .
