Example: Fetching a Page's title with cURL in PHP

Front-end Technology 2023/09/08 PHP

A hands-on demonstration of fetching a page's <title> content with PHP:

Sample code:

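<?php
/*
Purpose: fetch the <title> content of the page at a URL

Parameter: $_POST['url']
*/

// Set the maximum execution time in seconds
// (expect.timeout only takes effect when the expect extension is loaded)
ini_set("expect.timeout", 30);
set_time_limit(30);

// Validate the URL
if(!isset($_POST['url']) || $_POST['url'] == ''){
   echo "Invalid URL";
   exit;
}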


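/* Fetch the page data from the URL */
// Initialize cURL
$ch = curl_init();

// Set the URL
curl_setopt($ch, CURLOPT_URL, $_POST['url']);
// Make curl_exec() return the response as a string instead of printing it directly
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
// Seconds to wait while trying to connect. 0 does not mean "do not wait":
// it sets no short connect limit of its own, so CURLOPT_TIMEOUT below is the effective bound
curl_setopt($ch, CURLOPT_CONNECTTIMEOUT, 0);
// Maximum number of seconds the whole cURL transfer may take
curl_setopt($ch, CURLOPT_TIMEOUT, 30);

// Try to fetch the page content
$store = curl_exec($ch);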


// Check whether the content was fetched successfully
if (curl_errno($ch)){
   echo "Unable to fetch data from the URL";
   //echo curl_error($ch); /* show the error message */
   exit;
}

// Close the cURL handle
curl_close($ch);


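// Parse the <head> section of the HTML
// (<head\b[^>]*> avoids accidentally matching tags such as <header>)
preg_match("/<head\b[^>]*>(.*)<\/head>/smUi",$store, $htmlHeaders);
if(!count($htmlHeaders)){
   echo "Unable to parse the <head> section of the data";
   exit;
}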

// Read the character encoding declared by a meta tag in <head>
if(preg_match("/<meta[^>]*http-equiv[^>]*charset=(.*)(\"|')/Ui",$htmlHeaders[1], $results)){
   $charset = $results[1];
}else{
   $charset = "None";
}

// Extract the text inside <title>
// (the s flag lets the title span several lines; [^>]* allows attributes on the tag)
if(!preg_match("/<title[^>]*>(.*)<\/title>/sUi",$htmlHeaders[1], $htmlTitles)){
   echo "Unable to parse the <title> content";
   exit;
}

// Convert the <title> text to UTF-8
if($charset == "None"){
   $title=$htmlTitles[1];
}else{
   $title=iconv($charset, "UTF-8", $htmlTitles[1]);
}
echo $title;
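
The parsing steps above can also be packaged as a function that works on an HTML string, which makes them easy to try without a network request. The following is only a sketch under stated assumptions: the name extractTitle is hypothetical and not part of the original script, it searches the whole document rather than only the <head> section, and it also accepts the shorter <meta charset="..."> form, which the original regular expression does not cover.

```php
<?php
// Hypothetical helper (not in the original script): returns the <title>
// text of an HTML string converted to UTF-8, or null if no title is found.
function extractTitle(string $html): ?string
{
    // Match the title, allowing attributes on the tag and line breaks inside it.
    if (!preg_match('/<title\b[^>]*>(.*?)<\/title>/is', $html, $m)) {
        return null;
    }
    $title = trim($m[1]);

    // Look for charset=... inside any <meta> tag. This covers both the
    // http-equiv form and the shorter <meta charset="..."> form.
    if (preg_match('/<meta\b[^>]*charset\s*=\s*["\']?([\w-]+)/i', $html, $c)
        && strcasecmp($c[1], 'UTF-8') !== 0) {
        // Suppress the warning for unknown charsets and keep the raw text instead.
        $converted = @iconv($c[1], 'UTF-8//IGNORE', $title);
        if ($converted !== false) {
            $title = $converted;
        }
    }
    return $title;
}
```

In the script above, this could be called as extractTitle($store) right after curl_close($ch), treating a null return value as "no <title> found".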

Article URL: https://www.stayed.cn/item/22609
