去除HTML标签删除HTML示例代码

前端技术 2023/09/07 .NET
复制代码 代码如下:

/// <summary>
/// 去除HTML标签
/// </summary>
/// <param name=\"Htmlstring\"></param>
/// <returns></returns>
public static string DeleteHTML(string Htmlstring)
{
//删除HTML
Htmlstring = Regex.Replace(Htmlstring, @\"<(.[^>]*)>\", \"\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"([\\r\\n])[\\s]+\", \"\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"-->\", \"\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"<!--.*\", \"\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"&(quot|#34);\", \"\\\"\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"&(amp|#38);\", \"&\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"&(lt|#60);\", \"<\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"&(gt|#62);\", \">\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"&(nbsp|#160);\", \"\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"&(iexcl|#161);\", \"\\xa1\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"&(cent|#162);\", \"\\xa2\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"&(pound|#163);\", \"\\xa3\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"&(copy|#169);\", \"\\xa9\", RegexOptions.IgnoreCase);
Htmlstring = Regex.Replace(Htmlstring, @\"&#(\\d+);\", \"\", RegexOptions.IgnoreCase);
Htmlstring = Htmlstring.Replace(\"<\", \"\");
Htmlstring = Htmlstring.Replace(\">\", \"\");
Htmlstring = Htmlstring.Replace(\"\\r\\n\", \"\");
return Htmlstring;
}

本文地址:https://www.stayed.cn/item/20334

转载请注明出处。

本站部分内容来源于网络,如侵犯到您的权益,请 联系我

我的博客

人生若只如初见,何事秋风悲画扇。