首页 > 编程 > .NET > 正文

去除HTML标签删除HTML示例代码

2024-07-10 13:27:52
字体:
来源:转载
供稿:网友
这篇文章主要介绍了如何去除HTML标签、删除HTML。示例中使用到了一个正则,直接使用就可以了
 
 
复制代码代码如下:

/// <summary> 
/// 去除HTML标签 
/// </summary> 
/// <param name="Htmlstring"></param> 
/// <returns></returns> 
public static string DeleteHTML(string Htmlstring) 

//删除HTML 
Htmlstring = Regex.Replace(Htmlstring, @"<(.[^>]*)>", "", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"([/r/n])[/s]+", "", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"-->", "", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"<!--.*", "", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"&(quot|#34);", "/"", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"&(amp|#38);", "&", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"&(lt|#60);", "<", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"&(gt|#62);", ">", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"&(nbsp|#160);", "", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"&(iexcl|#161);", "/xa1", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"&(cent|#162);", "/xa2", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"&(pound|#163);", "/xa3", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"&(copy|#169);", "/xa9", RegexOptions.IgnoreCase); 
Htmlstring = Regex.Replace(Htmlstring, @"&#(/d+);", "", RegexOptions.IgnoreCase); 
Htmlstring = Htmlstring.Replace("<", ""); 
Htmlstring = Htmlstring.Replace(">", ""); 
Htmlstring = Htmlstring.Replace("/r/n", ""); 
return Htmlstring; 

发表评论 共有条评论
用户名: 密码:
验证码: 匿名发表