asp.NET开发中正则表达式中BUG分析

2024-07-10 13:21:22

字体：大中小

来源：转载

供稿：网友

比如以下的代码就是用来测试用正则表达式匹配从 0xff 到 0xffff 的字符。而值范围在 0 到 0xfe 的所有字符是不能被匹配的。　
以下为引用的内容：

复制代码代码如下:

Regex regex = new Regex(@"[/u00FF-/uFFFF]+");
　　// The characters, whoes value are smaller than 0xff,
　　// are not expected to be matched.
　　for (int i = 0; i <0xff; i++) {
　　string s = new string(new char[] { (char)i });
　　Debug.Assert(!regex.IsMatch(s), string.Format(
　　"The character was not expected to be matched: 0x{0:X}!", i));
　　}
　　// However, the characters whoes value
　　// are greater than 0xfe are expected to be matched.
　　for (int i = 0xff; i <= 0xffff; i++) {
　　string s = new string(new char[] { (char)i });
　　Debug.Assert(regex.IsMatch(s), string.Format(
　　"The character was expected to be matched: 0x{0:X}!", i));
　　}

这时的运行结果是正常的，没有任何的断言错误出现。
然而当使用忽略大小写的匹配模式时，结果就不一样了。将上面代码中的第一行改成：
1Regex regex = new Regex(@"[/u00FF-/uFFFF]+", RegexOptions.IgnoreCase);
程序运行的时候就会有两处断言错误。它们分别是字符值为 73 和 105，也就是小写字母 i 和大写字母 I。这个 bug 非常奇怪，别的字符都很正常!而且用 javascript脚本在 IE (版本是6.0)里面运行也同样有这么 bug 存在(比如下面这段代码)。然而在 Firefox中运行就是没有问题的。还是 Firefox 好啊，呵呵!
以下为引用的内容：

复制代码代码如下: