Python3正则匹配re.split，re.finditer及re.findall函数用法详解

2020-02-15 21:48:09

字体：大中小

来源：转载

供稿：网友

本文实例讲述了Python3正则匹配re.split，re.finditer及re.findall函数用法。分享给大家供大家参考，具体如下：

re.split re.finditer re.findall

@(python3)

官方 re 模块说明文档

re.compile() 函数

编译正则表达式模式，返回一个对象。可以把常用的正则表达式编译成正则表达式对象，方便后续调用及提高效率。

re 模块最离不开的就是 re.compile 函数。其他函数都依赖于 compile 创建的正则表达式对象

re.compile(pattern, flags=0)

pattern 指定编译时的表达式字符串 flags 编译标志位，用来修改正则表达式的匹配方式。支持 re.L|re.M 同时匹配

flags 标志位参数

re.I(re.IGNORECASE)
使匹配对大小写不敏感

re.L(re.LOCAL)
做本地化识别（locale-aware）匹配

re.M(re.MULTILINE)
多行匹配，影响 ^ 和 $

re.S(re.DOTALL)
使 . 匹配包括换行在内的所有字符

re.U(re.UNICODE)
根据Unicode字符集解析字符。这个标志影响 /w, /W, /b, /B.

re.X(re.VERBOSE)
该标志通过给予你更灵活的格式以便你将正则表达式写得更易于理解。

示例：

import recontent = 'Citizen wang , always fall in love with neighbour，WANG'rr = re.compile(r'wan/w', re.I) # 不区分大小写print(type(rr))a = rr.findall(content)print(type(a))print(a)

findall 返回的是一个 list 对象

<class '_sre.SRE_Pattern'>
<class 'list'>
['wang', 'WANG']

re.split 函数

按照指定的 pattern 格式，分割 string 字符串，返回一个分割后的列表。

re.split(pattern, string, maxsplit=0, flags=0)

pattern compile 生成的正则表达式对象，或者自定义也可 string 要匹配的字符串 maxsplit 指定最大分割次数，不指定将全部分割

import restr = 'say hello world! hello python'str_nm = 'one1two2three3four4'pattern = re.compile(r'(?P<space>/s)') # 创建一个匹配空格的正则表达式对象pattern_nm = re.compile(r'(?P<space>/d+)') # 创建一个匹配空格的正则表达式对象match = re.split(pattern, str)match_nm = re.split(pattern_nm, str_nm, maxsplit=1)print(match)print(match_nm)

结果：

['say', ' ', 'hello', ' ', 'world!', ' ', 'hello', ' ', 'python']
['one', '1', 'two2three3four4']

re.findall() 方法

返回一个包含所有匹配到的字符串的列表。

pattern 匹配模式，由 re.compile 获得 string 需要匹配的字符串

import restr = 'say hello world! hello python'pattern = re.compile(r'(?P<first>h/w)(?P<symbol>l+)(?P<last>o/s)') # 分组，0 组是整个 world!, 1组 or，2组 ld!match = re.findall(pattern, str)print(match)

上一篇：解决Python安装后pip不能用的问题

下一篇：pip命令无法使用的解决方法