0% found this document useful (0 votes)

72 views16 pages

Regular Expression 4

The document discusses regular expressions in Python. It covers matching patterns, functions like search(), findall(), split(), and sub(). It discusses special characters like [], ., ^, $, *, etc. and how they are used to match patterns. It provides examples of using regular expressions to match words starting with a particular letter, replacing substrings, and verifying phone numbers.

Uploaded by

patilpatil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

72 views16 pages

Regular Expression 4

Uploaded by

patilpatil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

Regular Expression

• Regular Expression
In the real world, string parsing in most programming languages is handled by
regular expression. Regular expression in a python programming language is a
method used for matching text pattern.
The “re” module which comes with every python installation provides regular
expression support.
In python, a regular expression search is typically written as:
match = re.search(pattern, string)
The re.search() method takes two arguments, a regular expression pattern
and a string and searches for that pattern within the string. If the pattern is
found within the string, search() returns a match object or None otherwise.
So in a regular expression, given a string, determine whether that string
matches a given pattern, and, optionally, collect substrings that contain
relevant information.
• Matching patterns
• Regular expressions are complicated mini-language. They rely on special
characters to match unknown strings, but let's start with literal characters,
such as letters, numbers, and the space character, which always match
themselves. Let's see a basic example:
import re
search_string = "TutorialsPoint"
pattern = "Tutorials"
match = re.match(pattern, search_string)
#If-statement after search() tests if it succeeded
if match:
print("regex matches: ", match.group())
else:
print('pattern not found')

Output-- regex matches: Tutorials

• RegEx Functions
• The re module offers a set of functions that allows us to search a string for
a match:

Functi Description
on
findall Returns a list containing all matches

search Returns a Match object if there is a match anywhere in the string

split Returns a list where the string has been split at each match

sub Replaces one or many matches with a string

• Metacharacters
• Metacharacters are characters with a special meaning:

Character Description Example

[] A set of characters "[a-m]"

\ Signals a special sequence (can also be used to escape special characters) "\d"

. Any character (except newline character) "he..o"

^ Starts with "^hello"

$ Ends with "planet$"

* Zero or more occurrences "he.*o"

+ One or more occurrences "he.+o"

? Zero or one occurrences "he.?o"

{} Exactly the specified number of occurrences "he.{2}o"

| Either or "falls|stays"

() Capture and group

import re
txt = "The rain in Spain"
#Find all lower case characters alphabetically between "a" and "m":
x = re.findall("[a-m]", txt)
print(x)

import re
txt = "That will be 59 dollars"
#Find all digit characters:
x = re.findall("\d", txt)
print(x)

import re
txt = "hello planet"
#Check if the string ends with 'planet‘
x = re.findall("planet$", txt)
if x:
print("Yes, the string ends with 'planet'")
else:
print("No match")
• Special Sequences
• A special sequence is a \ followed by one of the characters in the list
below, and has a special meaning:
Character Description Example
\A Returns a match if the specified characters are at the beginning of the string "\AThe"

\b Returns a match where the specified characters are at the beginning or at the end of a word r"\bain"
(the "r" in the beginning is making sure that the string is being treated as a "raw string") r"ain\b"

\B Returns a match where the specified characters are present, but NOT at the beginning (or at the end) of r"\Bain"
a word r"ain\B"
(the "r" in the beginning is making sure that the string is being treated as a "raw string")

\d Returns a match where the string contains digits (numbers from 0-9) "\d"

\D Returns a match where the string DOES NOT contain digits "\D"

\s Returns a match where the string contains a white space character "\s"

\S Returns a match where the string DOES NOT contain a white space character "\S"

\w Returns a match where the string contains any word characters (characters from a to Z, digits from 0-9, "\w"
and the underscore _ character)
\W Returns a match where the string DOES NOT contain any word characters "\W"

\Z Returns a match if the specified characters are at the end of the string "Spain\Z"
import re
txt = "The rain in Spain"
#Check if the string starts with "The":
x = re.findall("\AThe", txt)
print(x)
if x:
print("Yes, there is a match!")
else:
print("No match")

import re
txt = "The rain in Spain"
#Check if "ain" is present, but NOT at the beginning of a word:
x = re.findall(r"\Bain", txt)
print(x)
if x:
print("Yes, there is at least one match!")
else:
print("No match")
• Sets
• A set is a set of characters inside a pair of square brackets [] with a special
meaning:
Set Description
[arn] Returns a match where one of the specified characters (a, r, or n) are present

[a-n] Returns a match for any lower case character, alphabetically between a and n

[^arn] Returns a match for any character EXCEPT a, r, and n

[0123] Returns a match where any of the specified digits (0, 1, 2, or 3) are present

[0-9] Returns a match for any digit between 0 and 9

[0-5][0-9] Returns a match for any two-digit numbers from 00 and 59

[a-zA-Z] Returns a match for any character alphabetically between a and z, lower case OR upper case

[+] In sets, +, *, ., |, (), $,{} has no special meaning, so [+] means: return a match for any + character in
the string
import re
txt = "The rain in Spain"
#Check if the string has any characters between a and n:
x = re.findall("[a-n]", txt)
print(x)
if x:
print("Yes, there is at least one match!")
else:
print("No match")
-----------------------------------------------------------------------------------------------------------
import re
txt = "8 times before 11:45 AM"
#Check if the string has any digits:
x = re.findall("[0-9]", txt)
print(x)
if x:
print("Yes, there is at least one match!")
else:
print("No match")
• The findall() Function
• The findall() function returns a list containing all matches.
import re
#Return a list containing every occurrence of "ai":
txt = "The rain in Spain“
x = re.findall("ai", txt)
print(x)
• ----------------------------------------------------------------------------------------------
• The list contains the matches in the order they are found.
• If no matches are found, an empty list is returned:
import re
txt = "The rain in Spain"
#Check if "Portugal" is in the string:
x = re.findall("Portugal", txt)
print(x)
if (x):
print("Yes, there is at least one match!")
else:
print("No match")
• The search() Function
• The search() function searches the string for a match, and returns a Match
object if there is a match.
• If there is more than one match, only the first occurrence of the match will
be returned:
import re
txt = "The rain in Spain"
x = re.search("\s", txt)
print("The first white-space character is located in position:", x.start())
---------------------------------------------------------------------------------------------------
If no matches are found, the value None is returned:
import re
txt = "The rain in Spain"
x = re.search("Portugal", txt)
print(x)
• The split() Function
• The split() function returns a list where the string has been split at each
match:
import re
#Split the string at every white-space character:
txt = "The rain in Spain"
x = re.split("\s", txt)
print(x)
• --------------------------------------------------------------------------------------------------
• You can control the number of occurrences by specifying
the maxsplit parameter:
• Split the string only at the first occurrence:
import re
#Split the string at the first white-space character:
txt = "The rain in Spain"
x = re.split("\s", txt, 1)
print(x)
match word with perticular pattern
import re

Str="sat,hat,mat,pat"

allStr=re.findall("[shmp]at",Str)
# speciafically word start with s h m p and end with at

for i in allStr:
print(i) Output--- sat hat mat pat
----------------------------------------
Match Series of range of character

import re
Str="Sat, hat,mat,pat"

someStr=re.findall("[h-m]at",Str)
for i in someStr:
print(i) Output--- hat mat
----------------------------------
someStr=re.findall("[^h-m]at",Str)# everything apart from h-m
Replace a string
import re
item= 'hat,rat,mat,pat'
regex=re.compile("[r]at")
item=regex.sub("item",item) # for replacing
print(item) Output-- hat item mat pat
--------------------------------------------
Verify Phone Number
import re
# \w [a-z A-Z 0-9]
# \W [^a-z A-Z 0-9]
phn = "412-555-1212"
if re.search("\w{3}-\w{3}-\w{4}",phn)
print("It is a phone number")
-----------------------------------------------------------
if re.search("\d{3}-\d{3}-\d{4}",phn)
-----------------------------------------
Verify Name
import re
if re.search("\w{2,20}\s\w{2,20}", "Sachin Tendulkar"):
print("Name is valid")
#{first name range} \s—space {last name range}
-----------------------------------------------
verify email address
import re
email = "sk@aol.com md@.com @seo.com dc@.com"
print("Email Matches :",len(re.findall("[\w._%+-]{1,20}@[\w.-]{2,20}.[A-Za-
z]{2,3}",email)))
Output-- sk@aol.com

Re Expression 19 and 20
No ratings yet
Re Expression 19 and 20
26 pages
Kip o Editor Manual
No ratings yet
Kip o Editor Manual
241 pages
9 RegEx
No ratings yet
9 RegEx
57 pages
Unit 3 Python
No ratings yet
Unit 3 Python
72 pages
PP - Module-3 Notes
No ratings yet
PP - Module-3 Notes
56 pages
Python Unit-3
No ratings yet
Python Unit-3
23 pages
Suni
No ratings yet
Suni
104 pages
Python Unit 3
No ratings yet
Python Unit 3
46 pages
3.III-Regular Expression Part-I & II 2022-23
No ratings yet
3.III-Regular Expression Part-I & II 2022-23
14 pages
Unit 2
No ratings yet
Unit 2
69 pages
Japan CTD
No ratings yet
Japan CTD
25 pages
UNIT4
No ratings yet
UNIT4
67 pages
Lecture 11 Regular Expressions
No ratings yet
Lecture 11 Regular Expressions
17 pages
9 RegEx
No ratings yet
9 RegEx
57 pages
Unit 4 Regular Expression
No ratings yet
Unit 4 Regular Expression
16 pages
Lecture 14 Regular Expressions
No ratings yet
Lecture 14 Regular Expressions
4 pages
Unit - 4 Regex
No ratings yet
Unit - 4 Regex
28 pages
13B RegExp
No ratings yet
13B RegExp
38 pages
Regular Expression 01
No ratings yet
Regular Expression 01
48 pages
Reg Exp
No ratings yet
Reg Exp
10 pages
Kevin's Resume
No ratings yet
Kevin's Resume
2 pages
Unit7 RegularExpressionpdf 2023 10 17 09 16 29
No ratings yet
Unit7 RegularExpressionpdf 2023 10 17 09 16 29
17 pages
Module II
No ratings yet
Module II
17 pages
Summary Python 1
No ratings yet
Summary Python 1
36 pages
Unit 4
No ratings yet
Unit 4
18 pages
Regular Expression L
No ratings yet
Regular Expression L
20 pages
Python Re
No ratings yet
Python Re
18 pages
17 - Regular Expression
No ratings yet
17 - Regular Expression
20 pages
Aa7b3 Data Structure Mcqs
No ratings yet
Aa7b3 Data Structure Mcqs
80 pages
Tsa Lab Record - Cse
No ratings yet
Tsa Lab Record - Cse
53 pages
Python 201 - (Slightly) Advanced Python Topics
No ratings yet
Python 201 - (Slightly) Advanced Python Topics
69 pages
Python Complete Unit 3
No ratings yet
Python Complete Unit 3
40 pages
Regular
No ratings yet
Regular
9 pages
Regular Expression
No ratings yet
Regular Expression
17 pages
Regular Exp
No ratings yet
Regular Exp
10 pages
RegEx in Python
No ratings yet
RegEx in Python
5 pages
Lec 06 - Regular Expression
No ratings yet
Lec 06 - Regular Expression
19 pages
Report File 1
No ratings yet
Report File 1
41 pages
PM Debug Info
No ratings yet
PM Debug Info
17 pages
Compsys21 PDF
No ratings yet
Compsys21 PDF
27 pages
Regular Expressions: Python For Everybody
No ratings yet
Regular Expressions: Python For Everybody
34 pages
UNIT-4 (Regular Expressions)
No ratings yet
UNIT-4 (Regular Expressions)
25 pages
Lecture 7 Re Part2 Split
No ratings yet
Lecture 7 Re Part2 Split
8 pages
Day-13 Python Regx
No ratings yet
Day-13 Python Regx
11 pages
Regular Expression
No ratings yet
Regular Expression
22 pages
Unit-3 Python
No ratings yet
Unit-3 Python
72 pages
Ge Rex
No ratings yet
Ge Rex
32 pages
Regular Expression
No ratings yet
Regular Expression
21 pages
Python Reg Expressions
No ratings yet
Python Reg Expressions
8 pages
Python Regex Cheat Sheet
No ratings yet
Python Regex Cheat Sheet
29 pages
Bike Sharing Python Report
No ratings yet
Bike Sharing Python Report
40 pages
Ite Unit 5
No ratings yet
Ite Unit 5
16 pages
23.python Regular Expressions
No ratings yet
23.python Regular Expressions
7 pages
1 What Do You See As The Objective of Information Security Within A Business or Organization
No ratings yet
1 What Do You See As The Objective of Information Security Within A Business or Organization
8 pages
PP - Chapter - 4
No ratings yet
PP - Chapter - 4
15 pages
Business Analytics Assignment Business Analytics Assignment: Neha Singh Neha Singh
No ratings yet
Business Analytics Assignment Business Analytics Assignment: Neha Singh Neha Singh
16 pages
Introduction To MIS and System Concepts
No ratings yet
Introduction To MIS and System Concepts
31 pages
Lecture 9 Python
No ratings yet
Lecture 9 Python
8 pages
Lecture 6 Re Basics
No ratings yet
Lecture 6 Re Basics
12 pages
Introduction To ADB'S Management Action Record System (Mars) and Lessons Database
No ratings yet
Introduction To ADB'S Management Action Record System (Mars) and Lessons Database
25 pages
Manipulating Text With Regular Expression in Python
No ratings yet
Manipulating Text With Regular Expression in Python
4 pages
Python - Regular Expressions - Code
No ratings yet
Python - Regular Expressions - Code
4 pages
CoreDB - A Data Lake Service
No ratings yet
CoreDB - A Data Lake Service
4 pages
Unit-3 - Regular Expression
No ratings yet
Unit-3 - Regular Expression
15 pages
Free Resume Builder and
100% (2)
Free Resume Builder and
4 pages
Regular Expressions
No ratings yet
Regular Expressions
5 pages
Python Course: Session 6b - Regular Expressions
No ratings yet
Python Course: Session 6b - Regular Expressions
11 pages
Robotics INNOVATION REPORT
No ratings yet
Robotics INNOVATION REPORT
15 pages
Chapter - 11 - Regular Expressions
100% (1)
Chapter - 11 - Regular Expressions
10 pages
Regular Expressions Python
No ratings yet
Regular Expressions Python
26 pages
2 MARKS With Answer EE3017-Embedded C Programming
No ratings yet
2 MARKS With Answer EE3017-Embedded C Programming
19 pages
Python
No ratings yet
Python
4 pages
Regular Exp
No ratings yet
Regular Exp
6 pages
Python RegEx
No ratings yet
Python RegEx
11 pages
Brocade Full Fos Eula Final Oct 1 2019
No ratings yet
Brocade Full Fos Eula Final Oct 1 2019
8 pages
STARTER V55 HF1 Restrictions
No ratings yet
STARTER V55 HF1 Restrictions
6 pages
Dnsbox: Powerful DNS, DHCP & Ipam Appliance
No ratings yet
Dnsbox: Powerful DNS, DHCP & Ipam Appliance
2 pages
Practical-6
No ratings yet
Practical-6
7 pages
H8 User Manual
100% (4)
H8 User Manual
89 pages
Python Regular Expressions
No ratings yet
Python Regular Expressions
14 pages
Python Regex: Re - Match, Re - Search, Re - Findall With Example
No ratings yet
Python Regex: Re - Match, Re - Search, Re - Findall With Example
10 pages
Computer Shop Management System
No ratings yet
Computer Shop Management System
33 pages
Learn Python through Nursery Rhymes and Fairy Tales: Classic Stories Translated into Python Programs (Coding for Kids and Beginners)
From Everand
Learn Python through Nursery Rhymes and Fairy Tales: Classic Stories Translated into Python Programs (Coding for Kids and Beginners)
Shari Eskenas
5/5 (1)
Regular Expressions: Regular Expressions Are A Powerful Tool For Various Kinds of String Manipulation
No ratings yet
Regular Expressions: Regular Expressions Are A Powerful Tool For Various Kinds of String Manipulation
4 pages
Federal Contractor Resume
100% (2)
Federal Contractor Resume
5 pages
Python Regex
No ratings yet
Python Regex
8 pages
Discover Pro Gen.1&2 (MIB) : Map Material Update
No ratings yet
Discover Pro Gen.1&2 (MIB) : Map Material Update
5 pages
New 308
No ratings yet
New 308
3 pages
Manual
No ratings yet
Manual
17 pages
MP 3054
No ratings yet
MP 3054
8 pages

Regular Expression 4

Uploaded by

Regular Expression 4

Uploaded by

Regular Expression

Output-- regex matches: Tutorials

search Returns a Match object if there is a match anywhere in the string

sub Replaces one or many matches with a string

Character Description Example

. Any character (except newline character) "he..o"

^ Starts with "^hello"

$ Ends with "planet$"

* Zero or more occurrences "he.*o"

+ One or more occurrences "he.+o"

? Zero or one occurrences "he.?o"

{} Exactly the specified number of occurrences "he.{2}o"

() Capture and group

[^arn] Returns a match for any character EXCEPT a, r, and n

[0-9] Returns a match for any digit between 0 and 9

[0-5][0-9] Returns a match for any two-digit numbers from 00 and 59

You might also like