python的正则表达式入门求助

推荐学习书目

Learn Python the Hard Way

Python Sites

PyPI - Python Package Index

http://diveintopython.org/toc/index.html

Pocoo

值得关注的项目

PyPy

Celery

Jinja2

Read the Docs

gevent

pyenv

virtualenv

Sentry

Shovel

Pyflakes

pytest

Python 编程

pep8 Checker

Styles

PEP 8

Google Python Style Guide

Code Style from The Hitchhiker's Guide

This topic created in 4557 days ago, the information mentioned may be changed or developed.

初学python 遇到正则表达式的难题各位大虾能推荐下如何入门么

http://*259

诸如此类网址末尾是数字怎么把它从网页里提取出来？

Supplement 1 Nov 19, 2013

谢谢各位的帮助有点心得了

Python

正则表达式

入门

10 replies 1970-01-01 08:00:00 +08:00

zhy0216

Nov 18, 2013

http://www.amazon.cn/%E6%AD%A3%E5%88%99%E6%8C%87%E5%BC%95-%E4%BD%99%E6%99%9F/dp/B007X6O6J0/ref=sr_1_3?ie=UTF8&qid=1384785712&sr=8-3&keywords=%E6%AD%A3%E5%88%99%E8%A1%A8%E8%BE%BE%E5%BC%8F

yxjxx

Nov 18, 2013

我也刚学python不久,写过一篇笔记. http://yxjxx.me/regular-expression

mengzhuo

Nov 18, 2013

首先网页就不要用正则提取内容，BS4是你的好伙伴
然后提取的所有链接再用正则匹配

https?:\/\/([\d\.]+)\/

Perry

Nov 19, 2013

关于入门：
入门正则可以不用书
几分钟的入门：http://net.tutsplus.com/tutorials/other/8-regular-expressions-you-should-know/
cheatsheet：http://www.addedbytes.com/cheat-sheets/regular-expressions-cheat-sheet
然后发挥你的想象力自己写并验证：http://rubular.com

LetFoxRun

Nov 19, 2013 via Android

@yxjxx

博客里的 intersting 是不是打错了，还是故意这么写的？

sandtears

Nov 19, 2013

import re
tmpRe = re.compile(r"^http://.*?(\d+)$")
tmpNum = tmpRe.match(url).groups()[0]

此时tmp即为str类型的数字

clino

Nov 19, 2013

建议装一个 kodos ,是一个正则的调试集成环境

lixm

Nov 19, 2013

html页面为什么不用xml解析而要去用正则呢？

yxjxx

Nov 19, 2013

@LetFoxRun ,打错了!感谢指出!

C0VN

Nov 19, 2013

http://images.cnblogs.com/cnblogs_com/huxi/Windows-Live-Writer/Python_10A67/pyre_ebb9ce1c-e5e8-4219-a8ae-7ee620d5f9f1.png