kellnerp
Mechanical
- Feb 11, 2005
- 1,141
So far I have this:
Code:
sed -r 's/(<[^>\n]*>)//g'
It does pretty well with tags that are all on one line, but things like <img src=blah blah blah that may extend over more than one line are not being caught.
Likewise, things like <style type=text/css> ... </style>, where I want to remove not just the tags but also the text between them, are not being caught. Again, <style></style> tag pairs run over multiple lines in the general case.
Is there a way to accomplish this in sed? I can do it in awk already.
TOP
CSWP, BSSE
Phenom IIx6 1100T = 8GB = FX1400 = XP64SP2 = SW2009SP3
"Node news is good news."
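A rough sketch of one direction that might work for the multi-line cases, not a tested answer for every page: sed only sees one line at a time, so the one-liner above can never match a tag that spans a line break. The usual workaround is to pull the next line into the pattern space with N whenever a tag or a <style> block is still open, and only then do the substitutions. The function name strip_html and the file names in the usage comment are made up for illustration, and the script was written with GNU sed in mind (which the -r flag already implies).

```shell
# Sketch only: written with GNU sed in mind. strip_html is a made-up name.
strip_html() {
  sed -e '
    :top
    # A "<" with no ">" after it means the tag continues on the next line,
    # so append that line (N) and check again.
    /<[^>]*$/ {
      $!{
        N
        b top
      }
    }
    # A <style ...> tag has been opened but not closed yet: keep appending.
    /<style[^>]*>/ {
      /<\/style>/!{
        $!{
          N
          b top
        }
      }
    }
    # Drop the style block with its contents, then any remaining tags.
    s/<style[^>]*>.*<\/style>//g
    s/<[^>]*>//g
  '
}

# Hypothetical usage (file names are placeholders):
#   strip_html < page.html > page.txt
```

Known rough edges: it only matches lowercase <style>, a literal "<" in the text (as in "a < b") will cause lines to be joined until the next ">", the greedy .* can swallow text between two style blocks that end up in the same chunk, and a removed multi-line block leaves one empty line behind.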