from scrapy.selector import Selector htmlText="A1B...
from scrapy.selector import Selector htmlText="A1B1<c>C1</c><d>D<e>E</e></d>B2<c>C2</c>" selector=Selector(text=htmlText) s=selector.xpath("//a/following-sibling::*") print(s.extract()) s=selector.xpath("//a/following-sibling::*[position()=1]") print(s.extract()) s=selector.xpath("//b[position()=1]/following-sibling::*") print(s.extract()) _____________________________________________ print(s.extract()) 程序结果: ['B1', '<c>C1</c>', '<d>D<e>E</e></d>', 'B2', '<c>C2</c>'] ['B1'] ['<c>C1</c>', '<d>D<e>E</e></d>', 'B2', '<c>C2</c>'] ['<c>C1</c>']
A、s=selector.xpath("//b[position()=1]/following-sibling::*[position()=1]")
B、s=selector.xpath("//b[position()=1]/following-sibling::*")
C、s=selector.xpath("//b/following-sibling::*[position()=1]")
D、s=selector.xpath("//b/following-sibling::*")