php - Regex + preg_match_all - getting the value of an attribute

I'm trying to get the value of the href attribute of the first <a> tag in a post that links to an image.
This is what I have so far:

$pattern = "/<a.+href=('|\")(.*?).(bmp|gif|jpeg|jpg|png)('|\").*>/i";
$output = preg_match_all($pattern, $post->post_content, $matches);
$first_link = $matches[1][0];

However, this doesn't work.

I do have code that gets the src value of <img> tags, and that one works:

$pattern = '/<img.+src=[\'"]([^\'"]+)[\'"].*>/i';
$output = preg_match_all($pattern, $post->post_content, $matches);
$first_img = $matches[1][0];

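Since I'm no expert in regular expressions or PHP, I don't know what I'm doing wrong.

I also couldn't find any decent, well-organized guide to regular expressions, so a link to one would be useful too!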

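Best answer

This is not a problem you should solve with regular expressions. If you want to parse HTML, what you need is an HTML parser, and PHP already ships with one that suits this job very well!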

$html = <<<HTML
<a href="http://somesillyexample.com/some/silly/path/to/a/file.jpeg">
HTML;

$dom = new DOMDocument();
$dom->loadHTML($html); // load HTML from a string
$elements = $dom->getElementsByTagName('a'); // get all elements with an 'a' tag in the DOM
foreach ($elements as $node) {
    /* If the element has an href attribute let's get it */
    if ($node->hasAttribute('href')) {
        echo $node->getAttribute('href') . "\n";
    }
}
/*
will output:

http://somesillyexample.com/some/silly/path/to/a/file.jpeg
*/

See the DOMDocument documentation for more details.
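
The loop above prints every href it finds, while the question asks only for the first link that points to an image. A minimal sketch of that filtering step follows. It assumes PHP 7.1 or later, and the function name `firstImageLink` is hypothetical, not part of the original answer or of any library. The extension list is taken from the pattern in the question.

```php
<?php
// Hypothetical helper (not from the original answer): returns the href of the
// first <a> tag whose URL path ends in an image extension, or null if none does.
function firstImageLink(string $html): ?string
{
    if ($html === '') {
        return null; // loadHTML() rejects an empty string
    }

    $dom = new DOMDocument();
    // Post content is usually an HTML fragment with imperfect markup, so
    // collect libxml parse warnings internally instead of emitting them.
    $previous = libxml_use_internal_errors(true);
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    foreach ($dom->getElementsByTagName('a') as $node) {
        // getAttribute() returns an empty string when href is missing.
        $href = $node->getAttribute('href');
        // Test only the path so a query string does not hide the extension.
        $path = (string) parse_url($href, PHP_URL_PATH);
        if (preg_match('/\.(bmp|gif|jpe?g|png)$/i', $path)) {
            return $href;
        }
    }

    return null;
}

$html = '<p><a href="/about">About</a> '
      . '<a href="http://somesillyexample.com/pics/cat.PNG">cat</a></p>';
echo firstImageLink($html), "\n";
// prints: http://somesillyexample.com/pics/cat.PNG
```

With the variables from the question, this would be called as `$first_link = firstImageLink($post->post_content);`. A regular expression is still used here, but only on the already extracted URL path, not on the HTML itself.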

Regarding php - Regex + preg_match_all - getting the value of an attribute, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/13779605/
