Skip to content Skip to sidebar Skip to footer

Php - Split A String Of Html Attributes Into An Indexed Array

I've got a string with HTML attributes: $attribs = ' id= 'header ' class = 'foo bar' style ='background-color:#fff; color: red; ''; How to transform that string into an indexed

Solution 1:

Use SimpleXML:

<?php$attribs = ' id= "header " class = "foo   bar" style ="background-color:#fff; color: red; "';

$x = new SimpleXMLElement("<element $attribs />");

print_r($x);

?>

This assumes that the attributes are always name/value pairs...

Solution 2:

You could use a regular expression to extract that information:

$attribs = ' id= "header " class = "foo   bar" style ="background-color:#fff; color: red; "';
$pattern = '/(\\w+)\s*=\\s*("[^"]*"|\'[^\']*\'|[^"\'\\s>]*)/';
preg_match_all($pattern, $attribs, $matches, PREG_SET_ORDER);
$attrs = array();
foreach ($matchesas$match) {
    if (($match[2][0] == '"' || $match[2][0] == "'") && $match[2][0] == $match[2][strlen($match[2])-1]) {
        $match[2] = substr($match[2], 1, -1);
    }
    $name = strtolower($match[1]);
    $value = html_entity_decode($match[2]);
    switch ($name) {
    case'class':
        $attrs[$name] = preg_split('/\s+/', trim($value));
        break;
    case'style':
        // parse CSS property declarationsbreak;
    default:
        $attrs[$name] = $value;
    }
}
var_dump($attrs);

Now you just need to parse the classes of class (split at whitespaces) and property declarations of style (a little bit harder as it can contain comments and URLs with ; in it).

Solution 3:

Easy way could be also:

$atts_array = current((array) new SimpleXMLElement("<element $attribs />"));

Solution 4:

You can't use a regular expression to parse html-attributes. This is because the syntax is contextual. You can use regular expressions to tokenize the input, but you need a state machine to parse it.

If the performance isn't a big deal, the safest way to do it, is probably to wrap the attributes in a tag and then send it through an html parser. Eg.:

functionparse_attributes($input) {
  $dom = new DomDocument();
  $dom->loadHtml("<foo " . $input. "/>");
  $attributes = array();
  foreach ($dom->documentElement->attributes as$name => $attr) {
    $attributes[$name] = $node->value;
  }
  return$attributes;
}

You could probably optimize the above, by reusing the parser, or by using XmlReader or the sax parser.

Solution 5:

May be this helps you .. What it does ..

  • A HTML DOM parser written in PHP5+ let you manipulate HTML in a very easy way!
  • Require PHP 5+.
  • Supports invalid HTML.
  • Find tags on an HTML page with selectors just like jQuery.
  • Extract contents from HTML in a single line.

http://simplehtmldom.sourceforge.net/

Post a Comment for "Php - Split A String Of Html Attributes Into An Indexed Array"