DOMDocument saveHTML with wrong output












1















I have this simple code:



$input = '<p>ěščřžýáíé</p><p><img alt="" src="http://www.test.com/img.jpg" style="width: 100px; height: 100px;"></p>';
$dom = new DOMDocument('1.0', 'UTF-8');
$dom->encoding = 'UTF-8';
$dom->loadHTML($input, LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD);
$imgs = $dom->getElementsByTagName('img');
foreach($imgs as $img){
$src = $img->getAttribute('src');
$style = $img->getAttribute('style');
$newSrc = 'http://www.test.com/img001.jpg';
$img->setAttribute( 'src' , $newSrc );
}
$content = $dom->saveHTML();


Problem is that output is encoded.
I expect same characters as are on input.
I tried decoding without success. Something wrong with using DOM object?



<p>&Auml;›&Aring;&iexcl;&Auml;&Aring;™&Aring;&frac34;&Atilde;&frac12;&Atilde;&iexcl;&Atilde;&shy;&Atilde;&copy;<p><img alt="" src="http://www.test.com/img001.jpg" style="width: 100px; height: 100px;"></p></p>









share|improve this question

























  • The output is encoded in what way? url encoded, character entities?

    – Andy G
    Nov 22 '18 at 16:08
















1















I have this simple code:



$input = '<p>ěščřžýáíé</p><p><img alt="" src="http://www.test.com/img.jpg" style="width: 100px; height: 100px;"></p>';
$dom = new DOMDocument('1.0', 'UTF-8');
$dom->encoding = 'UTF-8';
$dom->loadHTML($input, LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD);
$imgs = $dom->getElementsByTagName('img');
foreach($imgs as $img){
$src = $img->getAttribute('src');
$style = $img->getAttribute('style');
$newSrc = 'http://www.test.com/img001.jpg';
$img->setAttribute( 'src' , $newSrc );
}
$content = $dom->saveHTML();


Problem is that output is encoded.
I expect same characters as are on input.
I tried decoding without success. Something wrong with using DOM object?



<p>&Auml;›&Aring;&iexcl;&Auml;&Aring;™&Aring;&frac34;&Atilde;&frac12;&Atilde;&iexcl;&Atilde;&shy;&Atilde;&copy;<p><img alt="" src="http://www.test.com/img001.jpg" style="width: 100px; height: 100px;"></p></p>









share|improve this question

























  • The output is encoded in what way? url encoded, character entities?

    – Andy G
    Nov 22 '18 at 16:08














1












1








1


0






I have this simple code:



$input = '<p>ěščřžýáíé</p><p><img alt="" src="http://www.test.com/img.jpg" style="width: 100px; height: 100px;"></p>';
$dom = new DOMDocument('1.0', 'UTF-8');
$dom->encoding = 'UTF-8';
$dom->loadHTML($input, LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD);
$imgs = $dom->getElementsByTagName('img');
foreach($imgs as $img){
$src = $img->getAttribute('src');
$style = $img->getAttribute('style');
$newSrc = 'http://www.test.com/img001.jpg';
$img->setAttribute( 'src' , $newSrc );
}
$content = $dom->saveHTML();


Problem is that output is encoded.
I expect same characters as are on input.
I tried decoding without success. Something wrong with using DOM object?



<p>&Auml;›&Aring;&iexcl;&Auml;&Aring;™&Aring;&frac34;&Atilde;&frac12;&Atilde;&iexcl;&Atilde;&shy;&Atilde;&copy;<p><img alt="" src="http://www.test.com/img001.jpg" style="width: 100px; height: 100px;"></p></p>









share|improve this question
















I have this simple code:



$input = '<p>ěščřžýáíé</p><p><img alt="" src="http://www.test.com/img.jpg" style="width: 100px; height: 100px;"></p>';
$dom = new DOMDocument('1.0', 'UTF-8');
$dom->encoding = 'UTF-8';
$dom->loadHTML($input, LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD);
$imgs = $dom->getElementsByTagName('img');
foreach($imgs as $img){
$src = $img->getAttribute('src');
$style = $img->getAttribute('style');
$newSrc = 'http://www.test.com/img001.jpg';
$img->setAttribute( 'src' , $newSrc );
}
$content = $dom->saveHTML();


Problem is that output is encoded.
I expect same characters as are on input.
I tried decoding without success. Something wrong with using DOM object?



<p>&Auml;›&Aring;&iexcl;&Auml;&Aring;™&Aring;&frac34;&Atilde;&frac12;&Atilde;&iexcl;&Atilde;&shy;&Atilde;&copy;<p><img alt="" src="http://www.test.com/img001.jpg" style="width: 100px; height: 100px;"></p></p>






php domdocument






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Nov 22 '18 at 16:19







step

















asked Nov 22 '18 at 16:02









stepstep

5162828




5162828













  • The output is encoded in what way? url encoded, character entities?

    – Andy G
    Nov 22 '18 at 16:08



















  • The output is encoded in what way? url encoded, character entities?

    – Andy G
    Nov 22 '18 at 16:08

















The output is encoded in what way? url encoded, character entities?

– Andy G
Nov 22 '18 at 16:08





The output is encoded in what way? url encoded, character entities?

– Andy G
Nov 22 '18 at 16:08












1 Answer
1






active

oldest

votes


















2














saveHTML() has a few 'features' which I don't understand, but when saving with a particular document node it will work if you then utf8_decode() the result...



$content = utf8_decode($dom->saveHTML($dom->documentElement));


gives...



<p>ěščřžýáíé<p><img alt="" src="http://www.test.com/img001.jpg" style="width: 100px; height: 100px;"></p></p>





share|improve this answer























    Your Answer






    StackExchange.ifUsing("editor", function () {
    StackExchange.using("externalEditor", function () {
    StackExchange.using("snippets", function () {
    StackExchange.snippets.init();
    });
    });
    }, "code-snippets");

    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "1"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














    draft saved

    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53434646%2fdomdocument-savehtml-with-wrong-output%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    2














    saveHTML() has a few 'features' which I don't understand, but when saving with a particular document node it will work if you then utf8_decode() the result...



    $content = utf8_decode($dom->saveHTML($dom->documentElement));


    gives...



    <p>ěščřžýáíé<p><img alt="" src="http://www.test.com/img001.jpg" style="width: 100px; height: 100px;"></p></p>





    share|improve this answer




























      2














      saveHTML() has a few 'features' which I don't understand, but when saving with a particular document node it will work if you then utf8_decode() the result...



      $content = utf8_decode($dom->saveHTML($dom->documentElement));


      gives...



      <p>ěščřžýáíé<p><img alt="" src="http://www.test.com/img001.jpg" style="width: 100px; height: 100px;"></p></p>





      share|improve this answer


























        2












        2








        2







        saveHTML() has a few 'features' which I don't understand, but when saving with a particular document node it will work if you then utf8_decode() the result...



        $content = utf8_decode($dom->saveHTML($dom->documentElement));


        gives...



        <p>ěščřžýáíé<p><img alt="" src="http://www.test.com/img001.jpg" style="width: 100px; height: 100px;"></p></p>





        share|improve this answer













        saveHTML() has a few 'features' which I don't understand, but when saving with a particular document node it will work if you then utf8_decode() the result...



        $content = utf8_decode($dom->saveHTML($dom->documentElement));


        gives...



        <p>ěščřžýáíé<p><img alt="" src="http://www.test.com/img001.jpg" style="width: 100px; height: 100px;"></p></p>






        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Nov 22 '18 at 16:25









        Nigel RenNigel Ren

        26.7k61833




        26.7k61833






























            draft saved

            draft discarded




















































            Thanks for contributing an answer to Stack Overflow!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid



            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53434646%2fdomdocument-savehtml-with-wrong-output%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Costa Masnaga

            Fotorealismo

            Sidney Franklin