Parsing nested XML in Python?












0















I have the following goodreads response:



<GoodreadsResponse>
<Request>
</Request>
<book>
<popular_shelves>
<shelf name="test" other="1"/>
<shelf name="test2" other="2"/>
</popular_shelves/>
</book>
</GoodreadsResponse>


I want to retrieve the popular_shelves 2nd shelf item. (index 1).



Attempt 1:



from xml.etree import ElementTree as ET

root = ET.parse(urllib.urlopen(baseEndpoint+bookName)).getroot()
for atype in root.findall('book/popular_shelves'):
print(atype.get('shelf'))


Attempt 2:



  genre = root.find('book').findall('popular_shelves')[0].findall('shelf')
print genre[0].text









share|improve this question























  • What did you get with those attempts?

    – usr2564301
    Nov 25 '18 at 19:29
















0















I have the following goodreads response:



<GoodreadsResponse>
<Request>
</Request>
<book>
<popular_shelves>
<shelf name="test" other="1"/>
<shelf name="test2" other="2"/>
</popular_shelves/>
</book>
</GoodreadsResponse>


I want to retrieve the popular_shelves 2nd shelf item. (index 1).



Attempt 1:



from xml.etree import ElementTree as ET

root = ET.parse(urllib.urlopen(baseEndpoint+bookName)).getroot()
for atype in root.findall('book/popular_shelves'):
print(atype.get('shelf'))


Attempt 2:



  genre = root.find('book').findall('popular_shelves')[0].findall('shelf')
print genre[0].text









share|improve this question























  • What did you get with those attempts?

    – usr2564301
    Nov 25 '18 at 19:29














0












0








0








I have the following goodreads response:



<GoodreadsResponse>
<Request>
</Request>
<book>
<popular_shelves>
<shelf name="test" other="1"/>
<shelf name="test2" other="2"/>
</popular_shelves/>
</book>
</GoodreadsResponse>


I want to retrieve the popular_shelves 2nd shelf item. (index 1).



Attempt 1:



from xml.etree import ElementTree as ET

root = ET.parse(urllib.urlopen(baseEndpoint+bookName)).getroot()
for atype in root.findall('book/popular_shelves'):
print(atype.get('shelf'))


Attempt 2:



  genre = root.find('book').findall('popular_shelves')[0].findall('shelf')
print genre[0].text









share|improve this question














I have the following goodreads response:



<GoodreadsResponse>
<Request>
</Request>
<book>
<popular_shelves>
<shelf name="test" other="1"/>
<shelf name="test2" other="2"/>
</popular_shelves/>
</book>
</GoodreadsResponse>


I want to retrieve the popular_shelves 2nd shelf item. (index 1).



Attempt 1:



from xml.etree import ElementTree as ET

root = ET.parse(urllib.urlopen(baseEndpoint+bookName)).getroot()
for atype in root.findall('book/popular_shelves'):
print(atype.get('shelf'))


Attempt 2:



  genre = root.find('book').findall('popular_shelves')[0].findall('shelf')
print genre[0].text






python xml






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Nov 25 '18 at 19:07









John LippsonJohn Lippson

312213




312213













  • What did you get with those attempts?

    – usr2564301
    Nov 25 '18 at 19:29



















  • What did you get with those attempts?

    – usr2564301
    Nov 25 '18 at 19:29

















What did you get with those attempts?

– usr2564301
Nov 25 '18 at 19:29





What did you get with those attempts?

– usr2564301
Nov 25 '18 at 19:29












2 Answers
2






active

oldest

votes


















1














This is how I got the 2nd shelf item from popular_shelves:



import xml.etree.ElementTree as ET

payload = '''
<GoodreadsResponse>
<Request>
</Request>
<book>
<popular_shelves>
<shelf name="test" other="1"/>
<shelf name="test2" other="2"/>
</popular_shelves>
</book>
</GoodreadsResponse>
'''

root = ET.fromstring(payload)
shelves = root.findall("./book/popular_shelves/shelf") # this will get you the list of shelves
print shelves[1].get('name') # fetching the name of 2nd shelf item


So, we can load all shelf items under ./book/popular_shelves into a list. And, then use the list index to access 1st, 2nd, etc. shelf items.






share|improve this answer































    0














    you can try module untangle, it simple and easy to work with, for example:



    In [95]: from untangle import parse

    In [96]: payload = '''
    ...: <GoodreadsResponse>
    ...: <Request>
    ...: </Request>
    ...: <book>
    ...: <popular_shelves>
    ...: <shelf name="test" other="1"/>
    ...: <shelf name="test2" other="2"/>
    ...: </popular_shelves>
    ...: </book>
    ...: </GoodreadsResponse>
    ...: '''

    In [97]: obj = parse(payload)

    In [98]: shelf1 = obj.GoodreadsResponse.book.popular_shelves.shelf[1]

    In [99]: vars(shelf1)
    Out[99]:
    {'_attributes': {u'name': u'test2', u'other': u'2'},
    '_name': u'shelf',
    'cdata': '',
    'children': ,
    'is_root': False}





    share|improve this answer























      Your Answer






      StackExchange.ifUsing("editor", function () {
      StackExchange.using("externalEditor", function () {
      StackExchange.using("snippets", function () {
      StackExchange.snippets.init();
      });
      });
      }, "code-snippets");

      StackExchange.ready(function() {
      var channelOptions = {
      tags: "".split(" "),
      id: "1"
      };
      initTagRenderer("".split(" "), "".split(" "), channelOptions);

      StackExchange.using("externalEditor", function() {
      // Have to fire editor after snippets, if snippets enabled
      if (StackExchange.settings.snippets.snippetsEnabled) {
      StackExchange.using("snippets", function() {
      createEditor();
      });
      }
      else {
      createEditor();
      }
      });

      function createEditor() {
      StackExchange.prepareEditor({
      heartbeatType: 'answer',
      autoActivateHeartbeat: false,
      convertImagesToLinks: true,
      noModals: true,
      showLowRepImageUploadWarning: true,
      reputationToPostImages: 10,
      bindNavPrevention: true,
      postfix: "",
      imageUploader: {
      brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
      contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
      allowUrls: true
      },
      onDemand: true,
      discardSelector: ".discard-answer"
      ,immediatelyShowMarkdownHelp:true
      });


      }
      });














      draft saved

      draft discarded


















      StackExchange.ready(
      function () {
      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53470918%2fparsing-nested-xml-in-python%23new-answer', 'question_page');
      }
      );

      Post as a guest















      Required, but never shown

























      2 Answers
      2






      active

      oldest

      votes








      2 Answers
      2






      active

      oldest

      votes









      active

      oldest

      votes






      active

      oldest

      votes









      1














      This is how I got the 2nd shelf item from popular_shelves:



      import xml.etree.ElementTree as ET

      payload = '''
      <GoodreadsResponse>
      <Request>
      </Request>
      <book>
      <popular_shelves>
      <shelf name="test" other="1"/>
      <shelf name="test2" other="2"/>
      </popular_shelves>
      </book>
      </GoodreadsResponse>
      '''

      root = ET.fromstring(payload)
      shelves = root.findall("./book/popular_shelves/shelf") # this will get you the list of shelves
      print shelves[1].get('name') # fetching the name of 2nd shelf item


      So, we can load all shelf items under ./book/popular_shelves into a list. And, then use the list index to access 1st, 2nd, etc. shelf items.






      share|improve this answer




























        1














        This is how I got the 2nd shelf item from popular_shelves:



        import xml.etree.ElementTree as ET

        payload = '''
        <GoodreadsResponse>
        <Request>
        </Request>
        <book>
        <popular_shelves>
        <shelf name="test" other="1"/>
        <shelf name="test2" other="2"/>
        </popular_shelves>
        </book>
        </GoodreadsResponse>
        '''

        root = ET.fromstring(payload)
        shelves = root.findall("./book/popular_shelves/shelf") # this will get you the list of shelves
        print shelves[1].get('name') # fetching the name of 2nd shelf item


        So, we can load all shelf items under ./book/popular_shelves into a list. And, then use the list index to access 1st, 2nd, etc. shelf items.






        share|improve this answer


























          1












          1








          1







          This is how I got the 2nd shelf item from popular_shelves:



          import xml.etree.ElementTree as ET

          payload = '''
          <GoodreadsResponse>
          <Request>
          </Request>
          <book>
          <popular_shelves>
          <shelf name="test" other="1"/>
          <shelf name="test2" other="2"/>
          </popular_shelves>
          </book>
          </GoodreadsResponse>
          '''

          root = ET.fromstring(payload)
          shelves = root.findall("./book/popular_shelves/shelf") # this will get you the list of shelves
          print shelves[1].get('name') # fetching the name of 2nd shelf item


          So, we can load all shelf items under ./book/popular_shelves into a list. And, then use the list index to access 1st, 2nd, etc. shelf items.






          share|improve this answer













          This is how I got the 2nd shelf item from popular_shelves:



          import xml.etree.ElementTree as ET

          payload = '''
          <GoodreadsResponse>
          <Request>
          </Request>
          <book>
          <popular_shelves>
          <shelf name="test" other="1"/>
          <shelf name="test2" other="2"/>
          </popular_shelves>
          </book>
          </GoodreadsResponse>
          '''

          root = ET.fromstring(payload)
          shelves = root.findall("./book/popular_shelves/shelf") # this will get you the list of shelves
          print shelves[1].get('name') # fetching the name of 2nd shelf item


          So, we can load all shelf items under ./book/popular_shelves into a list. And, then use the list index to access 1st, 2nd, etc. shelf items.







          share|improve this answer












          share|improve this answer



          share|improve this answer










          answered Nov 25 '18 at 21:18









          singletonsingleton

          814




          814

























              0














              you can try module untangle, it simple and easy to work with, for example:



              In [95]: from untangle import parse

              In [96]: payload = '''
              ...: <GoodreadsResponse>
              ...: <Request>
              ...: </Request>
              ...: <book>
              ...: <popular_shelves>
              ...: <shelf name="test" other="1"/>
              ...: <shelf name="test2" other="2"/>
              ...: </popular_shelves>
              ...: </book>
              ...: </GoodreadsResponse>
              ...: '''

              In [97]: obj = parse(payload)

              In [98]: shelf1 = obj.GoodreadsResponse.book.popular_shelves.shelf[1]

              In [99]: vars(shelf1)
              Out[99]:
              {'_attributes': {u'name': u'test2', u'other': u'2'},
              '_name': u'shelf',
              'cdata': '',
              'children': ,
              'is_root': False}





              share|improve this answer




























                0














                you can try module untangle, it simple and easy to work with, for example:



                In [95]: from untangle import parse

                In [96]: payload = '''
                ...: <GoodreadsResponse>
                ...: <Request>
                ...: </Request>
                ...: <book>
                ...: <popular_shelves>
                ...: <shelf name="test" other="1"/>
                ...: <shelf name="test2" other="2"/>
                ...: </popular_shelves>
                ...: </book>
                ...: </GoodreadsResponse>
                ...: '''

                In [97]: obj = parse(payload)

                In [98]: shelf1 = obj.GoodreadsResponse.book.popular_shelves.shelf[1]

                In [99]: vars(shelf1)
                Out[99]:
                {'_attributes': {u'name': u'test2', u'other': u'2'},
                '_name': u'shelf',
                'cdata': '',
                'children': ,
                'is_root': False}





                share|improve this answer


























                  0












                  0








                  0







                  you can try module untangle, it simple and easy to work with, for example:



                  In [95]: from untangle import parse

                  In [96]: payload = '''
                  ...: <GoodreadsResponse>
                  ...: <Request>
                  ...: </Request>
                  ...: <book>
                  ...: <popular_shelves>
                  ...: <shelf name="test" other="1"/>
                  ...: <shelf name="test2" other="2"/>
                  ...: </popular_shelves>
                  ...: </book>
                  ...: </GoodreadsResponse>
                  ...: '''

                  In [97]: obj = parse(payload)

                  In [98]: shelf1 = obj.GoodreadsResponse.book.popular_shelves.shelf[1]

                  In [99]: vars(shelf1)
                  Out[99]:
                  {'_attributes': {u'name': u'test2', u'other': u'2'},
                  '_name': u'shelf',
                  'cdata': '',
                  'children': ,
                  'is_root': False}





                  share|improve this answer













                  you can try module untangle, it simple and easy to work with, for example:



                  In [95]: from untangle import parse

                  In [96]: payload = '''
                  ...: <GoodreadsResponse>
                  ...: <Request>
                  ...: </Request>
                  ...: <book>
                  ...: <popular_shelves>
                  ...: <shelf name="test" other="1"/>
                  ...: <shelf name="test2" other="2"/>
                  ...: </popular_shelves>
                  ...: </book>
                  ...: </GoodreadsResponse>
                  ...: '''

                  In [97]: obj = parse(payload)

                  In [98]: shelf1 = obj.GoodreadsResponse.book.popular_shelves.shelf[1]

                  In [99]: vars(shelf1)
                  Out[99]:
                  {'_attributes': {u'name': u'test2', u'other': u'2'},
                  '_name': u'shelf',
                  'cdata': '',
                  'children': ,
                  'is_root': False}






                  share|improve this answer












                  share|improve this answer



                  share|improve this answer










                  answered Dec 3 '18 at 9:46









                  panda912panda912

                  463




                  463






























                      draft saved

                      draft discarded




















































                      Thanks for contributing an answer to Stack Overflow!


                      • Please be sure to answer the question. Provide details and share your research!

                      But avoid



                      • Asking for help, clarification, or responding to other answers.

                      • Making statements based on opinion; back them up with references or personal experience.


                      To learn more, see our tips on writing great answers.




                      draft saved


                      draft discarded














                      StackExchange.ready(
                      function () {
                      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53470918%2fparsing-nested-xml-in-python%23new-answer', 'question_page');
                      }
                      );

                      Post as a guest















                      Required, but never shown





















































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown

































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown







                      Popular posts from this blog

                      Ottavio Pratesi

                      Tricia Helfer

                      15 giugno