My storm topology fails after running for 10 days













My Storm topology fails after running for about 10 days. When I deploy the same topology (same JAR) under a new name, it runs fine and is still running to date. If I redeploy the topology under the OLD name, it fails again within a few hours. So my question is: what new resources are allocated to a newly deployed Storm topology, including ZooKeeper memory?

I did not make any changes before deploying it under the new topology name.

Does a Storm topology consume memory space on the worker node after running for a long period, which I would need to take care of?










  • Please describe what you mean when you say that your topology "fails". Do you get error logs, does it crash with OOME or what do you mean? Also you should consider listing the components used by your topology (spouts, bolts, is it Trident?) as well as the Storm version.

    – Stig Rohde Døssing
    Nov 26 '18 at 18:37











  • What I mean by "fails" is that, after analyzing the Storm logs, I can see that one port stops processing tuples, which results in a huge number of failures at the spout. It just prints the metrics log output for hours, with no sign of tuple processing in the logs.

    – Milind Pardeshi
    Nov 27 '18 at 2:58











  • 2018-11-10 04:17:26.540 o.a.s.m.LoggingMetricsConsumer [INFO] 1541823446 wn0-stormp.t3rhv0qy3pnuzkwi2ieuucmbpf.ax.internal.cloudapp.net:6700 -1:__system memory/heap.initBytes [230686720]
    2018-11-10 04:17:26.540 o.a.s.m.LoggingMetricsConsumer [INFO] 1541823446 wn0-stormp.t3rhv0qy3pnuzkwi2ieuucmbpf.ax.internal.cloudapp.net:6700 -1:__system _receive.sojourn_time [9952.0]
    2018-11-10 04:17:26.540 o.a.s.m.LoggingMetricsConsumer [INFO] 1541823446 wn0-stormp.t3rhv0qy3pnuzkwi2ieuucmbpf.ax.internal.cloudapp.net:6700 -1:__system __receive.overflow

    – Milind Pardeshi
    Nov 27 '18 at 3:00











  • It's not a Trident topology; it contains one spout and eight bolts.

    – Milind Pardeshi
    Nov 27 '18 at 3:02
















apache apache-storm






asked Nov 26 '18 at 12:30









Milind Pardeshi





1 Answer

I'm familiar with at least one bug in Storm pre-1.0.0 that can cause workers to hang. If you aren't on the latest Storm version, try upgrading.



Other than that, your best bet for debugging this is to run jstack or kill -3 against the worker JVM to figure out what your hanging worker is doing. You may also want to enable debug logging if it doesn't harm your performance too much. You do this by calling config.setDebug(true); when setting up the topology.



Once you know why the worker isn't processing tuples, you can try posting the stack trace here; maybe there's an issue in Storm.
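
To give a rough idea of what a thread dump from jstack or kill -3 contains, here is a minimal sketch that produces a similar listing from inside a JVM using the JDK's java.lang.management API. It is not part of Storm and does not attach to a Storm worker; the class name ThreadDumpSketch and the method dumpAllThreads() are hypothetical names chosen for illustration. On a real hanging worker you would still run jstack against the worker's process ID.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;

// Hypothetical helper, not a Storm class: prints a jstack-like dump of this JVM.
public class ThreadDumpSketch {

    /** Returns the name, state and stack frames of every live thread in this JVM. */
    public static String dumpAllThreads() {
        ThreadMXBean threads = ManagementFactory.getThreadMXBean();
        StringBuilder out = new StringBuilder();
        // false, false: leave out monitor and synchronizer details to keep the sketch simple.
        for (ThreadInfo info : threads.dumpAllThreads(false, false)) {
            out.append('"').append(info.getThreadName()).append("\" ")
               .append(info.getThreadState()).append('\n');
            for (StackTraceElement frame : info.getStackTrace()) {
                out.append("    at ").append(frame).append('\n');
            }
            out.append('\n');
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.print(dumpAllThreads());
    }
}
```

In a dump taken from a stuck worker, the interesting entries are usually the threads that stay in the same BLOCKED or WAITING frames across several dumps taken a few seconds apart.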






answered Nov 27 '18 at 18:18

Stig Rohde Døssing
