Spark max amount of event time windows











up vote
0
down vote

favorite












Is there a limitation for Spark Event Time Streaming on the number of windows you can hold in parallel?



In general I would assume this depends on the available memory per executor and the amount of data you are using within the window, but there may be some metadata overhead in the master for holding references about the existence of all windows.



Does anyone know if there are some articles or official statements about that?



Lets assume I would like to create several million windows holding just small data like 120 rows. Would this cause a giant metadata overhead which might interfere with some limitations?










share|improve this question






















  • It should be easy to check - just create a dummy job and see how it behaves. In general I strongly suspect it won't work - very large number of jobs is not something that Spark is good at. It also raises a question - what use case would justify such thing?
    – user10465355
    2 days ago















up vote
0
down vote

favorite












Is there a limitation for Spark Event Time Streaming on the number of windows you can hold in parallel?



In general I would assume this depends on the available memory per executor and the amount of data you are using within the window, but there may be some metadata overhead in the master for holding references about the existence of all windows.



Does anyone know if there are some articles or official statements about that?



Lets assume I would like to create several million windows holding just small data like 120 rows. Would this cause a giant metadata overhead which might interfere with some limitations?










share|improve this question






















  • It should be easy to check - just create a dummy job and see how it behaves. In general I strongly suspect it won't work - very large number of jobs is not something that Spark is good at. It also raises a question - what use case would justify such thing?
    – user10465355
    2 days ago













up vote
0
down vote

favorite









up vote
0
down vote

favorite











Is there a limitation for Spark Event Time Streaming on the number of windows you can hold in parallel?



In general I would assume this depends on the available memory per executor and the amount of data you are using within the window, but there may be some metadata overhead in the master for holding references about the existence of all windows.



Does anyone know if there are some articles or official statements about that?



Lets assume I would like to create several million windows holding just small data like 120 rows. Would this cause a giant metadata overhead which might interfere with some limitations?










share|improve this question













Is there a limitation for Spark Event Time Streaming on the number of windows you can hold in parallel?



In general I would assume this depends on the available memory per executor and the amount of data you are using within the window, but there may be some metadata overhead in the master for holding references about the existence of all windows.



Does anyone know if there are some articles or official statements about that?



Lets assume I would like to create several million windows holding just small data like 120 rows. Would this cause a giant metadata overhead which might interfere with some limitations?







apache-spark bigdata spark-streaming iot






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked 2 days ago









AlexL

4071417




4071417












  • It should be easy to check - just create a dummy job and see how it behaves. In general I strongly suspect it won't work - very large number of jobs is not something that Spark is good at. It also raises a question - what use case would justify such thing?
    – user10465355
    2 days ago


















  • It should be easy to check - just create a dummy job and see how it behaves. In general I strongly suspect it won't work - very large number of jobs is not something that Spark is good at. It also raises a question - what use case would justify such thing?
    – user10465355
    2 days ago
















It should be easy to check - just create a dummy job and see how it behaves. In general I strongly suspect it won't work - very large number of jobs is not something that Spark is good at. It also raises a question - what use case would justify such thing?
– user10465355
2 days ago




It should be easy to check - just create a dummy job and see how it behaves. In general I strongly suspect it won't work - very large number of jobs is not something that Spark is good at. It also raises a question - what use case would justify such thing?
– user10465355
2 days ago

















active

oldest

votes











Your Answer






StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














 

draft saved


draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53350244%2fspark-max-amount-of-event-time-windows%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown






























active

oldest

votes













active

oldest

votes









active

oldest

votes






active

oldest

votes
















 

draft saved


draft discarded



















































 


draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53350244%2fspark-max-amount-of-event-time-windows%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Costa Masnaga

Fotorealismo

Sidney Franklin