首页 \ 问答 \ 什么是“并行”的“任务”(What is the “task” in Storm parallelism)

什么是“并行”的“任务”(What is the “task” in Storm parallelism)

 我正在尝试学习叽叽t witter的风暴，遵循伟大的文章“ 了解风暴拓扑的并行性 ”  
 但是，我对“任务”的概念有些困惑。 任务是组件的运行实例（spout还是螺栓）？ 执行者有多个任务实际上是说执行者多次执行相同的组件，我是否正确？  
 此外，在一般的并行性意义上，Storm将为一个喷口或螺栓生成一个专用的线程（执行器），但是由具有多个任务的执行器（线程）对并行性的贡献是什么？ 我认为在一个线程中有多个任务，因为一个线程顺序执行，只能使线程成为一种“缓存”资源，避免为下一个任务运行产生新的线程。 我对么？  
 我可以在花更多的时间调查之后，自己清理这些混乱，但是你知道我们都喜欢stackoverflow ;-)  
 提前致谢。 

I'm trying to learn twitter storm by following the great article "Understanding the parallelism of a Storm topology" 
However I'm a bit confused by the concept of "task". Is a task an running instance of the component(spout or bolt) ? A executor having multiple tasks actually is saying the same component is executed for multiple times by the executor, am I correct ? 
Moreover in a general parallelism sense, Storm will spawn a dedicated thread(executor) for a spout or bolt, but what is contributed to the parallelism by an executor(thread) having multiple tasks ? I think having multiple tasks in a thread, since a thread executes sequentially, only make the thread a kind of "cached" resource, which avoids spawning new thread for next task run. Am I correct?  
I may clear those confusion by myself after taking more time to investigate, but you know, we both love stackoverflow ;-) 
Thanks in advance.

原文：https://stackoverflow.com/questions/17257448

更新时间：2023-07-17 08:07

最满意答案

 您只需将文本复制并粘贴到.txt文件中即可。  
 然后打电话：  
library(tidyverse)
a <- readLines("test.txt") %>%
  # Convert to df
  as.data.frame(stringsAsFactors = FALSE) %>%
  # Filter empty rows
  filter(nchar(.) != 0)
 
 科林 

You can simply copy and paste your text in a .txt file.  
Then call :  
library(tidyverse)
a <- readLines("test.txt") %>%
  # Convert to df
  as.data.frame(stringsAsFactors = FALSE) %>%
  # Filter empty rows
  filter(nchar(.) != 0)
 
Colin

相关问答

将非结构化数据（每行）放入新列中(Put unstructured data - each line - in new columns)[2023-04-25]

您只需将文本复制并粘贴到.txt文件中即可。然后打电话： library(tidyverse) a <- readLines("test.txt") %>% # Convert to df as.data.frame(stringsAsFactors = FALSE) %>% # Filter empty rows filter(nchar(.) != 0) 科林 You can simply copy and paste your text in a .txt file. Then c ...
如何将非结构化数据插入/附加到bigquery表(How to insert/append unstructured data to bigquery table)[2022-02-21]

您可以将自动检测与BigQuery加载API一起使用，即使用bq cli工具的示例如下所示： ~$ cat /tmp/x.json {"name":"xyz","mobile":"xxx","location":"abc"} {"name":"xyz","mobile":"xxx","age":"22"} ~$ bq load --autodetect --source_format=NEWLINE_DELIMITED_JSON tmp.x /tmp/x.json Upload complete. ~$ ...
Pig如何处理非结构化数据而Hive不能处理？(How does Pig handle unstructured data while Hive can't?)[2024-01-31]

Pig是为处理模式较少的数据集而构建的......在hive中我们强制执行一个存储在derby中的模式，或者可以配置为存储在mysql中。现在还不清楚你在寻找什么！ Pig is built to processes schema less data sets..whereas in hive we enforce a schema which is stored in derby or can be configured to store in mysql..Now it is not clear wha ...
如何阅读pandas中的非结构化csv(How to read unstructured csv in pandas)[2023-06-24]

预处理函数get_names()打开文件，检查分割行的最大长度。然后我读取第一行并从最大长度添加缺失值。第一行的最后一个值是) ，所以我用firstline[:-1]删除它，然后我按+1 rng = range(1, m - lenfirstline + 2)添加范围缺失列。 +2是因为范围从值1开始。然后你可以使用函数read_csv ，skipp第一行和名称使用get_names()输出。 import pandas as pd import csv #preprocessing def ge ...
db中的结构化数据与非结构化数据(structured vs. unstructured data in db)[2022-03-16]

我说如果你需要运行SQL查询来计算min / max / avg之类的东西，或者根据值执行排序，限制或连接，那么你应该创建100+列。这就是我要做的。您没有说明您使用的是哪个品牌的数据库，但大多数应该在表中支持100多列，而没有低效率的风险。请不要使用Entity-Attribute-Value反模式 - 一些人会建议的键/值设计。将任意键/值对的任意集合插入到这样的设计中是很好的和容易的，但是对于每个属性具有一列的传统表格中的任何简单查询，使用EAV设计变得极其困难且效率低下。您还失去了使用SQ ...
如何计算非结构化数据中第三列之前的字符数（包括空格）？(How to count number of characters(including spaces) before the third column in unstructured data?)[2022-03-16]

gregexpr是基本的R regex函数，它返回关于字符串中多个匹配位置的信息。要提取它们，您可以使用regmatches ，但由于gregexpr返回每个匹配的第一个字符的索引，因此您可以简单搜索非空格字符组\\S+ ，选择第三个索引，然后减去一个以获得数字前面的字符： sapply(gregexpr('\\S+', x), `[`, 3) - 1 #> [1] 30 34 31 gregexpr is the base R regex function that returns informati ...
什么是非结构化数据？(What is meant by unstructured data? In the field of using ETL tools to work on data?)[2022-05-24]

传统的EDI不是非结构化的。 EDI通常遵循特定定义数据结构的某种标准（X12，EDIFACT，TRADACOMS等）。 XML，CSV和分隔文件也是结构化的。他们有一个定义的字段分隔符和一个记录终止符。非结构化数据的一个示例是具有专有格式的多个数据的Excel文件。没有记录标识符，数据解析器将无法理解数据是什么。它会以数据/文本的形式出现，但不会有映射器需要翻译/集成的任何上下文。 Word文档或PDF也可以被视为“非结构化”。 Traditional EDI is NOT unstructure ...
请求非结构化表格(Request on unstructured table)[2022-04-05]

您计划的模式可以满足您的需求。 SELECT可以是： SELECT * FROM user U WHERE EXISTS ( SELECT 1 FROM user_has_metadata UHM WHERE U.id=UHM.user_id AND UHM.value = "Fr" ) AND EXISTS ( SELECT 1 FROM user_has_metadata UHM2 WHERE U.id=UH ...
T-SQL将非结构化XML转换为列(T- SQL transform unstructured XML into columns)[2023-08-28]

您可以在SQL Server中使用XPath来访问XML数据节点。这是一个使用您的数据的示例。 declare @test xml set @test = '1ProductDesc' SELECT @test.value('(/product/productID/node())[1]', 'nvarchar(max)') as productID, @test.value('(/pr ...
Cassandra和非结构化数据(Cassandra and unstructured data)[2023-11-16]

Cassandra充其量只能搜索半结构化数据。这也是通过使用群集密钥和二级索引。群集密钥绝对是搜索半结构化数据的有效方式。在不指定分区键的情况下搜索二级索引数据效率不高。有一些解决方案可以帮助DSE Search（Solr with Cassandr）和Stargate。如果其中一列是非结构化文本，这两种解决方案也可能有所帮助。否则，使用Cassandra进行非结构化数据并不是一个好主意，因为没有密钥可能无法搜索。 Cassandra can at best be searchable for ...

Storm【技术文档】-Worker Executor Task的关系

storm

Storm Topology的并发度

Storm入门

Storm-源码分析-Topology Submit-Task

Storm Config

Storm-源码分析- bolt (backtype.storm.task)

Storm编程概述

Hadoop lzo文件的并行map处理

Storm配置项详解

什么是“并行”的“任务”(What is the “task” in Storm parallelism)

最满意答案

相关问答

将非结构化数据（每行）放入新列中(Put unstructured data - each line - in new columns)[2023-04-25]

如何将非结构化数据插入/附加到bigquery表(How to insert/append unstructured data to bigquery table)[2022-02-21]

Pig如何处理非结构化数据而Hive不能处理？(How does Pig handle unstructured data while Hive can't?)[2024-01-31]

如何阅读pandas中的非结构化csv(How to read unstructured csv in pandas)[2023-06-24]

db中的结构化数据与非结构化数据(structured vs. unstructured data in db)[2022-03-16]

如何计算非结构化数据中第三列之前的字符数（包括空格）？(How to count number of characters(including spaces) before the third column in unstructured data?)[2022-03-16]

什么是非结构化数据？(What is meant by unstructured data? In the field of using ETL tools to work on data?)[2022-05-24]

请求非结构化表格(Request on unstructured table)[2022-04-05]

T-SQL将非结构化XML转换为列(T- SQL transform unstructured XML into columns)[2023-08-28]

Cassandra和非结构化数据(Cassandra and unstructured data)[2023-11-16]

相关文章

最新问答