Error on initialization of server mk-worker (stormconf.ser is missing)
Moshe Bixenshpaner | 2012-08-10
Hi guys,
I have a simple topology that puts values on a Redis server.
When I deploy it to the cluster, no client manages to connect to Redis, although it works perfectly well when I run it via LocalCluster.
I attached logs and conf for the nimbus and the supervisors.
The cluster is configured as follows:
- 1GB RAM for the nimbus (192.168.1.22)
- 1GB RAM for the zookeeper1 (192.168.1.31)
- 2GB RAM for the supervisor1 (192.168.1.16; 4 workers)
- 2GB RAM for the supervisor2 (192.168.1.19; 2 workers)
All machines are virtual and have JDK 6u33 x64 installed.
nimbus, supervisor1 & supervisor2 have Storm 0.8.0, ZeroMQ 2.1.7 and the latest JZMQ installed.
zookeeper1 has Python 2.6.6 (with default configuration) and Zookeeper 3.3.6 installed.
I'm not sure this is the entire problem, but I'm getting the following exception on some of my supervisors (in our case, supervisor2):
2012-08-10 08:21:27 worker [ERROR] Error on initialization of server mk-worker
java.io.FileNotFoundException: File '/opt/storm/local/supervisor/stormdist/DistributedSystem-1-1344586762/stormconf.ser' does not exist
    at org.apache.commons.io.FileUtils.openInputStream(FileUtils.java:137)
    at org.apache.commons.io.FileUtils.readFileToByteArray(FileUtils.java:1135)
    at backtype.storm.config$read_supervisor_storm_conf.invoke(config.clj:138)
    at backtype.storm.daemon.worker$worker_data.invoke(worker.clj:146)
    at backtype.storm.daemon.worker$fn__4316$exec_fn__1206__auto____4317.invoke(worker.clj:331)
    at clojure.lang.AFn.applyToHelper(AFn.java:185)
    at clojure.lang.AFn.applyTo(AFn.java:151)
    at clojure.core$apply.invoke(core.clj:601)
    at backtype.storm.daemon.worker$fn__4316$mk_worker__4372.doInvoke(worker.clj:322)
    at clojure.lang.RestFn.invoke(RestFn.java:512)
    at backtype.storm.daemon.worker$_main.invoke(worker.clj:432)
    at clojure.lang.AFn.applyToHelper(AFn.java:172)
    at clojure.lang.AFn.applyTo(AFn.java:151)
    at backtype.storm.daemon.worker.main(Unknown Source)
2012-08-10 08:21:27 util [INFO] Halting process: ("Error on initialization")
The topology I'm trying to run requires 4 workers altogether.
So even if supervisor2 malfunctions, the other supervisor should be able to run the entire topology on its own.
Am I doing something wrong here?
Thanks,
Moshe.
Are your supervisors sharing a directory over a network mount, by any chance? What happens if you turn off supervisor2 completely? Do topologies launch successfully on the other supervisor?
Moshe Bixenshpaner | 2012-08-14
No, supervisors don't share directories.
They are virtual machines created with KVM, though (I'm not sure whether that has anything to do with the problem).
If I have enough workers on a single supervisor, everything works perfectly fine.
It seems the coordination between the supervisors is the cause of the problem.
Thanks,
Moshe.
The error you're facing indicates that the supervisor failed to download the configuration file from Nimbus. Can you show me the results of doing an ls -R on the supervisor local dir for the node that's getting that error? (Do it while the topology is active and causing the error; that is, don't shut it down and then do the ls -R.)
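For anyone who would rather script this check than read through ls -R output, here is a minimal Python sketch. It assumes the layout implied by the stack trace above (local dir /opt/storm/local, with one directory per topology under supervisor/stormdist); the function name find_missing_conf is made up for illustration and is not part of Storm.

```python
import os


def find_missing_conf(local_dir, required=("stormconf.ser",)):
    """Map each topology directory under stormdist to the required files it lacks.

    Only stormconf.ser is named in the error above; pass other file names
    in `required` if you want to check for them as well.
    """
    stormdist = os.path.join(local_dir, "supervisor", "stormdist")
    missing = {}
    if not os.path.isdir(stormdist):
        return missing
    for name in sorted(os.listdir(stormdist)):
        topo_dir = os.path.join(stormdist, name)
        if not os.path.isdir(topo_dir):
            continue
        absent = [f for f in required
                  if not os.path.isfile(os.path.join(topo_dir, f))]
        if absent:
            missing[name] = absent
    return missing


if __name__ == "__main__":
    # /opt/storm/local is the local dir seen in the stack trace; adjust as needed.
    for topo, files in find_missing_conf("/opt/storm/local").items():
        print(topo, "is missing", ", ".join(files))
```

As with the ls -R, it only tells you something useful if it is run on the failing node while the topology is still active.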
Moshe Bixenshpaner | 2012-08-14
Hi,
I attached the local directory and log files for nimbus and each of the supervisors.
sv2 is the supervisor that fails to load.
Thanks,
Attachments (6)
nb-local.tar (366 KB)
sv1-local.tar (187 KB)
sv2-local.tar (187 KB)
I would need you to do the ls -R while the error is happening and the topology is still active.
Moshe Bixenshpaner | 2012-08-15
This is exactly what I did (only I attached a tar file of the entire local directory, instead of just attaching the output from an ls -R).
I don't quite understand – you said you did the ls -R a few days after the exception happened.
Moshe Bixenshpaner | 2012-08-17
I deleted that post. The one I eventually posted was written after I reset everything, reproduced the whole problem, and attached the logs and the contents of the local directories.
The sv2 logs don't show any exceptions.
Moshe Bixenshpaner | 2012-08-26
Hi Nathan,
The log files of both SV2 workers show java.io.FileNotFoundException: File '/opt/storm/local/supervisor/stormdist/DistributedSystem-1-1344956702/stormconf.ser' does not exist, followed by Halting process: ("Error on initialization").
On another note, the ZK1 log shows that clients are disconnecting every few seconds.
Moshe Bixenshpaner | 2012-08-26
Hey guys,
Problem is solved.
There were actually two of them:
1. The documentation specifies which versions of ZeroMQ, JZMQ, Python and the JDK to use, but says nothing about ZooKeeper. I assumed I could use the newest version (3.3.6), but that turned out to be a bad move. After a week of poor performance, I checked the jars bundled with Storm 0.8.0 and saw that it targets ZooKeeper 3.3.3.
2. I'm not sure how it is with real clusters, but on a virtual cluster each node needs to be listed in the /etc/hosts file of every other node. Pay attention to the following form:
ip_address host_name.defaultdomain
Notice the .defaultdomain at the end of each host name. This is what actually solved the problem of getting a cluster of supervisors to work together simultaneously.
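To make the /etc/hosts form concrete, here is a small Python sketch that builds the expected lines for the four nodes in this thread and reports which ones are absent from a hosts file. The IP addresses come from the first post; the host names are assumed to match the node labels used there, and the helper names hosts_entries and missing_entries are hypothetical. The .defaultdomain suffix is simply what worked on this cluster, so substitute whatever domain your machines actually report.

```python
# IP addresses are from the first post; host names are assumed to equal the node labels.
NODES = {
    "nimbus": "192.168.1.22",
    "zookeeper1": "192.168.1.31",
    "supervisor1": "192.168.1.16",
    "supervisor2": "192.168.1.19",
}


def hosts_entries(nodes, domain="defaultdomain"):
    """Build 'ip_address host_name.domain' lines, one per node."""
    return ["%s %s.%s" % (ip, host, domain) for host, ip in nodes.items()]


def missing_entries(hosts_text, nodes, domain="defaultdomain"):
    """Return the expected lines whose (ip, fully qualified name) pair is not in hosts_text."""
    present = set()
    for line in hosts_text.splitlines():
        fields = line.split("#", 1)[0].split()  # drop trailing comments
        for name in fields[1:]:
            present.add((fields[0], name))
    return [
        "%s %s.%s" % (ip, host, domain)
        for host, ip in nodes.items()
        if (ip, "%s.%s" % (host, domain)) not in present
    ]


if __name__ == "__main__":
    # Print the lines that every node's /etc/hosts would be expected to contain.
    print("\n".join(hosts_entries(NODES)))
```

Passing the contents of a node's /etc/hosts to missing_entries, for example missing_entries(open("/etc/hosts").read(), NODES), lists the lines that still need to be added on that node.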