Redis instances shut down when scheduler restarted #56

eastlondoner · 2017-03-30T12:07:48Z

We have built mr-redis from the latest master and are running it on DC/OS (using zookeeper rather than etcd).

The basics work ok but when the scheduler is restarted the existing redis instances shut down and don't come back.

If you call the /STATUS endpoint it says that the redis instances are up - but looking in mesos they're not running any more

eastlondoner · 2017-03-30T13:23:07Z

It looks to me like the failover_timeout logic is not quite right in mesoslib.go

see here:
http://mesos.apache.org/documentation/latest/high-availability-framework-guide/

recommended settings are much greater than the 60 seconds that is set

I think the logic of using the failover timeout in GetFrameworkID is not correct:
e.g. if my scheduler has been up for longer than failover timeout and then restarts it shouldn't loose the old framework id (and all the running tasks).

eastlondoner · 2017-03-30T14:43:26Z

See this PR which fixes the behaviour when a scheduler is restarted:
#57

dhilipkumars · 2017-04-01T05:08:25Z

the PR looks good to me.

dhilipkumars · 2017-04-01T05:18:47Z

First of all thanks a lot for the contribution. Glad to hear that you are using mr-redis. I think mr-redis needs the leader-follower logic to be implemented so that more than one instance of this scheduler can be run at once for high-availability. Would you like to contribute that functionality?

dhilipkumars · 2017-04-01T05:38:44Z

@eastlondoner
How are you running it with DC/OS?
if you have re-packaged it would you be interested in contributing it to universe as version 01.

eastlondoner · 2017-04-03T15:40:12Z

Hi @dhilipkumars We're running it by installing the package from universe then going into Marathon and changing the docker image to point at out docker image: https://hub.docker.com/r/tractableio/mr-redis/

eastlondoner · 2017-04-03T15:42:06Z

We also had to change the docker client API version setting in mr-redis to match the version of Docker running on our Agents before we built that docker image.
You can see the code change on my fork. I've not issued a PR because I think there is a better way of doing it where it determines the docker api version from DOCKER_HOST env variable - but I've not had time to look into it.

eastlondoner · 2017-04-03T16:08:59Z

I guess I could push a new version to the universe, but I wouldn't want to push something that includes code changes that aren't in this (mainline) repo. Furthermore for the latest DC/OS I think that the docker API should be 1.25!

eastlondoner · 2017-04-03T16:10:18Z

n.b. this is the commit I am concerned about:
eastlondoner@10bdba0

daguero · 2018-02-26T20:51:35Z

@eastlondoner
Hello, I'm trying to access the image of docker https://hub.docker.com/r/tractableio/mr-redis/ but it is not accessible, could you give me some other option ???

Thank you

daguero · 2018-02-28T14:24:47Z

Hi @dhilipkumars I have the same problem that is discussed in this issue, I would like to access the docker image https://hub.docker.com/r/tractableio/mr-redis/ to do some tests.

Thank you

eastlondoner · 2018-03-01T09:59:46Z

@daguero I don't work at Tractable anymore and I recall I did some hacky things that I didn't want to publish to make it work.
However you should be able to build your own docker image that will work if you use my fork: https://github.com/eastlondoner/mr-redis

daguero · 2018-03-01T15:41:21Z

@eastlondoner OK, Thanks for your help, I'll prove it

dhilipkumars closed this as completed Apr 1, 2017

dhilipkumars reopened this Apr 1, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Redis instances shut down when scheduler restarted #56

Redis instances shut down when scheduler restarted #56

eastlondoner commented Mar 30, 2017

eastlondoner commented Mar 30, 2017

eastlondoner commented Mar 30, 2017

dhilipkumars commented Apr 1, 2017

dhilipkumars commented Apr 1, 2017

dhilipkumars commented Apr 1, 2017

eastlondoner commented Apr 3, 2017

eastlondoner commented Apr 3, 2017

eastlondoner commented Apr 3, 2017

eastlondoner commented Apr 3, 2017

daguero commented Feb 26, 2018

daguero commented Feb 28, 2018

eastlondoner commented Mar 1, 2018

daguero commented Mar 1, 2018

Redis instances shut down when scheduler restarted #56

Redis instances shut down when scheduler restarted #56

Comments

eastlondoner commented Mar 30, 2017

eastlondoner commented Mar 30, 2017

eastlondoner commented Mar 30, 2017

dhilipkumars commented Apr 1, 2017

dhilipkumars commented Apr 1, 2017

dhilipkumars commented Apr 1, 2017

eastlondoner commented Apr 3, 2017

eastlondoner commented Apr 3, 2017

eastlondoner commented Apr 3, 2017

eastlondoner commented Apr 3, 2017

daguero commented Feb 26, 2018

daguero commented Feb 28, 2018

eastlondoner commented Mar 1, 2018

daguero commented Mar 1, 2018