-
Notifications
You must be signed in to change notification settings - Fork 539
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Localization jumps #112
Comments
Digging a little bit more: the TF map->odom is updated by setTransformFromPoses, but apparently if processor_type_ == PROCESS_LOCALIZATION then that method is never called. That is weird. Isn't localization about calculating the map->odom transform continuously? |
Looking here, you are correct, that was a mistake in my refactor & I'm not surprised I didn't catch it since it was all tested on simulation and bag data. Though I do test on a physical robot before I release large changes so I'm not sure how that was missed. How this should read is:
and then the Agree? |
Look good to you? |
Keep in mind |
I will try it |
It is much better. I think you should merge this changes. Anyway, how can I achieve a more aggressive localization (matching to map)? I know I should not expect "miracles" but it seems that big jumps in the localization are done only once or twice a minute. Which parameters are the most relevant? slam_toolbox-2019-10-17_21.00.42.mkv.tar.gz Attached video |
I have never seen something like that, I need to know more about your setup. What laser, is there enough CPU available, how did you generate the original map, how did you match against the original map, etc |
I have a very recent I7 and the software is compiled in release. Less than 50% of a single CPU is used (I have 8 CPUs). The laser is published a 10 Hz. I have another video where you can better appreciate that it is "supposed" to localize easily, but it does only after a huge time (more or less at 1:02 of the video) slam_toolbox-2019-10-17_21.20.19.mkv.tar.gz I don't know if it may help, but this is the update frequency of the map->odom trnasform |
Is this after the patch we just made? This is odd. If you can give me a mapping bag run and a bag run of this attempt at localization that is failing, I can take a look (and hopefully a video with those datasets of it not working on your end to compare) |
before the path, basically map->odom was NEVER updated, so we can't even compare. After the patch, map->odom is updated about once every 1.2 seconds and big jumps happen only once per minute. Let me prepare the rosbags. Thank you VERY much for your help. |
I forgot to mention something vital: I am using branch melodic in my computer... that has Kinetic!!! |
well that's just based on your settings / movement. Its not time based its angular distance and linear distance based like in AMCL. Sorry, to be clear, which version of this code is running? Both the branches compile with both versions, I'd actually be interested if you were running These videos... are they the beginning of localization or after some moving around? I assume you give it a good, legit, position when it starts up. If you're starting up with a poor match, then that might be part of this |
|
Mhm, good thought. Certainly wouldn't hurt. It's a little involved to change that but I'll file a separate ticket |
I'm going to rename this ticket to designate where the discussion has gone |
What happens if you start with a good match, using the If that works fine, then its probably just clearing the buffer and I can do that in an afternoon |
Ok, attached there is the relevant data: This first rosbag is the one I use to create the serialized map with online_sync.launch Note that I have to change the name of the base frame to base_link and subscribe to /front_laser/scan Afterward, I launch localization.launch using the serialized map.
Further, there is a video and the recorded rosbag slam_toolbox-2019-10-17_22.57.31.mkv.tar.gz https://drive.google.com/file/d/14MvnFYP5oLMBTmnPmuLSF9MVL5Djzaje/view?usp=sharing |
Awesome, I'll take a look tomorrow or this weekend. to summarize your thoughts, you think the issue is introduced because of the If its all surrounding initial pose, I'm inclined to agree with you and should be a medium difficulty fix I can hammer out in a day |
Once the robot is localized, there is no drift. But I would like to test how robust localization is to disturbances, this is the reason why I was "annoying" the robot myself from RViz. Let's put it this way: as a user, I expect that:
|
My guess is that it actually snaps back super quickly, its that you're "abusing" the intent of the 26 seconds seems suspect if it comes back, that probably indicates to me that's just when the rolling buffer from when you published a pose expires then. How fast are you moving? I think my default chain size is 10 so if you were slow moving I could see those numbers working out |
Keep in mind that in the first 40 seconds of the video, I do NOT use /initialpose , not even once.
The velocity of the robot is about 0.7 m/sec |
I'll take a look this weekend, this represents an annoying issue that needs to be fixed. I've done similar testing in my environments and I don't see this problem. What laser are you using? How good is your odometry? From the beginning of this video, you're very poorly localized. Is that the starting loading point or where you moving around a bit before that? Again, I'm trying to be really specific here because I'm not understanding exactly what you're doing and some of the information how I'm interpreting what you're saying is contradictory: Once this thing localizes to the map do you see these jumps or is it only when you initialize or artificially disturb it;. In all of my tests and for the use of localization, I assume you know very well where you are at the start. The example being that the robot lives at a dock or cached its last pose before being turned off. If you do not start well localized, that is something very different from the industrial usecase I'm supporting and I'd merge PRs to help but not on my plate to add (that's a global scan matching problem). But once its localized on startup from knowing the position, I've never seen jumps how you've shown on 3 different industrial robots with industrial grade lidars that I have access to There's probably something to resolve to fix what you're up to but I'm trying to understand very clearly what your test situation is prior to and during your experiments so I know where the issue(s) might be When you artificially disturb the graph by using With that said, fully resetting the state will help alot, but feeding bad global corrections (which are essentially taken like "the word of god") is going to mess up any graph based solution for a finite time horizon |
@SteveMacenski , first of all, thank you very much for your time.
The laser is obtained from a 3D Lidar, transformed using pointcloud_to_laserscan. The quality of the distances looks very good to me (low noise, long range).
Not in the last video and rosbags I sent to you. position is almost correct and rotation is off by maybe 10 degrees. other localization packages deal with this amount of error with no problem.
No. It takes an unreasonable amount of time in my opinion to eventually localize itself, but once it does, the localization is good.
I understand that you are not aiming to global localization, I really do. But I think that it is weird that it takes so much time /26 seconds!) to localize when a tiny initial error is present. I do industrial robots too, and errors of this magnitude may happen. As I said before, I probably need to understand if i can influence this behavior somehow tuning some parameters.
Thanks for the explanation, of course it makes perfect sense what you say. I am looking forward for issue #116 to be addressed. Said that, I am very grateful for your work and your prompt reply. Cheers |
No worries, I think you're the first non-me person to really dive into localization in a serious way so I appreciate the questions and your needs so we can make this work as well as possible for everyone. It's easy to miss more general peoples' desires and needs when its just me sitting in a dark dungeon alone making robots move. 3d lidar... ok then I assume the range is >>20m, but I'm only seeing a relatively short range. The walls also look a little hairy for a 3D lidar but I've never just taken the horizontal band and tried to do something with it in 2D mode. Probably isn't an issue just an observation. Though those usually have pretty low horizontal resolution so that might be a factor here to consider. What's the unit if you don't mind sharing (or just give me the horizontal spec). Odometry sounds good then. I see you made a map in the first place so that's probably OK. This is frame 1 that I have from your last video, it looks pretty off, but my correction factor step should take care of more of the fine tuning of it. I'm guessing the issue with that particular localization isn't actually the package as much as that region doesn't have alot of features to grab onto (that section of the map have that large gap of missing stuff, there's already some copies of that rounded object up top, there's some laser readings on the top right that dont match anywhere to the map). EI that starting location looks like the least-ideal place in the entire map that I can see to try to match against. I think what you see there is worst case situation.
OK good, so we're on a common starting point, this is also my experience. So we can work from here. So it looks like our problem is "how to localize faster" on initialization. Would you agree?
Eh, on the scheme of hours of operation 26s isnt much but I get your concern, we can work on this point. An option is we could, on initialization or
There are probably (definitely) parameters that can be tuned to make this faster. Now what they are is a long, long list and its a balancing act. All of the params in the Matcher Params section of the README are in play for that one. I'd first look at the
Glad I could make it clear -- a PF will act very differently than this. But once you know where you are and you don't "kick" the robot, you're as likely to delocalize as you are in SLAM mode, which given how I've modified the core library and tuned it, is really really unlikely. That's the huge benefit I derive from this work. In line with your "reset buffer" service, I think you'd see after "relocalization" than 26 seconds if the buffer size was smaller, so dynamically resizing that buffer would help. |
This is an outdoor map... so the hairy points are... bushes!
Yes, exactly! |
Ahhhhhhhhh I see this is a parking lot |
Now that I understand more about what is going on, I am tempted to close this issue... and open more focused one in the future ;) Nevertheless, I think you should keep in the back of your mind the fact that there might be a way to speed up the initial localization, may be being a little more aggressive with pose estimation when the circular buffer is still empty (just saying, I don't know if it make sense). |
OK, feel free! |
I'm going to pin this because it was a good discussion I think will yield value to others |
Quick followup - I haven't forgotten about this, I just haven't had the cycles yet to process your data and try to reproduce. My hope was this evening but I got caught up on IROS planning. My plan is to address tomorrow and into Wednesday. I'll probably have an update by Wednesday evening. If I'm blocking you, something you could try that would be helpful would be to see if you change the |
Assuming I downloaded all the right assets its my plan to play with this on the 14 hour flight to ROSCon. Hopefully we can meet up and I'll have some thoughts. |
Sorry, not attending ROSCon |
@facontidavide I'm having a very hard time trying to get your localization bag to play well. Not sure if you have some non-standard configuration or some changed some stuff in this package you didn't mention. I've found some things in this ROS1 package in melodic I've needed to change to get it to run so not sure if you made changes you hadn't mentioned. Can you give me the specific directions you used to run the localization bag file to localize? I'm failing to even do basic things after filtering out the maps/map->odom TF/etc to just the base data on this bag file. This is quite a little puzzle Edit: I think I'm finding something's off with my computer. I'll need to investigate this. |
Ah figured out why, you included |
OK @facontidavide, have some updates. It looks to me that this isn't probably an issue requiring new code to solve (beyond the clearing buffer on initial pose). I found that when I changed the buffer size and the loop match size to That particular location has alot of problems. There's large chunks of missing areas, the bushes are shaky with the laser scan so there's no good axes to pull from that. Also you only really have reliable information on one side but it represents <50% of all the information coming in from other distorted sources. I would then expect it would take awhile to converge, but I don't see the convergence relating to time or distance traveled, but rather related to getting out of that area. The 22 second time I count from motion starting to convergence is mostly just getting to an area where all the information is reliable. For example see below. In this case, I started the rosbag and only started the localization engine at t = 20 seconds, which is after we mostly leave the area of bad or no features. The time when you see convergence is exactly the same absolute time that starting from the beginning of the bag gives. Therefore the issue isn't related directly to the time or buffer sizes, but the fact that that area is just not a good position for positioning. This convergence time was 8 seconds, but again it relates more to having a few good sequential measurements without the poor information. You can also see I had the starting position very removed from the actual position, far more than you had in the bag file and we easily return back to the true position (faster and more reliably than I would have thought, actually). Finally, because I was curious about how extreme I could take this, I forwarded the bag now to around 45 seconds to do the same thing but then made the positioning of the robot exceptionally poor. In this case, we converged in about 6 seconds. I would only have expected that to work with a particle filter, but I guess by default expansion search region actually makes this work well enough by itself. The tl;dr - I'm updating a couple defaults, but I think the example situation you gave was not representative of an expected production map because of the massive missing sections of navigable space. Because of the bad starting point, incomplete map, and exceptionally poor data for most of the returns of the laser, it cannot accurately relocalize in that region. Once it leaves that region to an area sufficiently well mapped, it easily converges to a solution within a couple updates, even in the dataset you provide. The worst case I found was still < 10 seconds to convergence which is well in line with a particle filter. |
Thanks a lot for spending time investigating this! |
Hi,
I am trying slam_toolbox with a rosbag. I play it with the option --clock and setting use_sim_time to true.
The creation of the serialized map was ok, I used the command
As suggested in the README. I save the serialized map as "test_map". I double checked the
.ros
folder and it is created correctly.Things start getting more complicated in localization mode. I use
and the following parameters:
But apparently the localization is not working.
After a closer look at the data, it seems that the frame map->odom is published periodically, but it is always the same value.
Any clue?
The text was updated successfully, but these errors were encountered: