-
Notifications
You must be signed in to change notification settings - Fork 111
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Routing table error after long uptime or 3+ nodes #38
Comments
Yes both good ideas, the status has been discussed below Writing scrolling last-sent messages to the OLED has been mentioned on the mailing list. |
@samuk is correct, both of these issues have been on my mind. For what it is worth, the latest firmware (which I will release as 0.1.1 soon) includes a console interface accessed through serial that allows you to print the routing table by typing I'm planning on figuring out a way to display the routing table in the web app. |
@2E0PGS checkout new "Active Nodes" list feature mentioned in the related issue, #35 (comment) Should be working if you build (both the firmware and the web app) from latest, or I will be compiling a pre-built binary for 0.1.1 soon. No progress on utilizing the OLED screen yet. |
ok cool thanks! |
Sorry it took me a while to try v0.1.1 just flashed it on my two boards. Working great! I presume node |
It looks like there maybe a bug with the beacon length when the device is left on for a long time. 2020-01-25 17:20:312020-01-26 00:46:40I didn't change any settings in GQRX. The only changes I can think of maybe room temperature, laptop warming up, LoRa warming up, and HackRF warming up. Version 0.1.1 from the binary release. I had two nodes running there. Oddly enough unplugging and replugging didn't reset it. However back this morning with a cold room and cold devices (switch off over night) they're back to how it was in the first screenshot. |
I will try and replicate it by artificially heating up my board. Or leaving one on and one off and compare after hours. |
No sudden changes from artificially heating my SDR or my LoRa board. I am using TTGO. I will try leaving one running for now. Then I can turn the other on later and compare. |
The only code references I see are these two: https://github.com/search?q=org%3Asudomesh+beaconInterval&type=Code |
Or the glitch relates to the route message getting longer. Android phone on the WiFi slowing it down? |
Running one node on it's own for hours with no neighbours didn't have this behavior. This makes me think it's route message related. |
Interesting stuff, wonder if it's worth testing with the latest code? Realise not that much has changed, but might be worth verifying it's still an issue? |
Highly likely that there may be an unknown error with the routing message logic that only appears after a long uptime. My guess is that a byte gets shifted somewhere and starts filling the routing table with false routes. This would explain why it doesn't go away after only one node is restarted, because the node that was kept on immediately shares those false routes with the rebooted node. However, when both are rebooted, their routing tables are reset and their little network "forgets" about the false routes. Note: this is just my theory, I would need to do some actual testing and write some debugging code to demonstrate that this is happening. |
I don’t know if this is related, but I was seeing an issue where the
routing table exploded (dozens of new entries per minute) if I had 3+ nodes
running. I’ll try to reproduce it.
…On Mon, Jan 27, 2020 at 7:56 PM grant_____ ***@***.***> wrote:
Highly likely that there may be an unknown error with the routing message
logic that only appears after a long uptime. My guess is that a byte gets
shifted somewhere and starts filling the routing table with false routes.
This would explain why it doesn't go away after only one node is restarted,
because the node that was kept on immediately shares those false routes
with the rebooted node. However, when both are rebooted, their routing
tables are reset and their little network "forgets" about the false routes.
Note: this is just my theory, I would need to do some actual testing and
write some debugging code to demonstrate that this is happening.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#38?email_source=notifications&email_token=AAAEOEJDOBL7B4HIKGEEKG3Q76UGJA5CNFSM4KFYIP3KYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEKB56LA#issuecomment-579067692>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAAEOEODYW6YK4A6PRRXYLTQ76UGJANCNFSM4KFYIP3A>
.
|
Ref the hypothesis, this sounds about right to me. I suspect it's filling up and this causes a knock on effect of a longer TX length as the message is longer. |
Would you be up for trying to replicate your error with the latest routing? Hoping this bug has just gone away: #57 (comment) |
Yes, it would be good to test if this bug is resolved on the 1.0.0-rc.2 branch, which is using the latest updates to LoRaLayer2, which has switched to a more dynamic source routing (DSR) style and no longer requires that sharing of routing tables via routing table packets. |
Closing this issue and merging it with #81 since there is more activity on that thread and these seem closely (if not directly) related issues. |
Great firmware, I just successfully range tested two modules with impressive results on the 868MHz band.
A few improvement ideas
Cheers
The text was updated successfully, but these errors were encountered: