
🐛 Bug Report: Failed to parse page JSON data: expected value at line 1 column 1 #301

Closed
1 task done
obvious-hugh-mann opened this issue Oct 30, 2024 · 40 comments · Fixed by #305
Labels
bug Something isn't working

Comments

@obvious-hugh-mann

obvious-hugh-mann commented Oct 30, 2024

Describe the bug

When attempting to access the site, it loads slowly and then shows an error.

Steps to reproduce the bug

Steps to reproduce the behavior:

  1. Enter the URL "redlib.privacyredirect.com" or "safereddit.com" or "lr.drgnz.club" in the URL bar, OR click on any link to one of those sites
  2. Wait for the page to load
  3. See error

What's the expected behavior?

The page should load correctly and show the same content as on Reddit.

Additional context / screenshot

Full text of the error when entering the bare URL: "Failed to parse page JSON data: expected value at line 1 column 1 | /r/popular/hot.json?&raw_json=1&geo_filter=GLOBAL"

Full text of the error when entering r/cats: "Failed to parse page JSON data: expected value at line 1 column 1 | /r/cats/hot.json?&raw_json=1"

Full text of the error when entering "r/cats/comments/qms1es/yall_i_did_not_realize_how_affectionate_and/" : Failed to parse page JSON data: expected value at line 1 column 1 | /r/cats/comments/qms1es/yall_i_did_not_realize_how_affectionate_and/.json?&raw_json=1

redlib.privacyredirect.com is running the latest commit, but the other 2 instances are not

  • I checked that the instance that this was reported on is running the latest git commit, or I can reproduce it locally on the latest git commit
@obvious-hugh-mann obvious-hugh-mann added the bug Something isn't working label Oct 30, 2024
@obvious-hugh-mann obvious-hugh-mann changed the title 🐛 Bug Report: Failed to parse page JSON data: expected value at line 1 column 1 | /r/popular/hot.json? 🐛 Bug Report: Failed to parse page JSON data: expected value at line 1 column 1 Oct 30, 2024
@np22-jpg
Contributor

Same issue here. My personal instance is running bc95308. That being said, this might just be a wave of IP bans by Reddit.

@AyoungDukie

For reference, folks will either want to reopen the original pinned issue or pin a newly opened one, to avoid a flurry of duplicates.

But yes, seems like a new method of attempting to filter/ban access.

@Handrail9

I'm getting this on a personal single-user instance.

@rc2dev

rc2dev commented Oct 30, 2024

I'm getting this on a personal single-user instance.

Same here.

Plus, I spun up a Libreddit instance and got the same error.

@Owl-Tec

Owl-Tec commented Oct 30, 2024

Same issue here on a personal instance. Reddit most likely changed something on their end again.

@r7l

r7l commented Oct 30, 2024

Same here.

@gigirassy

Yeah, my basic-auth instance at rl.blitzw.in gets the same error.

@vytskalt

It's clear that this is a global issue. I don't think we should be posting these "same here" comments as they're just causing useless notifications for others.

@HairyMilkshakes

Probably getting rate limited again.

@notpushkin

@HairyMilkshakes Don't think that's the case – rate limiting shouldn't affect single-user instances.

@NovaCyntax

Has nothing to do with rate limiting, happens every few months it seems. Generally a quick fix.

@toberoni

Restarting the Redlib Docker container lets me access Reddit for 1-2 minutes (repeatable). After that, the instance throws the error again.

@luutuyen2k9

I have the same problem when trying to visit libreddit.freedit.eu.

@arch-btw

arch-btw commented Oct 31, 2024

I haven't looked at the code, but I might see part of the issue; notice this part of the error message:

hot.json?&raw_json=

There's a ? immediately followed by an &, which isn't valid; it's supposed to be only one of those. The first parameter should start with a ?, and each subsequent parameter should start with an &, never both.

So that might be resulting in an empty field right now:

[screenshot: payload]

@sigaloid
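
As a rough illustration of the rule above (a hypothetical snippet, not Redlib's actual URL-building code), the first parameter is introduced by ? and every later one by &:

// Hypothetical sketch of the separator rule described above; not Redlib's code.
fn build_url(path: &str, params: &[(&str, &str)]) -> String {
	let mut url = String::from(path);
	for (i, (key, value)) in params.iter().enumerate() {
		// '?' before the first parameter, '&' before each subsequent one
		url.push(if i == 0 { '?' } else { '&' });
		url.push_str(key);
		url.push('=');
		url.push_str(value);
	}
	url
}

// build_url("/r/cats/hot.json", &[("raw_json", "1")]) yields "/r/cats/hot.json?raw_json=1"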

@jimmydoh

jimmydoh commented Oct 31, 2024

I haven't looked at the code, but I might see part of the issue; notice this part of the error message:

hot.json?&raw_json=

There's a ? immediately followed by an &, which isn't valid; it's supposed to be only one of those. The first parameter should start with a ?, and each subsequent parameter should start with an &, never both.

So that might be resulting in an empty field right now:

[screenshot: payload]

@sigaloid

While it is not very 'neat', most servers will deal with the empty parameter in the query string.

In testing it works fine on Reddit as well (you can test by browsing directly to Reddit with the full path from the error message - with or without the extra &, you get the same json returned, assuming your request is not blocked outright).

EDIT: That being said, if you wanted to detect traffic from a specific app and knew it had that 'quirk', you could probably identify those requests and then kill the sessions that sent them.

@e455a81e-d3ba-41a2-bc6d-7aafb1d9a5cd

I think it is quite interesting that restarting the container seems to help for a while. Is Redlib generating some data on startup, sent to the Reddit API, that could be used to block requests?

@pimlie
Contributor

pimlie commented Oct 31, 2024

@e455a81e-d3ba-41a2-bc6d-7aafb1d9a5cd See #229 (comment) from the last issue; looking at the commit log, cache poisoning could probably still be happening.

@e455a81e-d3ba-41a2-bc6d-7aafb1d9a5cd

Yes, that seems much more likely.

@dormieriancitizen

dormieriancitizen commented Oct 31, 2024

Oddly, at least for me, even without restarting the issue is inconsistent.

Uptime Kuma is reporting 40% uptime, with seemingly random failures over the night

I can access my instance now but it seems like it's breaking at random (not from ratelimiting, from the parse failure)

EDIT: could still be ratelimiting, but it's not a 429

EDIT2: 6 minutes later, down again

@davegallant

Gatus is telling me the endpoint was unhealthy for 1061 minutes with consistent ❌ [STATUS] (404) == 200 (parse errors).

After a reboot, it's been working fine for the past 60 minutes (with probes every 30s).

@sigaloid
Member

Going to take a deeper look at this later today if I can. On my radar as high priority though.

If I had to guess, it's something similar to last time, i.e. some server-side change that blocks the kind of request Redlib makes. If we can replicate one of those requests using curl and it works, then we know it's in the TLS stack like last time. If it doesn't, it's a more complicated fix.

@wuchyi
Contributor

wuchyi commented Oct 31, 2024

Not sure if it's the same issue, but my redlib error looks a bit different:

Couldn't send request to Reddit: Rate limit - try refreshing soon

Edit: Scratch this, updated to the latest docker release and it's now showing the same JSON error as others.

@pimlie
Contributor

pimlie commented Oct 31, 2024

Forcibly recreating the OAuth token (which was the solution last time) does not seem to work here. Cache poisoning does not seem to be the issue either: while testing that, it also fails when requesting a subreddit you have not visited yet.

As restarting Redlib still works, I'm looking into the connection pooling now. A lot of people reported (this time and before) that after a restart it worked for a minute or so, but then they started getting rate limited again. I'm quite sure this is not a minute but 90 seconds, which is the default connection pool idle timeout in hyper: https://docs.rs/hyper/0.14.31/hyper/client/struct.Builder.html#method.pool_idle_timeout

I can also reproduce that when I manually change the client config to:

client::Client::builder()
  .pool_idle_timeout(std::time::Duration::from_secs(10))
  .build(https)

With the above, I'm not rate limited as long as I keep requesting pages, but as soon as I don't request anything for 10s (or 5, or whatever), I'm rate limited. This seems counter-intuitive, though: my original thought was that re-using connections from the connection pool might be the issue, but given the timeout, it seems that creating new connections within the same pool is what causes issues. Haven't looked any further yet, but it might very well be an upstream issue again.

Note: a possible workaround for now, to avoid triggering this issue so often, could be to specify .pool_idle_timeout(None), though I'm not sure what the disadvantages of doing that are.
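
For reference, that workaround would look roughly like this (a sketch against hyper 0.14's builder API, mirroring the existing connector setup; untested, and the variable names are just illustrative):

// Sketch of the suggested workaround: never expire idle pooled connections,
// so the client does not have to open a fresh connection after a quiet period.
let https = hyper_rustls::HttpsConnectorBuilder::new()
	.with_native_roots()
	.https_only()
	.enable_http1()
	.build();
let client = client::Client::builder()
	.pool_idle_timeout(None) // default is 90s; None disables the idle timeout
	.build::<_, hyper::Body>(https);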

@sigaloid
Member

Even setting the pool max size to 1 and the timeout to none (meaning the one connection is kept open continuously) doesn't work, nor does setting the pool max size to zero, which should start a new connection every time.

@sigaloid
Member

sigaloid commented Oct 31, 2024

Narrowed down the issue and fixed it (in my testing so far). I just pushed efdf184, latest tag is released on quay.io/redlib/redlib. All, please test!

Fix info

I replaced every client call with generating an entirely new client. This slows down Redlib marginally (larger instances may notice it more), but now it works. This is an emergency patch to fix it temporarily; I won't close this issue just yet, as I want to get to the root of the problem.

I specifically tried a global static like below, replaced every CLIENT call with client::Client::builder().build(CONNECTOR), and it still broke.

pub static CONNECTOR: Lazy<HttpsConnector<HttpConnector>> = Lazy::new(|| {
	let https = hyper_rustls::HttpsConnectorBuilder::new().with_native_roots().https_only().enable_http1().build();
	https
});

This leads me to believe the issue lies with the HttpsConnectorBuilder and not with the client builder line (client::Client::builder().build(https)), because we only reuse the native-roots connector and it still fails. It seems like we need to rebuild the HttpsConnector every time.
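
In other words, the emergency patch boils down to something like the following (a sketch of the idea only; fresh_client is a hypothetical name, and the actual change is in efdf184):

use hyper::client::{self, HttpConnector};
use hyper::{Body, Client};
use hyper_rustls::HttpsConnector;

// Sketch: build a brand-new connector and client for every outgoing request,
// instead of reusing a global CLIENT.
fn fresh_client() -> Client<HttpsConnector<HttpConnector>, Body> {
	let https = hyper_rustls::HttpsConnectorBuilder::new()
		.with_native_roots()
		.https_only()
		.enable_http1()
		.build();
	client::Client::builder().build::<_, Body>(https)
}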

@sigaloid
Member

OK, more discoveries: when I revert to before the fix (still using the CLIENT global) and modify pool_max_idle_per_host to zero, so that any connection is killed once it's done, it always fails to retrieve the page.

pub static CLIENT: Lazy<Client<HttpsConnector<HttpConnector>>> = Lazy::new(|| {
	let https = hyper_rustls::HttpsConnectorBuilder::new().with_native_roots().https_only().enable_http1().build();
	client::Client::builder().pool_max_idle_per_host(0).build(https)
});

Given that restricting it to no kept-alive connections leads to permanent, guaranteed failure, one would assume that starting a brand-new connection is what causes it. Then why does the fix of creating a new pool every time work?

Perhaps it's because the new pool never allows two simultaneous open TCP connections...? Maybe if I set the timeout to zero...

	client::Client::builder().pool_idle_timeout(Duration::ZERO).build(https)

Works UNLESS one request is made while another is in-flight.

So the conclusion is that if two connections within the same pool are in flight, Reddit's CDN will block the second one. Why exactly the pool matters is unclear.

@matrox471

I had the issue. I updated my image and am now running 9aea9c9, and it seems to work so far. The deployment took a solid 4-5 minutes between the
Running Redlib v0.35.1 on [::]:8080!
message and actually being able to reach it, but other than that, it works.
The app seems ever so slightly less responsive, but nothing world-shattering.
Keep up the good work, mate! Cheers

@pimlie
Contributor

pimlie commented Oct 31, 2024

So the conclusion is that if two connections within the same pool are in flight, Reddit's CDN will block the second one. Why exactly the pool matters is unclear.

Could this be an HTTP/1-related issue? Maybe they strongly prefer (or have switched to) HTTP/2 multiplexing when they know the client should be capable of it?
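
If so, the client-side change might be as small as letting the connector negotiate HTTP/2 via ALPN, roughly like this (a guess at the shape of a fix, assuming hyper-rustls is built with its http2 feature; not necessarily the exact change that landed):

// Sketch: advertise h2 (with http/1.1 as a fallback) via ALPN so hyper can
// multiplex requests over a single HTTP/2 connection to Reddit.
let https = hyper_rustls::HttpsConnectorBuilder::new()
	.with_native_roots()
	.https_only()
	.enable_all_versions() // offer both h2 and http/1.1 instead of http/1.1 only
	.build();
let client = client::Client::builder().build::<_, hyper::Body>(https);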

@joelkoen

joelkoen commented Nov 1, 2024

Small reminder to consider donating if you appreciate sigaloid's work on this: https://liberapay.com/sigaloid

@sigaloid
Member

sigaloid commented Nov 1, 2024

Yep, seemed to be HTTP/2 changes. Thanks to everyone who reported info and tested the fixes I pushed, glad this has been fixed in a more permanent way :)

@Cyrix126

Cyrix126 commented Nov 1, 2024

Yep, seemed to be HTTP/2 changes. Thanks to everyone who reported info and tested the fixes I pushed, glad this has been fixed in a more permanent way :)

In https://github.com/redlib-org/redlib#binary it says to add this line to the nginx config:
proxy_http_version 1.1;
Is it still needed?

@lvxnull2

lvxnull2 commented Nov 1, 2024

Yes, Redlib still serves over HTTP/1.1; it only connects to Reddit with HTTP/2.

@ggtylerr

ggtylerr commented Nov 1, 2024

Hi there, I updated my instance to the latest build but it's still experiencing this problem: https://nyc1.lr.ggtyler.dev/

It's on the latest commit too, 2fd358f3eda1c25992c2a1c2d0e1bef2506627cb.

EDIT: Never mind, for some reason it just started working ~30 minutes after I started the container.

@kumitterer

Our instance is still having that issue, unfortunately. It is on the most recent commit, so the latest fix should be applied. Is there any way I can help debug this? https://redlib.private.coffee/info

@sigaloid
Member

sigaloid commented Nov 2, 2024

It could be an IP ban. Can you reproduce it on a different IP?

@tdtgit

tdtgit commented Nov 2, 2024

Thanks! That fixed the issue on my single-user instance :)

@kumitterer

It could be an IP ban. Can you reproduce it on a different IP?

I can. Tried routing through several tunnels, same result every time...

@kumitterer

Hmm, after the umpteenth IP rotation and restart, it seems to be working now. 🤔

@ggtylerr

ggtylerr commented Nov 8, 2024

Hi there, I updated my instance to the latest build but it's still experiencing this problem: https://nyc1.lr.ggtyler.dev/

It's on the latest commit too, 2fd358f3eda1c25992c2a1c2d0e1bef2506627cb.

EDIT: Never mind, for some reason it just started working ~30 minutes after I started the container.

Update: Over the past week we're still getting this, not only on NYC-1 but also on CAL-1. It seems likely that the rate limiting hasn't been resolved (especially given that @kumitterer had to rotate IPs).

@sigaloid
Member

@ggtylerr #318
