Dumb question: how do I know if an open source project is trustworthy?

BurningnnTree ( @BurningnnTree@lemmy.one ) · edit-2 1 month ago

Dumb question: how do I know if an open source project is trustworthy?

FizzyOrange ( @FizzyOrange@programming.dev ) · 1 month ago

Maybe all of the stars, forks, and discussions on the GitHub page are from fake accounts

All 9k stars, 10k PRs, 400 forks & professional web site are fake? Come on, this is about the most obviously not-fake project I’ve seen!

How do you know when a product like this can be trusted?

The same way you tell if anything can be trusted - you look at the signals and see if they are suss. In this case:

Lots of stars
Lots of real code in the repo
Professional looking website with commercial pricing
Lots of issues
Good English

The amount of effort it would take to fake this for very little benefit is enormous.

Maybe I’m just being paranoid.

Yeah just a little!

sus ( @sus@programming.dev ) · edit-2 1 month ago

All 9k stars, 10k PRs, 400 forks & professional web site are fake?

Technically, it is entirely possible to find a real existing project, make a carbon copy of its website (there are automated tools for this), then have a large number of bots give it 9k stars and create lots of PRs, issues and forks (bonus points if these are also copies of actual existing issues/PRs). You could also generate a fake commit history (this should be entirely possible with git), and a bunch of releases could be quickly generated too. You would probably notice pretty quickly that the timestamps don’t match, though, since I don’t think GitHub features like issues can have fake timestamps (unlike git).

Though I don’t think this has ever actually been done, there are services that claim to sell not only stars but issues, pull requests and forks too. Assuming the service is not just a scam in itself, any cursory look at the contents of the issues etc. would probably give away that they are AI generated.

FizzyOrange ( @FizzyOrange@programming.dev ) · 1 month ago

Yeah, possible, but think of the amount of effort that would take!

BurningnnTree ( @BurningnnTree@lemmy.one ) · 1 month ago

I agree it does look legitimate, I was just wondering what signs I should look out for in general. Like I’m sure fake GitHub engagement must be a thing, but I don’t know how widespread it is and I don’t know what the threshold is before a project can be considered definitely real. It sounds like you’re saying the level of engagement on this project is well beyond what can be considered sketchy, which is helpful information. Thanks

sus ( @sus@programming.dev ) · 1 month ago

For a large project, you can probably look at the history of issues: if there are lots of issues that are 5 years old, it’s almost certainly legit.
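A rough sketch of that check in Python, in case it helps. It assumes you have already collected the issue creation timestamps somehow (for example the `created_at` field that GitHub's REST API returns for issues). The function name `share_of_old_issues` and the 5-year default are made up for illustration, and a high share is only a hint, not proof.

```python
from datetime import datetime, timedelta, timezone

def share_of_old_issues(created_at, years=5, now=None):
    """Return the fraction of issues created more than `years` years ago.

    `created_at` is a list of ISO 8601 strings such as "2019-03-01T12:00:00Z".
    """
    if not created_at:
        return 0.0
    now = now or datetime.now(timezone.utc)
    # Approximate cutoff: ignores leap days, which is fine for a rough signal.
    cutoff = now - timedelta(days=365 * years)
    parsed = [datetime.fromisoformat(ts.replace("Z", "+00:00")) for ts in created_at]
    old = sum(1 for ts in parsed if ts < cutoff)
    return old / len(parsed)
```

A repo where nearly every issue was opened in the last few weeks would score close to zero here, which is the kind of mismatch worth a closer look.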

magic_lobster_party ( @magic_lobster_party@kbin.run ) · 1 month ago

There have been instances of popular and well-meaning projects being hijacked by hostile actors. A recent notable example is xz, but there’s also the event-stream npm package a few years ago that got infected with Bitcoin-stealing code.

Just because a project looks good now doesn’t mean it won’t turn bad in the future.

And not only would you need to audit the project, you would also need to audit all of its dependencies. The xz backdoor made it into SSH. Who would think about looking into xz for vulnerabilities?
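To make the "all of its dependencies" part concrete, here is a minimal Python sketch of walking a dependency graph to list everything a package pulls in, directly or indirectly. The graph format (a dict mapping a package name to its direct dependencies) and the package names in the usage note are hypothetical; in practice you would build the dict from your package manager's lock file or metadata.

```python
from collections import deque

def transitive_dependencies(package, direct_deps):
    """Return every package reachable from `package`, excluding itself.

    `direct_deps` maps a package name to a list of its direct dependencies.
    """
    seen = set()
    queue = deque([package])
    while queue:
        current = queue.popleft()
        for dep in direct_deps.get(current, []):
            # Skip anything already visited so cycles cannot loop forever.
            if dep not in seen and dep != package:
                seen.add(dep)
                queue.append(dep)
    return seen
```

For a made-up graph like `{"my-app": ["server-lib"], "server-lib": ["compression-lib"]}`, calling `transitive_dependencies("my-app", graph)` returns both `server-lib` and `compression-lib`, even though `my-app` never mentions the compression library itself.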

The amount of effort it would take to fake this for very little benefit is enormous.

The benefit of installing back doors can be enormous.

FizzyOrange ( @FizzyOrange@programming.dev ) · 1 month ago

A recent notable example is xz, but there’s also the event-stream npm package a few years ago that got infected with Bitcoin-stealing code.

They’re asking if the entire project is somehow fake, not if it’s a real project that got backdoored. That’s obviously impossible to tell just based on stars, language quality, and similar heuristic signals.

Lettuce eat lettuce ( @Lettuceeatlettuce@lemmy.ml ) · 1 month ago

First off, good on you for being careful. Ultimately, use the same methods that you would use when vetting anything else, like academic sources or candidates when hiring.

Check reputation via stars, active contributors, see what accounts are contributing and what other projects they also contribute to. Check their LinkedIn profile and personal websites.

See if you can confirm the project is being used safely by reputable groups. See if people, especially public people you trust are using/recommending it without being sponsored.

Check in private forums with other devs and users, see what people are saying. Check the code yourself, etc.

Ultimately, there’s no way to know 100%; even large companies and organizations have been duped in the past by backdoors or security bugs in OSS they use. You can be very confident, however. It’s all about how much investigation you are interested in doing.

And of course, don’t ever put all your eggs in one basket.

And if after lots of investigation, you still have a bad feeling in your gut, listen to that. Better to be a little too careful than to compromise yourself by ignoring that gut feeling that something just doesn’t pass the smell test.

haui ( @haui_lemmy@lemmy.giftedmc.com ) · 1 month ago

I think open source software has the huge advantage of being auditable. I suggest you and your team audit the entire codebase to see if anything is seriously wrong in there, or rely on other people doing it with you.

We actually don’t know how many backdoors are in proprietary software and we never will until all code is finally forced into the open as it should be.

nik9000 ( @nik9000@programming.dev ) · 1 month ago

The only surefire way is to read it all. And understand it all. That ain’t happening though. So you decide how much to do.

You should figure out how many people are landing patches and get a rough sense of why. Same for folks filing issues or talking about the project in general. Maybe you trust one of the contributors for some reason. Either way, you want to know how alive the project is.
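One way to get that rough count, sketched in Python. It assumes you feed it the text printed by `git log --format=%ae` (one author email per commit) from a local clone; the function names `patch_counts` and `top_author_share` are just made up for this example, and emails are an imperfect stand-in for people.

```python
from collections import Counter

def patch_counts(git_log_output):
    """Count commits per author from the output of `git log --format=%ae`."""
    authors = [
        line.strip().lower()
        for line in git_log_output.splitlines()
        if line.strip()  # ignore blank lines
    ]
    return Counter(authors)

def top_author_share(counts):
    """Fraction of all commits made by the single most active author."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return counts.most_common(1)[0][1] / total
```

Running something like `git log --since="1 year ago" --format=%ae` and passing the output in gives you the number of distinct authors (`len(counts)`) and how concentrated the work is. A single author doing nearly everything isn't bad by itself, but it tells you how alive the project would be without them.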

You could land a patch.

You could spot check parts of the code.

You could run vulnerability scanners on it.

I dunno. It’s hard.

RobotToaster ( @RobotToaster@mander.xyz ) · 1 month ago

Personally I wouldn’t trust it.

First red flag🚩: there’s an “enterprise” self hosted version.

Second red flag🚩: It isn’t open source, the licensing structure is confusing 🚩, but it appears to be at best some mix of source available🚩 and open core🚩 (core available?).

BurningnnTree ( @BurningnnTree@lemmy.one ) · 1 month ago

Can you explain why the enterprise version is a red flag? Would you expect the company to make money some other way?

RonSijm ( @RonSijm@programming.dev ) · 1 month ago

It’s not a big red flag, but it indicates that the product is not fully open source. You can get the full community edition from Github, but for the Self-hosted Enterprise version you have to contact sales.

So all the Enterprise features are most likely closed source, and when you buy/license it, you’ll just get the compiled version. And since their Cloud hosting model has a “Per 1,000 sessions/mo” model, their Enterprise self hosted model might have that as well. So it’ll have some kinda DRM/License managing, and maybe a “call home” to check your license or usage every once in a while

nik9000 ( @nik9000@programming.dev ) · 1 month ago

The point of the license combination they use is to allow the enterprise version to be open and live in the same repo as everything else. Dunno if that’s what they do, but that’s why the elastic license exists.

SorteKanin ( @SorteKanin@feddit.dk ) · 1 month ago

I don’t think it’s a dumb question.

Unfortunately I’m not sure there’s any guaranteed method to establish trustworthiness. It’s especially difficult because if there was, it would probably be easy for the scams to utilise and thus it would stop being a good method.

Anyways, I would say try to look at the people behind the software - do they have personal websites or do they work on other stuff that also seems reliable? What about the users, do they seem legitimate? Are the issues actual issues, not fake ones? Does the code seem maintained on a regular basis with non-trivial commits? Can you find online third party mentions that seem trustworthy?

That’s just what I could think of. But essentially, there is no silver bullet and you’ll just need to make a thorough assessment and decide if you trust it enough.

eveninghere ( @eveninghere@beehaw.org ) · 1 month ago

Basically the same as fake news. Check web articles and so on. (Reading source code is often infeasible.)

You can also check Linux package managers. Official repositories from, e.g., Red Hat and SUSE are well maintained by the companies. I’d trust also the official Arch repo. I guess Debian is trustworthy, too, but I don’t know the process there.

Regarding OpenReplay, you could also check the companies listed as using OpenReplay. (I couldn’t find any official source from those companies that mentioned OpenReplay, but that’s rather expected given that they don’t have to open their software stack.)

Kangie ( @Kangie@lemmy.srcfiles.zip ) · 1 month ago

I’d trust also the official Arch repo.

Yeah they’ve only rolled out a version of curl that broke the package manager a few times.

eveninghere ( @eveninghere@beehaw.org ) · 1 month ago

Oh, I agree with you there, but the topic was about something different.

key ( @key@lemmy.keychat.org ) · 1 month ago

You could hire a team of security experts to audit it for you

corsicanguppy ( @corsicanguppy@lemmy.ca ) · 1 month ago

1. look at code
2. if it looks good, build it and use it.
3. there is no #3

intensely_human ( @intensely_human@lemm.ee ) · 1 month ago

Read … the source code?