Module Idea: TruffleHog #922
-
This is a great idea, however there are some challenges in implementing it. Right now, our github module searches GitHub for files containing the target domain and raises them to httpx, which pulls down the raw file and raises it as an HTTP_RESPONSE. By default this event type is not displayed, but it is distributed internally to modules such as excavate and secretsdb, which parse it for URLs, domain names, secrets, etc. Looting the target's entire GitHub is something I've wanted to do for a while, but the difficulty is that it's very hard to know for sure whether the GitHub org is truly owned by the company. This is the same difficulty we have with buckets. We sort of loot them as we see them rather than searching for them, because it's very easy to step out of scope and generate garbage data. So basically, we are searching GitHub for secrets etc. via the pipeline described above, rather than looting entire orgs.
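To make that flow concrete, here is a minimal, illustrative sketch of the kind of scanning that happens once an HTTP_RESPONSE body reaches a module like secretsdb. The regex set and function names are hypothetical, not BBOT's actual internals:

```python
import re

# Hypothetical signature set; a real module loads many more patterns.
SECRET_PATTERNS = {
    "aws_access_key_id": re.compile(r"AKIA[0-9A-Z]{16}"),
    "github_pat": re.compile(r"ghp_[A-Za-z0-9]{36}"),
    "slack_token": re.compile(r"xox[baprs]-[A-Za-z0-9-]{10,}"),
}

def scan_response_body(body: str):
    """Yield (signature_name, match) tuples for any secrets found in the body."""
    for name, pattern in SECRET_PATTERNS.items():
        for match in pattern.finditer(body):
            yield name, match.group(0)

# Example usage with a fake response body:
if __name__ == "__main__":
    body = "config = {'key': 'AKIAABCDEFGHIJKLMNOP'}"
    for name, secret in scan_response_body(body):
        print(f"[FINDING] {name}: {secret}")
```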
-
Hi @TheTechromancer, for either module a good way to validate would be: a list of potential organization names could be created using all of the discovered DNS_NAME events that are in scope.
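As a rough illustration of that idea (not an actual BBOT module), candidate organization names could be derived from in-scope DNS_NAME events by stripping subdomains and TLDs; the helper below is hypothetical:

```python
def candidate_org_names(dns_names):
    """Derive candidate GitHub organization names from in-scope DNS names.

    Naive approach: take the label just left of the TLD, e.g.
    "mail.corp.example.com" -> "example". A real implementation would use a
    public-suffix list rather than assuming a single-label TLD.
    """
    candidates = set()
    for dns_name in dns_names:
        labels = dns_name.lower().rstrip(".").split(".")
        if len(labels) >= 2:
            candidates.add(labels[-2])
    return sorted(candidates)

print(candidate_org_names(["www.example.com", "mail.corp.example.com", "example.org"]))
# -> ['example']
```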
-
Ohh interesting. I like that approach; try the API and if it returns any in-scope domains, we can assume it's in-scope. If we're doing it that way we can also try:
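A rough sketch of the "try the API" validation described above: query the GitHub org endpoint and check whether any in-scope domain shows up in the response. The field handling and approach here are assumptions, not a finalized design:

```python
import json
import requests  # assumed available

def org_references_scope(org_name: str, in_scope_domains: list[str]) -> bool:
    """Return True if the GitHub org's public metadata mentions an in-scope domain."""
    resp = requests.get(f"https://api.github.com/orgs/{org_name}", timeout=10)
    if resp.status_code != 200:
        return False
    # Serialize the whole response so fields like "blog" and "email" are covered
    # without hard-coding which keys to inspect.
    blob = json.dumps(resp.json()).lower()
    return any(domain.lower() in blob for domain in in_scope_domains)

# Example usage:
# org_references_scope("github", ["github.com"])  -> likely True
```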
-
@domwhewell-sage I have just noticed a bug in this area. Anyway, it's a small bug and an easy fix, but it highlights the need for some polishing here. Secrets-patterns-db was a cool idea but it's not being actively maintained. Rather than maintaining our own regexes, I would ideally like to have a weekly CI pipeline that aggregates the latest signatures from all the competing "secrets mining" tools, cleans/dedupes them, and publishes them in a JSON file for the BBOT module to consume. The only reason I haven't done this already is simply that I haven't found the time. I am not inherently opposed to implementing TruffleHog directly, but I think there's the potential for us to create something much more powerful if we can build it natively into BBOT's recursion.
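As a very rough sketch of what that weekly aggregation job could look like (the source URLs and rule format are placeholders, not an agreed design):

```python
import json
import re
import requests  # assumed available

# Placeholder sources; a real pipeline would pull rule files from the various
# secrets-scanning projects in whatever formats they publish.
RULE_SOURCES = [
    "https://example.invalid/tool-a/rules.json",
    "https://example.invalid/tool-b/rules.json",
]

def aggregate_rules(sources):
    """Fetch rule lists, keep only compilable regexes, and dedupe by pattern."""
    merged = {}
    for url in sources:
        for rule in requests.get(url, timeout=30).json():
            name, pattern = rule["name"], rule["regex"]
            try:
                re.compile(pattern)
            except re.error:
                continue  # drop broken patterns
            merged.setdefault(pattern, name)  # dedupe on the regex itself
    return [{"name": n, "regex": p} for p, n in merged.items()]

if __name__ == "__main__":
    with open("signatures.json", "w") as f:
        json.dump(aggregate_rules(RULE_SOURCES), f, indent=2)
```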
-
Ok that sounds good. By implementing TruffleHog we may lose other goodies that could be in the HTTP_RESPONSE events. Also, an interesting side note from Black Hat: at a GitHub Copilot talk they mentioned that they would be using AI to detect generic secrets https://docs.github.com/en/enterprise-cloud@latest/code-security/secret-scanning/enabling-ai-powered-generic-secret-detection
-
Migrating to discussion.
-
Not sure, but maybe this helps in any way:
-
If GitHub and Docker are being raised as …
-
TruffleHog could still be used to ingest these. I'm thinking of Docker here, as TruffleHog is good and fast at pulling secrets out of Docker image layers.
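For example, a module could shell out to TruffleHog's Docker scanner and ingest its JSON output. This is only a sketch; the exact flags and output format should be checked against the installed TruffleHog version:

```python
import json
import subprocess

def trufflehog_docker(image: str):
    """Scan a Docker image's layers with TruffleHog and yield parsed findings."""
    # Assumed CLI: `trufflehog docker --image=<image> --json`
    proc = subprocess.run(
        ["trufflehog", "docker", f"--image={image}", "--json"],
        capture_output=True, text=True,
    )
    for line in proc.stdout.splitlines():
        try:
            yield json.loads(line)  # one JSON object per finding
        except json.JSONDecodeError:
            continue

# Example usage (field names assumed from TruffleHog's JSON output):
# for finding in trufflehog_docker("registry.example.com/app:latest"):
#     print(finding.get("DetectorName"), finding.get("Raw"))
```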
-
Is there any way we could have a module that downloads the code repos and then gives them to TruffleHog offline?
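Something along those lines might work: clone each discovered repo locally and point TruffleHog's filesystem scanner at it. This is a sketch under the assumption that the `filesystem` subcommand and `--json` flag behave as described in TruffleHog's docs:

```python
import subprocess
import tempfile

def scan_repo_offline(repo_url: str) -> str:
    """Shallow-clone a repo and run TruffleHog against the local copy."""
    workdir = tempfile.mkdtemp(prefix="bbot_trufflehog_")
    subprocess.run(["git", "clone", "--depth=1", repo_url, workdir], check=True)
    result = subprocess.run(
        ["trufflehog", "filesystem", workdir, "--json"],
        capture_output=True, text=True,
    )
    return result.stdout  # JSON lines, one finding per line

# Example usage:
# print(scan_repo_offline("https://github.com/example-org/example-repo"))
```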
-
Closing as completed.
-
Description
I see BBOT discovers and reports GitHub repos, S3 buckets, Azure Storage, etc.
In my testing I haven't seen it scan the GitHub repos it discovers for secrets. TruffleHog has an organization flag which will enumerate all repos belonging to an organization and search them for secrets. It also has flags to include members and their repositories, and to include forks, which may be useful.
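For reference, this is roughly how a module might drive TruffleHog's organization scan from Python; the flag names reflect my understanding of the TruffleHog CLI and should be verified against the installed version:

```python
import subprocess

def trufflehog_scan_org(org: str, include_members: bool = True, include_forks: bool = True) -> str:
    """Enumerate an organization's repos with TruffleHog and return raw JSON-lines output."""
    cmd = ["trufflehog", "github", f"--org={org}", "--json"]
    if include_members:
        cmd.append("--include-members")
    if include_forks:
        cmd.append("--include-forks")
    result = subprocess.run(cmd, capture_output=True, text=True)
    return result.stdout

# Example usage:
# print(trufflehog_scan_org("example-org"))
```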