Geoffrey Frogeye
885d92dd77
Added LICENSE
3 years ago
Geoffrey Frogeye
8b7e538677
Updated links
(could not be bothered to guess them)
3 years ago
Geoffrey Frogeye
cd46b39756
Merge branch 'newworkflow'
3 years ago
Geoffrey Frogeye
38cf532854
Updated README
Split in two actually (program and list).
Closes #3
Also,
Closes #1
Because I forgot to do it earlier.
3 years ago
Geoffrey Frogeye
53b14c6ffa
Removed TODO placeholders in commands description
It's better than nothing but not by that much
4 years ago
Geoffrey Frogeye
c81be4825c
Automated tests
Very rudimentary but should do the trick
Closes #4
4 years ago
Geoffrey Frogeye
4a22054796
Added optional cache for faster IP matching
4 years ago
Geoffrey Frogeye
06b745890c
Added other first-party trackers
4 years ago
Geoffrey Frogeye
aca5023c3f
Fixed scripting around
4 years ago
Geoffrey Frogeye
dce35cb299
Harder verification before adding entries to DB
4 years ago
Geoffrey Frogeye
747fe46ad0
Script to automatically download from Rapid7 datasets
4 years ago
Geoffrey Frogeye
b43cb1725c
Autosave
Not needed, but since the import may take multiple hours I get frustrated
if it gets interrupted for some reason.
4 years ago
Geoffrey Frogeye
f5c60c482a
Merge branch 'master' of git.frogeye.fr:geoffrey/eulaurarien
4 years ago
Geoffrey Frogeye
12ecfa1a5d
Added outdated documentation warning in README
4 years ago
Geoffrey Frogeye
e882e09b37
Added outdated documentation warning in README
4 years ago
Geoffrey Frogeye
d65107f849
Save duplicates too
Maybe I won't publish them but this will help me for tracking trackers.
4 years ago
Geoffrey Frogeye
ea0855bd00
Forgot to push this little guy
Good thing I cleaned up my working directory.
It only exists because pickles created from database.py itself
won't be openable from a file simply importing database.py.
So we create it when in 'imported state'.
4 years ago
Geoffrey Frogeye
7851b038f5
Reworked rule export
4 years ago
Geoffrey Frogeye
8f6e01c857
Added first_party tracking
Well, tracking if a rule is from a first or a multi rule...
Hope I did not make any mistakes
4 years ago
Geoffrey Frogeye
c3bf102289
Made references work
4 years ago
Geoffrey Frogeye
03a4042238
Added level
Also fixed IP logic because this was real messed up
4 years ago
Geoffrey Frogeye
3197fa1663
Remove list usage for IpTreeNode
4 years ago
Geoffrey Frogeye
a0e68f0848
Reworked match and node system
For level, and first_party later
Next: add get_match to retrieve level of source and have correct levels
... am I going somewhere with all this?
4 years ago
Geoffrey Frogeye
aec8d3f8de
Reworked how paths work
Get those tuples out of my eyes
4 years ago
Geoffrey Frogeye
7af2074c7a
Small optimisation of feed_switch
4 years ago
Geoffrey Frogeye
45325782d2
Multi-processed parser
4 years ago
Geoffrey Frogeye
ce52897d30
Smol fixes
4 years ago
Geoffrey Frogeye
954b33b2a6
Slightly better Rapid7 parser
4 years ago
Geoffrey Frogeye
d976752797
Store Ip4Path as int instead of List[int]
4 years ago
Geoffrey Frogeye
4d966371b2
Workflow: SQL -> Tree
Welp. All that for this.
4 years ago
Geoffrey Frogeye
040ce4c14e
Typo in source
4 years ago
Geoffrey Frogeye
b50c01f740
Merge branch 'master' into newworkflow
4 years ago
Geoffrey Frogeye
ddceed3d25
Workflow: Can now import DnsMass output
Well, in a specific format but DnsMass nonetheless
4 years ago
Geoffrey Frogeye
189deeb559
Workflow: Multiprocess
Still trying.
It's better than multithread though.
Merge branch 'newworkflow' into newworkflow_threaded
4 years ago
Geoffrey Frogeye
d7c239a6f6
Workflow: Some modifications
4 years ago
Geoffrey Frogeye
5023b85d7c
Added intermediate representation for DNS datasets
It's just CSV.
The DNS records in the datasets are not ordered consistently,
so we need to parse them completely.
It seems that converting to an IR before sending data to ./feed_dns.py
through a pipe is faster than decoding the JSON in ./feed_dns.py.
This will also reduce the storage of the resolved subdomains by
about 15% (compressed).
4 years ago
Geoffrey Frogeye
269b8278b5
Workflow: Fixed rule counts
4 years ago
Geoffrey Frogeye
ab7ef609dd
Workflow: Various optimisations and fixes
I forgot to close this one earlier, so:
Closes #7
4 years ago
Geoffrey Frogeye
f3eedcba22
Updated now based on timestamp
Did I forget to add feed_asn.py a few commits ago?
Oh well...
4 years ago
Geoffrey Frogeye
8d94b80fd0
Integrated DNS resolving to workflow
Since the bigger datasets are only updated once a month,
this might help for quick updates.
4 years ago
Geoffrey Frogeye
231bb83667
Threaded feed_dns
Largely disappointing
4 years ago
Geoffrey Frogeye
9050a84670
Read-only mode
4 years ago
Geoffrey Frogeye
e19f666331
Workflow: Automatically import IP ranges from ASN
Closes #9
4 years ago
Geoffrey Frogeye
57416b6e2c
Workflow: OOP and individual tables per type
Mostly for performance reasons.
The first is to implement threading later.
The second is to speed up the binary search,
but it doesn't seem that much better so far.
4 years ago
Geoffrey Frogeye
b076fa6c34
Typo in new source URL
4 years ago
Geoffrey Frogeye
12dcafe606
Added alternate source of Eulerian CNAMES
It was requested.
It should be temporary: once I have a bigger subdomain list,
it shouldn't be required.
4 years ago
Geoffrey Frogeye
1484733a90
Workflow: Small tweaks
4 years ago
Geoffrey Frogeye
55877be891
IP parsing C accelerated, use bytes everywhere
4 years ago
Geoffrey Frogeye
7937496882
Workflow: Base for new one
While I'm automating this you'll need to download the A set from
https://opendata.rapid7.com/sonar.fdns_v2/ to the file a.json.gz.
4 years ago
Geoffrey Frogeye
62e6c9005b
Tracker: intendmedia?
4 years ago