diff --git a/README.md b/README.md index 5010829..972c971 100644 --- a/README.md +++ b/README.md @@ -34,30 +34,6 @@ uc.TARGET_VERSION = 84 driver = uc.Chrome() ``` -### datadome.co #### -These guys have actually a powerful product, and a link to this repo, which makes me wanna test their product. -Make sure you use a "clean" ip for this one. -``` -# STANDARD chromedriver -from selenium import webdriver -chrome = webdriver.Chrome() -chrome.get('https://datadome.co/customers-stories/toppreise-ends-web-scraping-and-content-theft-with-datadome/') -chrome.save_screenshot('datadome_regular_webdriver.png') -True -# after this detectioon, you'll keep being nagged with puzzles, even if you use another machine from the same same network (they use a very tight but effective regime, possibly combination of fingerprinting and ip-flagging). - - -# UNDETECTED chromedriver (headless,even) - -import undetected_chromedriver as uc -options = uc.ChromeOptions() -options.headless=True -options.add_argument('--headless') -chrome = uc.Chrome(options=options) -chrome.get('https://datadome.co/customers-stories/toppreise-ends-web-scraping-and-content-theft-with-datadome/') -chrome.save_screenshot('datadome_undetected_webddriver.png') -``` - #### patches selenium module #### Needs to be done before importing from selenium package @@ -91,6 +67,34 @@ driver.get('https://distilnetworks.com') ``` +### datadome.co #### +These guys have actually a powerful product, and a link to this repo, which makes me wanna test their product. +Make sure you use a "clean" ip for this one. +``` +# STANDARD chromedriver +from selenium import webdriver +chrome = webdriver.Chrome() +chrome.get('https://datadome.co/customers-stories/toppreise-ends-web-scraping-and-content-theft-with-datadome/') +chrome.save_screenshot('datadome_regular_webdriver.png') +True +# after this detectioon, you'll keep being nagged with puzzles, even if you use another machine from the same same network (they use a very tight but effective regime, possibly combination of fingerprinting and ip-flagging). + + +# UNDETECTED chromedriver (headless,even) + +import undetected_chromedriver as uc +options = uc.ChromeOptions() +options.headless=True +options.add_argument('--headless') +chrome = uc.Chrome(options=options) +chrome.get('https://datadome.co/customers-stories/toppreise-ends-web-scraping-and-content-theft-with-datadome/') +chrome.save_screenshot('datadome_undetected_webddriver.png') + +``` +## Check both saved screenhots [here](https://imgur.com/a/fEmqadP) ## + + + ### important note ### the default blank page on start plays a BIG role in the anti-detection workings of the module. You will only become undetectable from the moment you use driver.get(url) to navigate to some url (and next and next and next). This automatically means that if you enter a url in the browser screen by hand right after launch, you are NOT protected! New Tabs: same story. If you really need multi-tabs, then open the tab with the blank page (hint: url is `data:,` including comma, and yes, driver accepts it) and do your thing as usual. If you follow these "rules" (actually its default behaviour), then you will have a great time for now. @@ -108,3 +112,5 @@ In [5]: None # Undetectable! ``` ### end important note ### + +