A project for studying and sharing simulated login methods for major websites

Time: 2020-11-22

Simulated login to major websites in Python, plus some crawler programs

  • For practice only. The code is commented in detail.

  • This project is for studying and sharing simulated login methods for major websites, along with crawler programs for some of them. Some logins go through selenium, some simulate the login directly by replaying captured requests, and some use scrapy. I hope it is helpful to beginners. The project will be updated continuously…

  • Yesterday I added a class to refactor and test some of the old code. Most of it works.

  • Stars are welcome

Simulated login to some common websites

  • If there is a website you find difficult to log in to, for example one that selenium + webdriver cannot handle, please open an issue
  1. requests
  2. selenium
  3. rsa
  4. phantomjs

Project address

Github

About

Simulated login is mostly done in one of two ways: logging in directly, or using selenium + webdriver. Some websites, such as Qzone (QQ space) and Bilibili (B station), are difficult to log in to directly. With selenium it is relatively easy.
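
As a rough illustration of the direct approach, the sketch below builds the kind of form POST that a packet capture of a login page typically reveals. It uses only the Python standard library rather than requests, so it runs without extra packages. The field names `username` and `password` and the helper names are hypothetical, not taken from this project. Real sites often need extra fields, tokens, or an encrypted password.

```python
import http.cookiejar
import urllib.parse
import urllib.request


def build_login_request(login_url, username, password):
    """Build the POST request a browser would send on form submission.

    The field names "username" and "password" are placeholders; the real
    names come from the captured request of the target site.
    """
    form = urllib.parse.urlencode({"username": username, "password": password})
    return urllib.request.Request(
        login_url,
        data=form.encode("utf-8"),
        headers={"User-Agent": "Mozilla/5.0"},
        method="POST",
    )


def make_opener():
    """Return an opener that remembers cookies set by the server, plus its jar."""
    jar = http.cookiejar.CookieJar()
    opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))
    return opener, jar
```

Calling `opener.open(build_login_request(...))` would send the request and leave any session cookies in `jar`, so later calls through the same opener stay logged in.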

Even when selenium is used to log in, we can stay efficient by keeping the cookies obtained after login and then using requests or scrapy to collect data. This keeps data collection fast.
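
The cookie hand-off described above could look roughly like the following sketch. It assumes the cookies arrive as a list of dicts with `name` and `value` keys, which is the shape selenium's `driver.get_cookies()` returns. The file name and the helper names are hypothetical.

```python
import json
from pathlib import Path


def save_cookies(cookies, path):
    """Write a list of cookie dicts (as returned by driver.get_cookies()) to JSON."""
    Path(path).write_text(json.dumps(cookies), encoding="utf-8")


def load_cookie_dict(path):
    """Read the saved cookies back as a plain {name: value} dict."""
    cookies = json.loads(Path(path).read_text(encoding="utf-8"))
    return {cookie["name"]: cookie["value"] for cookie in cookies}


def fetch_with_saved_cookies(url, cookie_path):
    """Request a page with the saved cookies instead of driving a browser."""
    import requests  # third-party; imported here so the helpers above need only the stdlib

    session = requests.Session()
    session.cookies.update(load_cookie_dict(cookie_path))
    return session.get(url, timeout=10)
```

After a selenium login, `save_cookies(driver.get_cookies(), "cookies.json")` stores the session, and `fetch_with_saved_cookies(url, "cookies.json")` reuses it without opening a browser. The plain dict drops domain and path details, which is acceptable when the cookies are used for a single site.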

Completed

Pull request tips

  • Pull requests are welcome

Problems

  • Some captchas have to be solved manually. I will try to fix this later.
  • Broken code: site policy or layout changes can cause the code to stop working. If that happens, please open an issue. If you have already fixed it, you can submit a PR. Thank you!

In addition

  • If the repo is helpful, please encourage it with a star

You are welcome to contribute

  1. After working on the project for a while, we found problems with the code style and with the usability, extensibility, and readability of the code. The most important task is therefore to refactor the code, so that it is easy to build small features of your own on top of it.
  2. If you think the login of a particular website is representative, you are welcome to propose it in an issue

Test

The BiliBili automatic login test passes, with a success rate of 98%

Web wechat

Picture crawler

Details

  • For details, please go to the project address

Finally

  • Experts, please go easy on me. I can't take it -_-

This work is released under a CC license. Reprints must credit the author and link to this article.
