I would like to log in to a website using requests/BeautifulSoup in Python 3. Here are the login and password fields:

    <input maxlength="100" size="30" id="loginName" name="loginName" type="text" title="E-mail ID:" value="">
    <input maxlength="32" size="30" id="password" name="password" title="Password:" type="password" value="">

Here is the submit button:

    <input value="Submit" name="button_okButton" type="submit" onclick="javascript:login(['Mode','Topic'],['login','100'])">

The code below loads the start page but does not log me in. Even when I comment out the final GET request, the next page still does not appear. I can get in and extract everything with Selenium/Chrome, but I would like something that runs in the background...

    import ssl
    import requests
    from lxml import html

    USERNAME = "myusername@domain.com"
    PASSWORD = "password"
    LOGIN_URL = "https://ehnsarmciratihs.extnet.ibm.com/services/security/cirats/index.jsp"
    URL = "https://ehnsarmciratihs.extnet.ibm.com/services/security/cirats/index.jsp"

    def main():
        ssl._create_default_https_context = ssl._create_unverified_context
        session_requests = requests.session()

        # Create payload
        payload = {
            "loginName": USERNAME,
            "password": PASSWORD,
            #"csrfmiddlewaretoken": authenticity_token
        }

        # Perform login
        result = session_requests.post(LOGIN_URL, data=payload, headers=dict(referer=LOGIN_URL))

        # Scrape url
        result = session_requests.get(URL, headers=dict(referer=URL), verify=False)
        print(result.text)

    if __name__ == '__main__':
        main()
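One possible reason the POST has no effect: the submit button's `onclick` handler calls `login(['Mode','Topic'],['login','100'])`, which may fill form fields named `Mode` and `Topic` before submitting, and the form may also carry hidden inputs or post to a different `action` URL. The sketch below is a guess under those assumptions, not a confirmed fix. It reads the login page, copies every named `<input>` of the first form, and adds the credentials plus the two inferred fields. `FormFieldParser`, `build_login_payload` and `login` are hypothetical helper names, and the `Mode`/`Topic` values are inferred only from the `onclick` attribute, so they should be checked against the real request in the browser's network tab.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class FormFieldParser(HTMLParser):
    """Collect the action URL and named <input> values of the first <form>."""

    def __init__(self):
        super().__init__()
        self.action = None
        self.fields = {}
        self._in_form = False
        self._done = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form" and not self._done:
            self._in_form = True
            self.action = attrs.get("action")
        elif tag == "input" and self._in_form and attrs.get("name"):
            # Keep the default value so hidden fields (tokens etc.) survive.
            self.fields[attrs["name"]] = attrs.get("value") or ""

    def handle_endtag(self, tag):
        if tag == "form" and self._in_form:
            self._in_form = False
            self._done = True  # ignore any later forms on the page


def build_login_payload(page_html, page_url, username, password):
    """Return (action_url, payload) for the login form found in page_html."""
    parser = FormFieldParser()
    parser.feed(page_html)

    payload = dict(parser.fields)
    payload["loginName"] = username
    payload["password"] = password
    # ASSUMPTION: the JavaScript login() call sets these two fields.
    payload["Mode"] = "login"
    payload["Topic"] = "100"

    # A missing or relative action resolves against the page's own URL.
    action_url = urljoin(page_url, parser.action or "")
    return action_url, payload


def login(session, login_url, username, password):
    """Fetch the login page, then POST the completed form with the same session.

    `session` is expected to behave like requests.Session (get/post methods).
    """
    page = session.get(login_url)
    action_url, payload = build_login_payload(
        page.text, page.url, username, password
    )
    return session.post(action_url, data=payload, headers={"Referer": login_url})
```

With the names from the question, this would be called as `login(requests.Session(), LOGIN_URL, USERNAME, PASSWORD)`, and later pages would be fetched through the same session object so the cookies carry over. If the form is built or submitted entirely by JavaScript, the page HTML will not contain the needed fields and this approach will not be enough on its own.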