Hacker News new | past | comments | ask | show | jobs | submit | f311a's comments login

Yeah, these books are outdated. A lot have changed.


>BeautifulSoup

> Features: Excellent HTML/XML parser, easy web scraping interface, flexible navigation and search.

It does not feature any parser. It’s basically a wrapper over lxml.

>lxml

> Features: Very fast XML and HTML parser.

It’s fast, but there are alternatives that are literally 5x faster.

This article is just another rewrite of a basic introduction. It’s not a guide, since it does mot describe any issues that you face in practice.


Parsing HTML super-fast is very low on the list of priorities when web-scraping things. Yes, in practice.

Most of the time it won't even register on the scale, compared to the time spent sending/receiving requests and data.


Beautiful Soup comes with a "html.parser", and by default it doesn't not use or even install lxml.


lxml is written in Cython and is very efficient in my tests. Much faster than BeautifulSoup, which is pure Python.

What alternatives are 5x faster?


I'm sorry but BeautifulSoup is not just a wrapper over lxml.

lxml even has a module for using beautifulsoup's parser.

> lxml can make use of BeautifulSoup as a parser backend

https://lxml.de/elementsoup.html

> A very nice feature of BeautifulSoup is its excellent support for encoding detection which can provide better results for real-world HTML pages that do not (correctly) declare their encoding.


That’s from a single person most likely, who used sqlmap to test for sql injection. I haven’t seen internet wide attempts of testing sql injections.


Our WAF logs are fun reading. We see so much traffic from bots looking for PHP files and posting to inputs.


Yeah, so much noise. I enjoy screwing around with them on my free time, "imposing cost" by giving back unexpected things. I don't know if it actually does something, but I bet returning either a gzip-bomb or a 5 MiB really obscure (but valid) HTML file will crash quite a few scanners.

https://nitter.net/gnyman/status/1181652421841436672


Are you familiar with OpenBSD tarpitting?


Not specifically openBSD but the concept yes, I've played with it also

https://nyman.re/super-simple-ssh-tarpit/


It is called autophagy and happens regularly in our body, but in very small amount. Do experiece greater results, you need prolonged fasting (3-5 days)


It’s possible to minimize the loss by consuming large amounts of protein, but some amount of it will be converted to glucose unfortunately. The difference is very drastic, you can preserve a lot of muscle mass by doing proper diet vs IF. It all adds up when you so it for 6 months.


An old fashioned way of fasting preserves muscle mass way better due to hormonal response. It only kicks in after 30-40 hours of fasting. I don’t think that a lot if people practice 3-4 of fasting when doing IF.


Longer periods of fasting reduce testosterone, which probably negates whatever GH benefit it provides: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9182756/

And why would your body go out of its way to preserve metabolically more expensive tissue during longer periods of deprivation?


Any kind of caloric restriction lowers it temporarily. There is some evidence that testosterone levels can be even higher after the fasting https://josepheverettwil.substack.com/p/how-fasting-changes-...


Can someone archive it?


fyi you can archive it, too. Just go to archive.is, paste in the link, and hit save.

https://archive.is/p6FGg


Cloudflare does that


Cloudflare is a MITM you voluntarily setup yourself, innit?


You don't have to setup anything


I'm also curious about what kind of companies are still using it.


A lot of lab instrumentation companies across EU still use it. Also quite a few companies producing industry machinery, industrial displays etc.


This. It’s a great little system for such use cases. It is mainly falling out of favor for new development because it’s hard to find devs. But in terms of quickly getting automation equipment designed and running it’s a workhorse.


+ you can write both UI and low level code in one language


Might is the most important word here. The majority of papers about nutrition or supplements will have it in the conclusion.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: