#199: Big news for a very small Python runtime

Published Thu, Sep 17, 2020, recorded Wed, Sep 9, 2020

Sponsored by us! Support our work through:

via Matt Trentini
v1.13 is packed with features and bugfixes including solid asyncio support and tasty BLE improvements. Heck, we've even got the walrus operator.
a new implementation of the uasyncio module which aims to be more compatible with CPython's asyncio module.
The main change is to use a Task object for each coroutine, allowing more flexibility to queue tasks in various places, eg the main run loop, tasks waiting on events, locks or other tasks.
It no longer requires pre-allocating a fixed queue size for the main run loop.
Most code in this repository is now auto-formatted using uncrustify for C code and Black for Python code.
BlueKitchen BTstack bindings have been added for the ubluetooth module, as an optional alternative to the NimBLE stack. The unix port can now be built with BLE support using these bindings
Other Bluetooth additions include: new events for service/characteristic/ descriptor discovery complete; new events for read done and indicate acknowledgement; and support for active scanning in BLE.gap_scan().
PEP 526 has been (Walrus)
There has been an important bug fix when importing ARM machine code from an .mpy file: the system now correctly tracks the executable memory allocated to the machine code so this memory is not reclaimed by the garbage collector.
For testing, a multi-instance test runner has been added (see tests/run-multitests.py) which allows running a synchronised test across two or more MicroPython targets.
There are breaking changes
First release since Dec 19, 2019

Brian #2: respx: A utility for mocking out the Python HTTPX library

When using requests, you can mock it with responses.
When using httpx, mock with respx.

Quick start:

    import httpx
    import respx

    @respx.mock
    def test_something():
        request = respx.post("https://foo.bar/baz/", status_code=201)
        response = httpx.post("https://foo.bar/baz/")
        assert request.called
        assert response.status_code == 201

Documentation includes examples of using respx with both pytest and unittest, including how to set up mocked_api fixtures for pytest.
There’s call statistics you can assert on.
Ability to raise exceptions, return non-200 status codes, set custom return content.
Content can be generated in a callback method.
JSON content can be returned
Tons of nice options to help test your httpx based application.

Michael #3: GetPy - A Vectorized Python Dict/Set

The goal of GetPy is to provide the highest performance python dict/set that integrates into the python scientific ecosystem.
GetPy is a thin binding to the Parallel Hashmap (https://github.com/greg7mdp/parallel-hashmap.git) which is the current state of the art unordered map/set with minimal memory overhead and fast runtime speed.
The binding layer is supported by PyBind11 (https://github.com/pybind/pybind11.git)
The gp.Dict and gp.Set objects are designed to maintain a similar interface to the corresponding standard python objects.
Simple example:

    import getpy as gp

    key_type = np.dtype('u8')
    value_type = np.dtype('u8')

    keys = np.random.randint(1, 1000, size=10**2, dtype=key_type)
    values = np.random.randint(1, 1000, size=10**2, dtype=value_type)

    gp_dict = gp.Dict(key_type, value_type)
    gp_dict[keys] = values

Brian #4: isort and black now play nice together easily

Contributed by John Hagen
isort “sorts your imports, so you don’t have to”
black reformats all of your code to a consistent code style, including import statements
There is a config page on black documentation that shows how to set isort to be compatible with black. It also shows how to make flake8 and pylint play nice with black, but they are less complicated.
Now, however, with isort 5 introduction of built in profiles, you can just use isort --``profile black . and the profile sets everything for you.
There’s a profile page for isort that describes all that it does.
Other profiles include: django, pycharm, google, open_stack, plone, attrs, hug
And as always, you can configure your own with config files.

Michael #5: Scientists rename human genes to stop Microsoft Excel from misreading them as dates

Via Chris Moffitt
There are tens of thousands of genes in the human genome
Each gene is given a name and alphanumeric code, known as a symbol, which scientists use to coordinate research.
Over the past year or so, some 27 human genes have been renamed, all because Microsoft Excel kept misreading their symbols as dates.
Excel is regularly used by scientists to track their work and even conduct clinical trials.
But its default settings were designed with more mundane applications in mind, so when a user inputs a gene’s alphanumeric symbol into a spreadsheet, like MARCH1 — short for “Membrane Associated Ring-CH-Type Finger 1” — Excel converts that into a date: 1-Mar.
One study from 2016 examined genetic data shared alongside 3,597 published papers and found that roughly one-fifth had been affected by Excel errors.
See 12 of the Biggest Spreadsheet Fails in History for more examples: https://blogs.oracle.com/smb/10-of-the-costliest-spreadsheet-boo-boos-in-history
The scientific body in charge of standardizing the names of genes, the HUGO Gene Nomenclature Committee, published new guidelines for gene naming. From now on human genes and the proteins they expressed will be named with one eye on Excel’s auto-formatting.
Check out the Excel to Python course and webcast to escape this.

Brian #6: Never Run ‘python’ In Your Downloads Folder

by Glyph
This is really a nice, short tutorial on how sys.path is populated, why you should care, and why you need to make sure it’s only trusted locations.
“downloads” is definitely not trusted.
So never, ever, ever run python from the downloads directory, even with python -m something, as that adds the download dir to the include path.
Example includes a demonstration of malicious js code that downloads a fake pip.py to your downloads folder, so when you call python -m pip install ./legit_package.whl you get the fake pip.
Further examples show how you need to be vigilant to check your dot files for weird PYTHONPATH extensions and additions.

Extras:

Michael:

We recently passed 5,000,000 downloads of the audio files over at Python Bytes and are the 130th most popular tech podcast in the world. Thank you everyone!
Got a new LinkSys WiFi 6 mesh router, and wow, highly recommended.

Joke

Are you a real programmer? Check with XKCD to find out.

Episode Transcript

Collapse transcript

WebVTT format On GitHub

00:00 Hello and welcome to Python Bytes, where we deliver Python news and headlines directly to

00:04 your earbuds. This is episode 199, almost 200, Brian. Almost. Almost. Recorded September 9th,

00:12 2020. I'm Michael Kennedy. And I'm Brian Okken. And this episode is brought to you by us. We'll

00:17 tell you more about the things that we're doing and what we have to offer later. But for now,

00:22 I want to talk about not any huge announcement, but a very, very small announcement that it's

00:29 also kind of huge. Very cool. Yeah. So micro, you might say. You might say it's microscopic,

00:33 but it's a big deal. So MicroPython has just had a new release. And this is significant because the

00:39 last release from MicroPython was December 19th of last year. So it's been, you know, what is that?

00:47 Almost 10 months since the last release. So if you care about MicroPython, this is like a pretty big

00:51 deal. So version 113 is out. And one of the major things is it has improved asyncio,

00:58 async and await support. And it has cool Bluetooth improvements. And it even comes with the PEP 526,

01:06 that walrus operator. Oh, nice. Yeah. So there's a lot of stuff going on here. They basically,

01:11 MicroPython, as you can imagine, comes with a micro asyncio module. I'm guessing the U is really a MU.

01:19 U asyncio module that's supposed to be more compatible with CPython's asyncio. So the main idea is that the

01:28 task object, you can use a task object for each coroutine, which allows you to have like unbounded

01:34 queues of work and stuff like that. Whereas previously, I think you had to pre-allocate it for some reason.

01:39 I never tried to do asyncio on MicroPython, but apparently it was a little bit wonky. And so now it's more

01:45 consistent. Another thing that I thought was interesting about this release, Brian, was they went through and

01:52 auto-formatted all the code in the entire repository. And if you look at their repo, there is Python and MicroPython,

01:59 but a lot like CPython, it's a lot of C as well. Right? I think it's like 80% C or something.

02:05 Okay.

02:06 And so they auto-formatted the Python code with black, and that probably won't surprise too many people,

02:10 but they auto-formatted C with that community's equivalent of black, and get ready for the name,

02:18 Uncrustify. They uncrustified the C code.

02:22 Okay.

02:22 Do you know about this?

02:23 No, actually, I'm writing it down because I'd like to check this out.

02:26 Isn't that great? So you can basically configure how it formats your C code. And it's like black. It's

02:30 like, you know, it just runs automatically and fixes your C code. And boy, if you thought Python

02:35 code needed some fixing, I'll bet you the C code. And there's more variety in there, right?

02:40 Yeah. I've seen some ugly C code in my day.

02:43 Yeah, for sure. For sure. Because with the curly braces and semicolons, you have a lot more flexibility

02:48 in how you format stuff, whereas Python forces some of that structure upon you.

02:53 All right. They also added Blue Kitchen. Apparently Blue Kitchen is a Bluetooth stack. And so they added a

03:00 micro Mew Bluetooth module as an alternative to the Nimble stack as well. And there's a Unix port,

03:09 and you can build the Unix port with the Bluetooth support for either of these bindings. That's pretty

03:14 cool. That comes with new events. So if you're doing like Bluetooth discovery and things like that,

03:20 you can like scan for Bluetooth stuff more easily. It also came with a big memory leak fix for ARM

03:27 chips, which is big in the small space, ironically. So apparently, there was certain memory that was

03:35 not reclaimed by the garbage collector and other types of things. So if you've, you know, and at the

03:38 same time on these very small devices, you care a lot about not wasting memory, right? It's not like

03:43 you got 32 gigs like my laptop does. So like whatever, right? It's going to matter if you lose memory on

03:48 like a little micro Python chip. And also they set it up so you can run parallel tests against multiple

03:56 micro Python targets. So I'm guessing like different chips and different devices, you can just say go test

04:01 them, run it synchronized across two or more micro Python targets. I don't know if that's to speed it

04:07 up or to like, test the whole thing on different platforms in parallel or whatever. But it sounds like a

04:12 good improvement. And it does come with a few breaking changes. So people can check that out.

04:16 Anyway, if you care about micro Python, and I'm guessing this may make its way over to CircuitPython

04:22 as well at some point, knowing that those projects are really close. So anyway, this looks really good

04:27 if you're doing big stuff on small things.

04:29 Yeah, it's good to see this going forward.

04:31 And thanks to Matt Trentini for sharing this and bringing it to our attention.

04:35 I wish I was better at remembering names, but somebody I had on testing code brought up

04:40 respx and or respix. I should stop. R-E-S-P-X. It's a utility for mocking out the Python HTTPX library.

04:49 So that somebody brought that up and I had never heard of it before. So I figured we'd bring it up

04:54 here. So with requests, if you want to mock it, there's a whole library called responses and cleverly

05:02 named a bit hard to search for. And then with HTTPX, if you want to mock it, there is respx.

05:08 I'm going to name it response X, but we just the other part, you just don't write it.

05:12 It's like the opposite of a silent E. It's it's not written.

05:15 Maybe.

05:17 Sorry, keep going. It sounds like a great idea.

05:21 Well, interestingly enough, when you write the two imports out, they're the same number of characters.

05:25 Maybe that's part of the reason why they did that. Anyway, that quick start is like shows it's really quick.

05:32 You just it's a really just a few lines of code to mock out a request response call or response call.

05:39 You can do things like things that you would expect, like an assert that something was actually called

05:44 and what the status code was and things like that.

05:48 But you can also have custom content and you can return JSON content.

05:53 of course, there's some examples. What I like is the documentation is pretty nice.

05:57 There's some examples on how to use it with both pytest and UnitTest, including how to set up like a mocked API fixture for pytest.

06:05 You can have multiple tests using the same mocked endpoint.

06:09 And there's a bunch of nice things about it.

06:12 Like, for instance, if you want to mock out a whole bunch of different URLs,

06:16 you can use some regular expressions to set what URLs you want to catch with that.

06:22 and then also the content that you return from your mocked object, your mocked request is you can have a callback that can generate that on the fly based on what you pass in it.

06:35 If you have a HTTPX based application, you definitely need to know about this for testing.

06:40 Yeah, absolutely.

06:41 And one of the things that's challenging about testing the HTTPX is the async stuff, right?

06:47 And so this is a really cool way to get in there and just make it work, right?

06:52 Yeah.

06:52 Just put a decorator there and it just mocks it out for you like you would expect.

06:57 Looks very clean.

06:58 Yeah, pretty easy.

06:59 And there's a lot of magic going on under the hood to make that a clean interface, but I think they did a good job.

07:05 Yeah.

07:06 So HTTPX comes from sort of an async derivative of requests, which I think is a really fantastic library.

07:14 And I've done some cool stuff with HTTPX.

07:16 So yeah, I'm a fan of this.

07:18 It's great.

07:18 Now, before we move on, let's talk about a couple of things that we got going on that people can check out that will definitely help support the show.

07:27 And they might find interesting.

07:28 I know that you've got some updates for your book.

07:31 Is that right?

07:32 Yeah.

07:32 The Python testing with pytestBook, just a minor, the book itself has a minor update.

07:38 So there's, it wasn't a big enough change to change the hardback version, but the ebook will be updated.

07:44 I don't think the hardback is going to be updated, but it's a couple lines of code in a couple of code examples in chapter five.

07:53 So it's a very minor thing.

07:56 But the big difference is the code download.

07:59 So we've updated the code download.

08:01 So even if you're working with the hardback book or using the old version and don't update the ebook, please redownload the code.

08:09 It'll make your learning experience better.

08:11 And the main change is that I've pinned the dependencies in the target project or the example project so that everything works better.

08:20 Good.

08:20 There was a tiny DB is a database that I use in the project.

08:25 And it had some incompatible changes.

08:28 And instead of trying to update everything to use the new, the new tiny DB, I just pinned it to an old version.

08:35 Yeah, that makes sense.

08:35 I mean, it's not like you're trying to teach people tiny DB.

08:39 You're like, here's a dependency.

08:40 We're trying to get around to something.

08:42 Yeah.

08:42 So that's not important to the content of teaching people how to write tests.

08:46 Yeah, that's the problem of creating content.

08:49 The world moves on and they're not always compatible with what you did.

08:52 How about that?

08:53 Yeah.

08:53 Over at Talk Python, we've got a whole bunch of stuff coming.

08:57 And so what I'd like to encourage people to do is just to go to talkpython.fm or training.talkpython.fm.

09:04 And right there you can, at the front of the training site, you can just enter your email.

09:08 So if you get a bunch of announcements, because I believe we are working on five or six courses right now are under active development.

09:14 We've got all sorts of great stuff.

09:16 And rather going through it, just sign up to get notified when those come out.

09:18 Very cool.

09:19 Indeed, indeed.

09:20 So a while ago, I went on this Twitter journey, let's say.

09:26 I don't know how to really explain it.

09:27 It was like I posted a quick question that led to a ton of feedback.

09:32 And then, wow, so many pieces of information and ideas and variations were sent around.

09:39 What I was trying to say, like, I'd like a dictionary that contains objects that I can access, say, with different keys.

09:45 Like, I'd like to put a bunch of users and access them by ID, but maybe also by email or by city.

09:52 And that would come with a bunch.

09:52 So it was like this sort of exploration there.

09:55 And one of the recommendations that came over sort of around that is like, hey, you should check out this thing called GetPy.

10:02 Okay.

10:02 I don't know where the name comes from, but it doesn't tell a lot about what it is.

10:06 But it's a vectorized Python dictionary and set implementation.

10:10 And vectorized as in it matches up perfectly with NumPy and so pandas and all the things that are built upon NumPy to plug straight into them.

10:20 So imagine I want to have a dictionary that has data in NumPy, but lets me treat it like a regular Python dictionary or a set and things like that.

10:31 So that's what this is.

10:33 It basically brings a super high performance Python dictionary and set implementation that automatically integrates into the Python scientific ecosystem, which I think is pretty cool.

10:43 Yeah.

10:44 And it's built upon this thing called parallel hash map.

10:47 So parallel hash map is apparently the current state of the art unordered map set with minimal memory overhead and super fast runtime.

10:55 So like a C binding.

10:58 And so this is just a Python wrapper on top of the C library.

11:02 That's a super fast dictionary and set.

11:05 So that's pretty awesome.

11:06 And here's another one for you, Brian, for your C++ adventures.

11:11 The integration between getpy and parallel hash map is this thing called pybind11.

11:18 So pybind11 is a compatibility layer between C++11 and Python.

11:25 So if you want to write like modern C++ and then plug it in easily to Python, here you go.

11:32 Nice.

11:32 Yeah.

11:33 So that's pretty cool.

11:34 Okay.

11:34 There's two classes, GP, so getpy, gp.dict and gp.set.

11:39 And they're designed to be basically similar to the standard dictionaries and sets from Python.

11:44 But there's a few differences.

11:45 So check out the docs.

11:46 And then I threw in a quick little example here that has two numpy arrays.

11:51 And you can say, here are the keys, here are the values, put them together.

11:55 And then you can just access different values.

11:58 And it's also typed, which I think is kind of interesting, right?

12:01 Like it has like an unsigned 8-bit integer or something like that.

12:06 Much like numpy is.

12:08 But it's more like a Python array where you specify the numerical type than it is just an unbounded list.

12:16 Right.

12:17 So there's some interesting stuff going on here.

12:18 Does it solve your problem that you were looking for?

12:20 No, not at all.

12:21 But it's very interesting.

12:23 But it's still neat.

12:24 Yeah.

12:24 Yeah, it's still neat.

12:25 Speaking of neat, we already talked about black once, but it's pretty neat.

12:28 Let's talk about it again.

12:29 Yeah, this was sent in by John Hagen.

12:31 And he mentioned that, I mean, I'm pretty sure we've mentioned ISORT before.

12:36 I know we've mentioned black.

12:37 But a lot of projects use both.

12:40 So ISORT will sort your imports so you don't have to.

12:45 And then just so they're consistent and they're alphabetically sorted.

12:50 And then there's some other.

12:51 Right.

12:51 And there's PEP 8 recommendations about that, I believe, about like grouping stuff that comes from the library, from standard library versus externals and whatnot.

12:58 Yeah.

12:59 Yeah.

12:59 And so you don't have to think about it.

13:00 You can just use ISORT.

13:01 Black also is becoming more and more popular, I think.

13:05 Or it's just constantly very popular.

13:08 Reform, it's all of your code, but it includes the import statements as well.

13:12 So there was this issue that some projects wanted to use both ISORT and black.

13:17 And there is, and they kind of fight with each other right out of the box.

13:22 If you run ISORT, you'll have one answer and black will do something different in some cases.

13:28 So what do you do?

13:29 Well, there's black had a configuration page, and we'll link to this in the show notes, that has documents what settings you have to set for ISORT so that it's compatible with black.

13:40 They also do that for FLAKE 8 and PILINT, which is nice.

13:44 But FLAKE 8 and PILINT are not that complicated.

13:47 It's the ISORT that's, there's like six settings you have to change.

13:50 But ISORT 5 just came out.

13:53 And ISORT 5 has what they call profiles.

13:56 And so if you run ISORT with Profile Black, black is one of the built-in profiles, it will sort the imports such that they're compatible with black.

14:07 Nice.

14:08 So now if you use ISORT with the black profile and black, also they won't fight with each other anymore.

14:16 That's very cool.

14:18 Black also came out with some new changes.

14:21 So if you're interested in black, check those out.

14:24 They're not huge changes, but some minor fixes.

14:27 The profile feature is pretty cool.

14:29 And even if you don't care about black, I think you should check it out.

14:33 They do, apparently, Django, PyCharm, Google, OpenStack, Clone, Adders, and Hug are other profiles that are included.

14:41 And also, they're just good examples so that you can look at how different projects are configuring their ISORT.

14:47 You can configure yours if you want.

14:49 Yeah, I really like it.

14:50 That's super cool.

14:52 And it is nice to have the top of your files nice and clean and organized in some certain way.

14:59 It's interesting.

14:59 There's a lot of people do different linters or code reformatters.

15:04 But this combination of ISORT plus black is something that's becoming more and more common in a lot of projects.

15:11 Yeah.

15:12 Very cool.

15:13 So I have something interesting for you.

15:15 If you were to think of the influence of Microsoft Excel relative to the influence of, like, all of human genetics and the genome science of the world, and they got into a fight, who would win?

15:28 Like a superhero.

15:30 You seem like apples and oranges.

15:32 I have no idea.

15:33 Well, they did get into a fight, and Excel won.

15:35 So here's the story.

15:36 There's an article that was sent over by Chris Moffitt.

15:39 He's the guy who wrote the Moving from Excel to Python and Pandas course over at Talk Python.

15:44 So we did a bunch of research into, like, all these funny things and weird things around Excel.

15:48 So he sent this one over, and I thought I'd cover it because it's fun.

15:51 So on The Verge, there's an article that talks about how scientists have renamed human genes to stop Excel from misreading them as dates.

16:00 Okay.

16:02 Is that crazy?

16:03 Yeah.

16:04 Yeah.

16:04 So you think it was just, like, formatting and stuff like that.

16:08 So there's, like, tens of thousands of genes in the human genome, and each gene is given a name and a numerical code, and then you used to talk about it.

16:17 So, like, this one controls, like, what color of hair you have or whatever, right?

16:22 So over the past year or so, 27 human genes have been renamed all because Excel kept rereading them as symbols.

16:31 They're symbols as dates.

16:32 Okay.

16:32 Well, the examples are important.

16:34 Yeah.

16:35 So, like, March 1.

16:37 No spaces.

16:39 March 1 is one of the genes, right?

16:42 But it gets converted to 03 slash 01 slash the year, or if you're in Europe, 01 slash 03 slash the year, right?

16:50 Either way, that's not what you wanted.

16:52 So March 1 actually stands for membrane-associated ring CH-type finger 1, which, of course, is the first of March.

17:03 Now, it sounds funny, and, like, we're making fun of, like, ah, whatever.

17:06 There was a study in 2016 that examined the genetic data shared alongside 3,600 published papers and found that one-fifth of them, which, what is that?

17:20 That's, like, 700 papers were affected by Excel errors.

17:24 Oh, man, yeah.

17:25 Isn't that messed up?

17:26 So, there's a scientific body in charge of naming genes called Hugo Gene Nomenclature Committee.

17:33 Who knew?

17:34 There's a committee for naming genes, but apparently.

17:36 So, they published new guidelines for naming genes, and they said, you have to consider what happens if you type into Excel.

17:43 If it gets reformatted, that's not okay.

17:45 Got to pick a new name.

17:48 So, they put, like, weird suffixes and stuff on these things to make them work, like, March 1 is now, like, March 1 N1 or something like that.

17:55 Anyway, so, why do I bring this up on the Python show?

17:57 You know, so much of this work that people are doing there can totally be solved by Pandas and Jupyter Notebook and stuff.

18:06 And so, you know, the guy who was quoted in there was like, hey, I made these mistakes when I was in grad school because all I knew how to use was Excel.

18:14 So, here's a bit of an example along with a plea to help folks who are overusing Excel to take a step forward and use, you know, something like Pandas and Jupyter.

18:28 And, you know, you're going to be able to do a lot more cool processing anyway, which I think is great.

18:33 Yeah.

18:33 Also, one other thing, if you think that geneticists have an exclusive right to these mistakes, there's a really cool article, blog post over on Oracle's blog, actually, called The 10 Costliest Spreadsheet Boo-Boos in History.

18:50 And they're really hilarious.

18:52 And, like, thank God this didn't happen to me.

18:54 Well, and just, like, awesome stock photo that they have for this, too.

18:58 Yeah, it's so bad.

19:00 Yeah, it's, like, perfect.

19:01 It could just come out of Excel.

19:03 Yeah.

19:03 So, some of them are fairly mundane, but others, like MI5, the British Intelligence Agency, bugged over 1,000 wrong phone numbers because the 000 and the last three digits of the number got, you know, misstated.

19:20 Oh, no.

19:21 Yeah.

19:22 Or Eastman Kodak was forced to reinstate financial results for two quarters by from 2 million and 13 million respectively due to a spreadsheet mistake.

19:33 Yeah, there's just all these crazy examples of stuff just going wrong.

19:37 Like, oh, sorry, that was a billion dollar mistake because we were off by some random thing.

19:42 Anyway, there's a bunch of errors like this.

19:44 And it's really interesting to think how you might use the Python tools to not have these errors.

19:48 Yeah.

19:49 I think Excel has too much influence over the world.

19:52 It does.

19:52 But if you were going to use Python and you had a Jupyter Notebook, would you run it in your downloads folder?

19:57 No.

19:58 No, definitely not.

20:01 Yeah, so this is another just kind of an interesting and shocking, wherever you think about it before.

20:07 But Glyph has written an article called Never Run Python in Your Downloads Folder.

20:12 It's not just about that.

20:15 Okay, so there's your advice.

20:16 Don't run Python in your downloads folder.

20:18 But I think it's a good article to spread around and read because it's a nice short tutorial on how syspath works and how it's populated.

20:26 So Python has this thing called syspath that is, that's where it looks up.

20:32 So if you say import something or mostly that, where it's importing things, also the dash M or something.

20:39 If you say dash M and then some module, where does it find that?

20:44 And it finds it in lots of places.

20:45 One is the normal system include directory or the system packages.

20:51 But there's other places too.

20:53 And one of the places where Python uses is the location you're in when you run Python.

20:59 So a little example, he talks about a lot of people are now using, instead of running pip directly, they run python-m pip, which I also am including that in my advice usually, because I'm tired of trying to fix people's errors when their Python and their pip are pointing to two different places.

21:20 That's frustrating.

21:21 So what happens if you run python-m pip install something, some wheel in your downloads directory?

21:31 Well, normally, everything's fine.

21:34 Except if there's a malicious fake pip.py in your downloads directory that happens to have gotten there because somebody wrote some malicious JavaScript code, stick it there.

21:45 It's possible.

21:47 I don't know if it would happen, but it's possible.

21:49 It's more likely now.

21:50 Yeah, especially now.

21:52 Thanks, Cliff.

21:53 But then it's going to run that instead of the pip that you expect it to run, which is bothersome.

21:59 So this is hidden.

22:00 There's some extra advice in here.

22:02 I encourage everybody to read the whole article.

22:04 Understanding how the Python path variable works as well, because sometimes other applications will, and if the installer of an application can change things and change your Python path.

22:18 It shouldn't, but sometimes they'll do that out of convenience.

22:21 So occasionally look at your Python path and make sure that there's nothing weird in there, and maybe contact application developers or if they're doing something odd.

22:32 If you are mucking with Python path, the recommendation is put absolute paths.

22:36 Don't put anything relative in there.

22:38 You want to be able to have complete control over that.

22:41 The problem with the downloads folder is not that the downloads folder has weird permissions or higher permissions or anything of that nature.

22:49 It's just that web virus maliciousness might, it's most likely to drop the payload there, and the Python path plus that operation or that aspect is what is likely to lead to trouble.

23:03 Yeah.

23:03 Right.

23:04 Like you wouldn't say don't run it in your documents folder unless you copy virus Python files in there.

23:10 Yeah.

23:10 And the example was real of like sometimes, like let's say you're behind a firewall or something like that, and pip install just doesn't work, and you haven't figured out how to do proxies yet, and you really just need some package.

23:22 You might just download the wheel somewhere.

23:24 Okay.

23:25 You might have that, and the wheel might be fine, but put it somewhere else.

23:29 Don't leave it in your downloads directory.

23:31 Put it someplace else.

23:32 And please use virtual environments.

23:34 That'll help things as well.

23:35 Yeah.

23:36 Absolutely.

23:36 Good advice, and definitely another thing to be aware of.

23:40 That internet, it's a scary place.

23:42 Yes, but we love it.

23:45 It's also a good place.

23:46 It goes both ways, because we do love it.

23:47 That's it for our main topics, but I do want to share two quick things, and they both have to do with the internet.

23:52 Number one is we just passed 5 million downloads, Brian.

23:57 Wow.

23:57 That's awesome.

23:58 That's awesome.

23:58 Yeah, and we're in the top 150 or so of all tech podcasts in the world.

24:04 So I just want to say thank you, everyone, that's helped to make that happen, because that's kind of a milestone.

24:08 So that's really cool.

24:09 Yes, thank you.

24:10 Indeed.

24:10 Also, I recently finally broke down and got a Wi-Fi mesh router.

24:17 Routers?

24:18 Can you say singular router?

24:19 It's got to be plural, right?

24:20 So one of those routers where it has a bunch of different stations, and they all work together.

24:25 But because of the way it works, you don't actually have to connect to different ones at different places.

24:31 It's just all one Wi-Fi.

24:32 Okay.

24:33 And man, I got this Linksys Wi-Fi 6 mesh router, which is quite pricey, but man, it is so, so awesome.

24:42 So if people are suffering from being home all the time, they're like, gosh, I got to get better internet.

24:46 My internet was actually pretty good here, but we started getting a ton of radio interference from different sources, and it would degrade it and stuff.

24:54 So I have my recording area office above my garage.

24:58 It's like a little studio apartment thing we built above the garage, which is a separate building across from the house.

25:04 And over here, if I go speed test, I get 400 megabit measured off my Wi-Fi now.

25:10 That is solid.

25:11 Okay, so how far away are the mesh nodes?

25:14 It's probably 50 feet, but it goes through several outside walls, through a couple inside walls.

25:21 There's layers it's got to go through.

25:23 But now with the mesh, I could put one of the nodes much closer to the wall that I'm close to that spans that gap between houses.

25:30 Okay.

25:30 And how are they talking to each other?

25:32 They have their own channel, and they do some kind of back channel Wi-Fi 6 thing.

25:38 So anyway, super good recommendation.

25:40 Yes.

25:40 How about you?

25:41 Any extra stuff you want to share with the world?

25:43 No.

25:43 Just that I'm sort of sick of it and fires and all that sort of stuff.

25:48 Yeah, man.

25:49 It's looking a little smoky outside.

25:51 And yeah, the whole West Coast.

25:53 And my sympathies go out to the folks in Northern California.

25:55 I just had a meeting with someone, and oh my gosh, is it smoky there.

26:00 It looked like night in the daytime.

26:02 In the background, I could see their window, and it was dark, as if they were in a different time zone.

26:07 But they weren't.

26:08 Not good.

26:09 So hopefully that'll end soon.

26:12 But before we end this episode, I got a question, Brian, for you.

26:17 Are you a real programmer?

26:19 Like a real one?

26:20 Like a hardcore?

26:21 I mean, you do C++, so that puts you a little closer.

26:24 Oh my God.

26:24 Imposter syndrome flares every time I get that question.

26:28 So I don't think so.

26:30 Let's go to XKCD and answer it.

26:32 Okay?

26:32 Okay.

26:32 So XKCD has this cartoon called Real Programmers.

26:35 And it starts with two people debating about what kind of editor they're using, right?

26:42 It says there's somebody working.

26:44 They're using Nano, the editor.

26:46 And the person, come on, Nano?

26:48 For real?

26:49 Real programmers use Emacs.

26:51 And then Brian walks in.

26:53 Hey, real programmers use Vim.

26:55 Come on.

26:55 Oh yeah, well, real programmers use Ed.

26:58 No, no, real programmers use Cat and just stick it on the end.

27:01 Real programmers use a magnetized needle in a steady hand.

27:05 Excuse me, but real programmers use butterflies.

27:09 Everyone turns and looks at the person.

27:11 They open their hands and they let the delicate wings flap once.

27:15 The disturbance ripples outward, changing the flow of the eddy currents in the upper atmosphere,

27:20 which acts as a lens that deflects incoming cosmic rays, focus them to strike the derived platter

27:25 and flip the desired bit.

27:27 Nice.

27:28 Of course, there's an Emacs command to do that.

27:31 Oh yeah?

27:32 Yeah.

27:33 Command X, Command M, Command Butterfly.

27:39 Nice.

27:39 Yeah.

27:40 Yeah, yeah.

27:41 CXCMCM Butterfly.

27:45 Indeed.

27:45 Damn it, Emacs, they say.

27:47 Does anybody use Emacs?

27:50 Yeah.

27:50 Well, I don't know, but I'm not a real programmer because I don't use butterflies.

27:53 I'm sorry.

27:54 No, I don't use butterflies.

27:55 Okay.

27:56 Well, we're in the joke section.

27:57 I got to share a terrible joke that my wife came up with.

28:00 You wait.

28:01 Okay.

28:01 99 bottles of beer on the wall.

28:03 99 bottles of beer.

28:04 Take one down.

28:05 Pass it around.

28:06 Now everybody's infected.

28:08 Morons.

28:09 Hey, that sounds like college here in the US.

28:15 I think that's what they're doing.

28:16 I'm going to tell from all the news I'm watching.

28:18 Oh my gosh.

28:18 Jeez.

28:19 Sorry.

28:20 Yeah.

28:20 These jokes are the 2020 jokes.

28:23 Oh yeah.

28:23 So this won't make sense in the future.

28:25 Yeah.

28:26 Yeah.

28:26 So in the future, when you speak to a historian and you say, oh, you're a historian.

28:32 What do you study?

28:33 Oh, I actually study the year 2020.

28:37 Like that won't be enough.

28:38 You have to say, well, which part do you study?

28:40 Like the beginning?

28:42 Do you study the financial crash?

28:45 Do you study this crazy election?

28:47 Do you study the pandemic?

28:48 Like you got to be like 20, that year is not enough to say what your specialty is.

28:52 You got to pick one.

28:53 Pick a quarter.

28:54 Yeah.

28:55 It's a crazy time, right?

28:56 I study the flyers.

28:58 Oh boy.

28:59 Anyway.

28:59 Well, good talking with you as always.

29:02 You as well.

29:03 Catch you later.

29:03 Thank you for listening to Python Bytes.

29:05 Follow the show on Twitter via at Python Bytes.

29:07 That's Python Bytes as in B-Y-T-E-S.

29:10 And get the full show notes at pythonbytes.fm.

29:14 If you have a news item you want featured, just visit pythonbytes.fm and send it our way.

29:18 We're always on the lookout for sharing something cool.

29:20 On behalf of myself and Brian Okken, this is Michael Kennedy.

29:24 Thank you for listening and sharing this podcast with your friends and colleagues.

Want to go deeper? Check our projects

Course: Python for the Absolute Beginner course

Beginners

HTMX + Flask

FastAPI

pytest book

Full transcript