#153: Auto format my Python please!

Published Wed, Oct 23, 2019, recorded Wed, Oct 16, 2019

Sponsored by DigitalOcean: pythonbytes.fm/digitalocean

Michael #1: Building a Python C Extension Module

Tutorial, learn to use the Python API to write Python C extension modules.
And
- Invoke C functions from within Python
- Pass arguments from Python to C and parse them accordingly
- Raise exceptions from C code and create custom Python exceptions in C
- Define global constants in C and make them accessible in Python
- Test, package, and distribute your Python C extension module
Extending Your Python Program
- there may be other lesser-used system calls that are only accessible through C

Steps: Writing a Python Interface in C

Figure out the arguments (e.g. int fputs(const char *, FILE *) )

Implement in C:

    #include Python.h
    static PyObject *method_fputs(PyObject *self, PyObject *args) {
        char *str, *filename = NULL;
        int bytes_copied = -1;
        /* Parse arguments */
        if(!PyArg_ParseTuple(args, "ss", &str, &filename)) {
            return NULL;
        }
        FILE *fp = fopen(filename, "w");
        bytes_copied = fputs(str, fp);
        fclose(fp);
        return PyLong_FromLong(bytes_copied);
    }

In line 2, you declare the argument types you wish to receive from your Python code
line 6, then you’ll see that PyArg_ParseTuple() copies into the char*’s
Write regular C code (fopen, fputs)
Return: PyLong_FromLong() creates a PyLongObject, which represents an integer object in Python.
a few extra functions that are necessary
- write definitions of your module and the methods it contains
- Before you can import your new module, you first need to build it. You can do this by using the Python package distutils.

Brian #2: What’s New in Python 3.8 - docs.python.org

We’ve already talked about the big hitters:

assignment expressions, (the walrus operator)
positional only parameters, (the / in the param list)
f-strings support = for self-documenting expressions and debugging

There are a few more goodies I wanted to quickly mention:

More async: python -m asyncio launches a native async REPL
More helpful warnings and messages when
- using is and is not to compare strings and integers and other types intended to be compared with == and !=
- Missing the comma in stuff like [(1,2) (3,4)]. Happens all the time with parametrized testing
you can do iterable unpacking in a yield or return statement
- x = (1, 2, 3)
- a, *b = x
- return a, *b <- this used to be a syntax error
  - you had to do return (a, *b)
New module importlib.metadata lets you access things like version numbers or dependent library required version numbers, and cool stuff like that.
- quite a few more goodies. I run through all my favorites on testandcode.com/91

Michael #3: UK National Cyber Security Centre (NCSC) is warning developers of the risks of sticking with Python 2.7, particularly for library writers

NCSC likens companies continuing to use Python 2 past its EOL to tempting another WannaCry or Equifax incident.
- Equifax details: a vulnerability, dubbed CVE-2017-5638, was discovered in Apache Struts, an open source development framework for creating enterprise Java applications that Equifax, along with thousands of other websites, uses…
Quote: "If you're still using 2.x, it's time to port your code to Python 3," the NCSC said. "If you continue to use unsupported modules, you are risking the security of your organisation and data, as vulnerabilities will sooner or later appear which nobody is fixing."
Moreover: "If you maintain a library that other developers depend on, you may be preventing them from updating to 3," the agency added. "By holding other developers back, you are indirectly and likely unintentionally increasing the security risks of others.”
"If migrating your code base to Python 3 is not possible, another option is to pay a commercial company to support Python 2 for you," the NCSC said.
NCSC: If you don't migrate, you should expect security incidents
Python's popularity makes updating code imperative: The reason the NCSC is warning companies about Python 2's impending EOL is because of the language's success.

Brian #4: Pythonic News

Sebastian A. Steins
“A Hacker News lookalike written in Python/Django”
“ powering https://news.python.sc"
Cool that it’s open source, and on github
Was submitted to us by Sebastian, and a few others too, so there is excitement.
It’s like 6 days old and has 153 stars on github, 4 contributors, 18 forks.
Fun.

Michael #5: Deep Learning Workstations, Servers, Laptops, and GPU Cloud

GPU-accelerated with TensorFlow, PyTorch, Keras, and more pre-installed. Just plug in and start training. Save up to 90% by moving off your current cloud and choosing Lambda.
They offer:
- TensorBook: GPU Laptop for $2,934
- Lambda Quad: 4x GPU Workstation for $21,108 (yikes!)
- All in: Lambda Hyperplane: 8x Tesla V100 Server, starting at $114,274
But compare to:
- AWS EC2: p3.8xlarge @ $12.24 per Hour => $8,935 / month

Brian #6: Auto formatters for Python

A comparison of autopep8, yapf, and black
Auto formatters are super helpful for teams. They shut down the unproductive arguments over style and make code reviews way more pleasant. People can focus on content, not being the style police.
We love black. But it might be a bit over the top for some people. Here are a couple of other alternatives.
autopep8 - mostly focuses on PEP8
- “autopep8 automatically formats Python code to conform to the PEP 8 style guide. It uses the pycodestyle utility to determine what parts of the code needs to be formatted. autopep8 is capable of fixing most of the formatting issues that can be reported by pycodestyle.”
black - does more
- doesn’t have many options, but you can alter line length, can turn of string quote normalization, and you can limit or focus the files it sees.
- does a cool “check that the reformatted code still produces a valid AST that is equivalent to the original.” but you can turn that off with --fast
yapf - way more customizable.
- Great if you want to auto format to a custom style guide.
- “The ultimate goal is that the code YAPF produces is as good as the code that a programmer would write if they were following the style guide. It takes away some of the drudgery of maintaining your code.”
Article is cool in that it shows some sample code and how it’s changed by the different formatters.

Extras:

Michael:

New courses coming
Financial Aid Launches for PyCon US 2020!

Joke:

American Py Song
From Eric Nelson:
- Math joke. “i is as complex as it gets. jk.”

Episode Transcript

Collapse transcript

WebVTT format On GitHub

00:00 Hello and welcome to Python Bytes, where we deliver Python news and headlines directly to

00:04 your earbuds. This is episode 153, recorded October 16, 2019. I'm Brian Okken. And I'm

00:11 Michael Kennedy. This week's episode is sponsored by DigitalOcean. We'll talk more about them later.

00:15 But first, Michael, could you extend my knowledge a bit? Yeah, by like extending

00:20 the entire Python ecosystem, maybe? Yeah. Yeah. So there's actually a cool real Python

00:26 article called Building a Python C Extension Module. So Brian, you know how to write C code,

00:33 right? Yes. Or at least that's the theory. I used to know how. Yeah. I have this really awesome

00:37 former self of mine that was super good at C++. Yeah. I kind of remember that person that I was.

00:43 I used to be able to write a lot of C. That was my main job is to write C and 3D stuff and OpenGL and

00:49 things like that, right? So it's definitely the main way to extend Python these days. And there's

00:55 other options like there's some cool Rust options and whatnot. But, you know, primarily people know

01:00 C. It runs everywhere, has like light runtime requirements. You're already running CPython

01:04 probably, so you already have those requirements met, right? So extending your code with some kind

01:10 of C extension gives you a couple of options. One is clearly performance. I love to talk about

01:17 Python performance because one, it always surprises me. And two, like people are usually wrong about it.

01:21 They say Python is slow. Like I was just reading something on Quora about why, like compare C#

01:28 to Python. And somebody says, well, you can't even compare them. C# is 50 times faster.

01:33 Well, that's true for certain operations, unless maybe part of that it's done in C and then probably

01:37 Python is faster because like now it's down in like NumPy and doing it in C, which is actually

01:41 faster, right? There's just, it gets really interesting. So one reason you might care about

01:45 writing a C module is just for performance. And I think that's what most people think of,

01:49 but there's also like low level operating system APIs or other C APIs, like some library you can get,

01:57 you might want to use that happens to be written in C and doesn't have a Python way to talk to it,

02:03 right? Yeah. There's lots of stuff with DLLs that are available with C header files,

02:08 but you don't have a Python binding. Exactly. And I bet you have a lot of experience with that,

02:13 with all of your devices and stuff like that. Yep. Yeah. Okay. So those are the two main reasons I can

02:18 think of writing C extensions. I mean, obviously throw some Cython at it if it's a performance thing

02:23 to give it a try, but there's a cool tutorial on real Python and it talks about how you can,

02:28 you know, like things you'll be able to do is like import C functions within Python,

02:32 pass arguments from Python to C, raise correct exceptions in your C code. So they surface bubble

02:38 back into your Python code as a proper, like a value error type exception or something like that,

02:44 all sorts of cool things and even how to test and distribute it. So let me just sort of talk

02:48 through the process and then people really care. They can go read the big long article, right? So

02:52 if you want to basically get access to some C functionality, or if you want to just like write

02:59 your implementation and see for some degree, first thing you got to do is go and figure out, let's

03:04 suppose you want to call like a, some C function, right? So the article uses F puts, which puts a string

03:10 into a file pointer, right? Like basically writes a string to a file in C. So you have to write a

03:16 function, which is pretty interesting because it returns, you have to start talking in the CPython

03:23 language, not Python, right? So everything that gets passed around is a Python object pointer or the

03:29 return value is like a py object pointer, right? So you pass these things around and first of all,

03:36 you declare like whatever inbound arguments you're really expecting and you get basically passed a

03:41 single pointer that is the arguments to your function, but that's really a tuple. So there's

03:45 a pyarg parse tuple, give you the arguments, a format thing, and then you give it the address of the

03:51 pointers. You pass them by reference basically. And then you just do your CPython code. In this case,

03:56 the function that they're wrapping, F puts, returns the number of bytes copied when it does that.

04:02 And so this function wants to return the bytes copied, but you can't just return an integer or a

04:07 long. No, no, no, because everything in Python is a py object at the C level, py object star,

04:12 even numbers. So you have to convert from a long to a py long from long, which is a function that you

04:20 get from the Python.hc header file. Okay.

04:23 It's actually pretty simple. There's like some weird non-obvious like structure at the beginning

04:28 of the function. So it can be called by Python and the return value is weird, but everything else in

04:32 the middle is like straight C. So you don't really have to think about what's going on. The GIL will

04:37 protect you from like race conditions, all that kind of stuff.

04:40 Yeah. And actually one of the things I love about this article is that it's using a fairly simple

04:44 example so that you're not distracted by the example. It's just the boilerplate junk that you got to

04:50 learn about. Yeah, absolutely. Which is, you know, the, probably the thing you don't know,

04:53 even if you know C, right? Yeah. And it says also, there's a few other things that are necessary if

04:57 you actually want to use this code and not just write it and compile it and see, because you have

05:02 to write a definition for your module in C and the methods that it contains. So there's a few C

05:09 functions that you call there, and then you have to build it for Python, which you basically create a

05:13 setup.py file and use just utils and it will compile and create the right base library and

05:20 install it for you. Okay. Pretty cool, huh? Pretty cool. Yeah. One of the issues with this is,

05:23 is that people that have to, a lot of times when you need to do this, it isn't a hardcore C compiler

05:31 person or a hardcore Python, CPython person that needs to do this. It's just your casual user that

05:38 happens to have a use case that they need to connect Python to C. And so this is great.

05:42 Yeah. And it's super approachable. And like you said, the examples are pretty straightforward.

05:47 Obviously you're writing C, which puts you in a different category of hard, right? I mean,

05:52 free malloc, pointers, pointers by reference, like all that kind of stuff that you learn when you learn

05:56 C, but that's the world you got to live in when you go down and you don't, you know, take the blue

06:02 pill or whatever it is.

06:05 Is the blue one the good one? I think.

06:07 No, I always forget. I know that there's a pill that's good and there's a pill that's like bad.

06:11 It keeps the facade, but yeah, probably the, I don't know.

06:15 Do you know what else is good?

06:17 Documentation.

06:18 No, Python 3.8. Python 3.8 is good.

06:21 But also Python 3.8.

06:23 For even the URL, sorry.

06:24 Python 3.8 dropped just this week. So it is no longer beta. It is final. And you can download it

06:33 from the front page. The default is Python 3.8.0 now when you download it. So yay.

06:38 Yes, that's awesome.

06:39 We've talked about a lot of stuff. We've, on this podcast, we've talked about things going into 3.8.

06:44 Like the walrus operator, of course, that's come up a lot of times. Those are assignment expressions,

06:49 positional only parameters, and fstrings. Fstrings have the little equal signs so you can debug with

06:55 them easier.

06:56 Right. Fstrings have been here since 3.6, but now they have this like self-documenting

07:00 short print statement thing, right?

07:02 Yeah, and it takes longer to describe than to show, and it's cool. What I wanted to highlight

07:07 is the what's new in Python 3.8 document that came out from, that's at docs.python.org. And it's a

07:14 really great summary of all the stuff that's in 3.8. It does have all of those new things,

07:19 that all those big hitters, but it also has some stuff that I was surprised by that I hadn't heard

07:25 of before. One of them is, we've talked about a lot of async stuff, and now you can type

07:31 python-m async.io, and it launches a async native REPL.

07:37 That is so cool, and I had no idea that that was there. I guess it would have been a pain in the

07:43 butt before to like work with async stuff over there in the REPL, right? So...

07:47 Yeah, I guess. Now you can just, because I often drop into the REPL to try something out,

07:51 now you can try out async stuff in there. So that's cool.

07:54 Yeah, that's super cool.

07:55 A couple other things that'll just help you while writing Python. A couple new warnings and

08:00 messages for things that you might do wrong. So when you're not supposed to compare, use is or is

08:05 not to compare non-objects, like strings or integers or something. It's just like, if x is 3, don't do

08:13 that. But apparently the warning wasn't very good, and so now the warning is better. It tells you to use

08:18 double equal or not equal. So that's cool. And then one of the things that I often get, because I do a

08:24 lot of parameterized testing, is if you've got a list with tuples inside, or, you know, basically a list of

08:31 lists or a tuple of tuples, and you forget the commas between some of the things, because maybe they're on a

08:37 new line or something. The warning was weird before, but now it is a more helpful message.

08:43 So I love things like that.

08:44 Yeah, you know, it drives me crazy if those are strings. Like if you're creating like a JSON document

08:49 or something like that, or a multi-line, like a list of strings, you forget a comma, it just concatenates

08:54 them, even though they're on separate lines. I'm like, oh, really? That's the default behavior?

08:58 But I understand where it comes from, but it drives me crazy.

09:00 That probably still there.

09:01 Yeah, yeah. I don't see how you would fix that without changing what that means.

09:05 Yeah. This one, it took me a while to get my head around, but I didn't know that this was an issue

09:10 before. Iterable unpacking, so if you like packed a bunch of stuff into a variable, you can unpack it

09:17 with star variable name. You can't return that in a return statement, or you couldn't before out of a

09:25 tuple. So you had to put parentheses around it before you return it. But now that's gone away,

09:30 you can just return it.

09:31 Yeah, there's a lot of good stuff in here, actually. And you just did an episode on it,

09:35 didn't you?

09:35 Yep. Episode 91 of Test and Code. I just read through the entire article, and it's still just 20 minutes.

09:41 I didn't read through everything, but it highlighted all the stuff that I thought was cool.

09:45 Super.

09:46 You know, something else that's cool is DigitalOcean.

09:48 I love DigitalOcean.

09:49 This episode is sponsored by DigitalOcean, and Python Bytes infrastructure runs on DigitalOcean,

09:55 thanks to Michael putting that all together. And it's quite solid, and we're super happy with it.

10:00 But did you know that not all web applications and services have the same memory and CPU demands?

10:05 It's shocking, isn't it?

10:06 Shocking. Anyway, so DigitalOcean has embraced this diversity in their droplet structure, which is cool,

10:12 with the ratio of memory to CPU powers in droplets. The general purpose droplets have a ratio of four

10:19 gigabytes of memory per CPU, and you can scale those up. They added, not too long, a couple years ago,

10:25 I think, CPU optimized ones. So they doubled the number of CPUs per the amount of memory,

10:30 and that's great for CPU bound tasks. But there's some applications like high performance databases or

10:36 in-memory caches or data processing of large sets that a lot of memory might be a really great thing.

10:43 So there's now a memory optimized droplet that reverses that structure and makes it like eight

10:49 gigabytes of memory per CPU. It's pretty cool.

10:52 Yeah, very cool.

10:53 Yeah. Use the right kind of droplet for the right service that you're using,

10:56 and try it out at pythonbytes.fm/digitalocean, and they'll give you a $50 credit for new users.

11:02 You and I have mentioned that folks should put legacy Python where it belongs in the past.

11:08 Last time we spoke about 35 million lines of Python code at JPMorgan Chase and their

11:14 journey to work on that. And that's all interesting. But we just recently got this announcement from

11:21 the UK's NCSC, the National Cyber Security Center.

11:25 Wow.

11:26 Yeah. And they're warning developers of the risk of sticking with Python 2, particularly library writers.

11:32 Okay.

11:32 That's interesting, right? That they actually go so far as to call that out as a thing.

11:38 So they say, look, this could be like, basically, the companies that continue to use Python 2 past its end

11:44 of life could be like tempting or setting the environment for another WannaCry or even an Equifax incident.

11:52 So Equifax was a horrible data breach. Basically, it's one of these companies that gathers up so much private data.

12:00 Like, they know stuff about my financial past that I have forgotten and don't even know, right?

12:04 They go, did you know you had this account in California? Like, I did? Oh, okay. Well, I guess I do.

12:08 Right. They know all of that. And it was broken into. Why? Because there was a vulnerability in

12:14 Apache Struts, which is an open source framework. People at Struts are like, guys, this is super bad.

12:19 You just have to send like a special ACRP request to the server and it's owned, right? Well, the folks at

12:25 Equifax got that message, but they didn't really want to get around to like upgrading it to the new

12:30 version because, hey, it's kind of hard to upgrade this thing. It's like a new version, which probably was

12:34 old and was slightly incompatible or something. Anyway, that's where Equifax came from is running

12:39 an old version of one of these frameworks, not Java itself, but like the web framework on top of it.

12:44 Anyway, there's some cool quotes in here. They say, if you're still using Python 2x, it's time to put

12:51 your code to Python 3. If you continue to use unsupported modules, you are risking the security of

12:56 your organization and data as vulnerabilities will sooner or later appear, which nobody's fixing.

13:02 Okay, that's one. One interesting quote. Another one is, if you maintain a library that other

13:07 developers depend upon, you may be preventing them from updating to 3. And by holding back other

13:12 developers, you're indirectly and likely unintentionally increasing the security risk of

13:17 basically all the computers in the world.

13:19 Yeah.

13:19 Yeah. So, I mean, we've said this before, right? You and I have said this, but if the NSA or the

13:26 NCSC, they come out and publicly call out Python 2 like this, well, that maybe carries more weight

13:33 than Python bytes. Not that we don't carry some weight, I'm sure.

13:36 Yeah. It actually makes me think though. Let's say you have a library that now works on both Python

13:43 2 and 3 and somebody else is depending on it. And they're also depending on another library that is

13:49 2 only. They're going to stick with 2. Yeah.

13:52 But if, like, for instance, you could push them if you, like, stopped supporting Python 3 or Python 2.

13:58 It's a good question. Like, in six months, do we have a obligation to actually cut Python 2 out

14:04 of our libraries? I mean, I don't have any libraries people care about, but...

14:08 Maybe? To force people to upgrade? Maybe you could do some help?

14:12 Yeah. Most of these changes have been more self-serving or self-centered, right? Like,

14:16 NumPy and Django, all those folks dropped Python 2, not because they're like, we're going to fix the

14:21 world, but, like, we don't want to maintain this stuff. We want to just move forward and use the

14:25 cool features, and we can't right now.

14:26 Yeah.

14:27 Yeah. Pretty cool.

14:27 I guess one other kind of interesting thing to call out from this report, article, whatever you call it,

14:33 is that they said that Python's popularity makes updating the code imperative, which I thought was

14:39 pretty interesting. It's like, Python is so successful. It's so broadly deployed. Like,

14:43 we can't just ignore this. It's not like Adobe Flash. It's now running an old version. We should

14:48 deal with it, right? It's like, this is one of the really important parts of the computer infrastructure

14:53 that they called out. So, yeah, I mean, there's got to be other places where we get this kind of news,

14:58 right?

14:58 So, I got a notification from a guy called Sebastian Steins, and he put up a, it's basically

15:07 a Hacker News lookalike site called news.python.sc. I don't know what sc stands for. Yeah, it looks a

15:14 lot like Hacker News, but it's just got Python stuff on it, and it's pretty neat. So, I thought, oh,

15:21 that's cool. We should talk about it. But one of the neat things about it is he put it all together

15:25 relatively quickly in like a week or so, and it's built on Django, and all of it's open source. So,

15:32 you can take it and look at how it's done and everything. Plus, it's up, and it's live, and you

15:38 can post stuff. It's neat. And I thought, yeah, maybe we'll cover this. And then while I was thinking

15:43 about covering it, we got like two or three other people tell us about this new news site. So,

15:49 I think people are using it. It's kind of fun. What do you think?

15:52 I like it. It definitely looks like Hacker News, but more Pythonic colors. And, you know,

15:57 looking through this, these are all legitimately interesting things here. I'm like, yeah, oh,

16:01 yeah. Well, I read about that. That was cool. And, oh, I didn't know about that, but interesting. Yeah,

16:06 I feel like this is great.

16:07 And even if it doesn't take off, I think it's cool to have an example of a working model

16:11 of simple with like people being able to vote things up and down. And that's kind of a neat,

16:18 kind of a neat model to say, there's a working website, a working user model. How can I emulate

16:24 that in Python?

16:25 Yeah, it's super cool. I'm definitely going to start checking it out for, as one of my news sources,

16:29 in addition to Redis and Twitter and other places.

16:31 Yeah, like we don't have enough to do.

16:33 I know. Now you just gave me work, man. Come on, it's homework.

16:36 So you've heard that most people are moving to the cloud and data science is moving to the cloud.

16:42 There's all sorts of interesting stuff happening up there. But, you know, a lot of times this type

16:47 of work, especially training like machine learning models and stuff is super, super intensive.

16:51 So if you've got like a laptop, some of the GPU processing and other really interesting things

16:58 are inaccessible to you. Like, for example, my MacBook is super killer, but it's got, you know,

17:03 like 12 cores if you count the hyper threads and it's got 32 gigs of RAM, but it has a ATI, not a

17:09 NVIDIA graphics card. So you can't use CUDA on it, for example, right?

17:13 So what do I do? I go to the cloud. Well, if you're really into deep learning and you really want to do

17:20 like data science with GPUs, there's this place called Lambda, this company called Lambda,

17:25 that is creating these deep learning workstation servers and laptops. Have you heard about this?

17:31 Huh? No. Just to be clear, this is like a super commercial product, right? These are like

17:36 servers that you buy and we have no, this is not like a product placement. I just ran across this

17:40 and thought, dang, this is interesting. So I thought I would just talk about it. So they're selling GPU

17:45 accelerated TensorFlow, PyTorch, Keras, and other types of pre-configured machines. Just, they say,

17:53 just plug in and start training. You're good to go. And they talk about how you can save a bunch of

17:58 money, right? You don't run on the cloud. The cloud can save you money for short work,

18:02 but if you got to do it over a long time, it can get expensive. So they offer a TensorBook,

18:07 which is a GPU training available laptop, capable laptop for $2,900. That's a pricey laptop, right?

18:15 Yeah.

18:16 Actually, it's less expensive than my MacBook, but still. But if you were going to do GPU stuff,

18:22 you know, this is a really cool option to be able to do it on the go or be mobile. Then they also have

18:28 a workstation, which is called Lambda Quad, which has four GPUs in it. And this one is $21,000.

18:35 Okay.

18:36 Okay. That's a lot. But if you go and grab a second tier GPU enabled EC2 instance,

18:42 specifically a P3-8X large, that's over $12 an hour, which comes out to close to $9,000 a month

18:51 in cloud bills. If you were to run it all the time, I mean, obviously probably not all the time,

18:55 but so, you know, like $21,000 is a lot, but a $9,000 monthly bill for AWS is also a lot.

19:01 Yeah. It's something to pay attention to is just as your bill starts getting bigger,

19:04 maybe a dedicated hardware makes sense.

19:08 Anytime I run across something like this, if it were Alien, or if it were gaming laptops,

19:11 or like the Apple MacBook Pro or whatever, it's like, all right, well, what if you're all in?

19:16 What if you turn all the knobs to 11? What, like, what could you get? So they have this thing called

19:21 the Lambda Hyperplane, which has eight Tesla V100 GPUs. And it starts at, it's not the final price.

19:27 It starts at $114,000.

19:29 Nice. That's without the pinstriping.

19:32 Yeah, exactly. That's not even the leather bound keyboard or whatever. I don't know.

19:36 Anyway, if you're into deep learning and you need GPUs for computational stuff, data science and whatnot,

19:44 this is actually pretty cool.

19:46 Yeah. Also, I'm sure there's applications where you really don't want to use the cloud. You want

19:50 to use in-house computers and not go out, or the connection is bad. You're sticking some data in

19:55 the middle of nowhere or something, and you can't get to the internet.

19:58 Right, right. If you've got terabytes of data, like, that takes days to upload. So,

20:02 maybe it's better to just run it locally. Who knows?

20:05 Yep.

20:05 Black has been a big hit.

20:06 Yeah, I like Black.

20:08 Yeah, for sure.

20:09 A lot of people do.

20:10 Oh, yeah.

20:10 One of the things, so I ran across an article. It's not a new article, but it's all still relevant.

20:15 It's auto-formatters in Python. And big shock, Black's in there. But one of the things I

20:21 liked about it is they spent a little bit of time talking about why you want to use Black

20:27 or something else. And I'm finding this more and more as a team lead that just, it's not great to

20:34 have, like, if you're doing code reviews, you don't want to have, like, style be part of the code review.

20:39 Yeah.

20:40 It's way better to have a tool just dictate the style. And so, people can argue with the tool

20:46 instead of arguing with each other.

20:47 Yeah. It's like, if the code review, the people there, I'm sure they feel like, well, I have to

20:51 make a constructive or critical comment about something. It shouldn't be, why are you indenting

20:57 like that? Or why is there not a space by those commas? Like, that's the stuff machines can agree

21:01 upon and just do for us, right? Like, have architectural or algorithmic conversations, right?

21:06 Yeah. You should be using three double quotes there instead of one. So, get off the style

21:10 police and use an auto formatter instead. I love Black. A lot of people do. But there's reasons

21:16 for some people, like an established code base or other predefined style guide, that maybe it's too

21:24 much. It does do things that sometimes I don't like it to do. So, there's a couple other options.

21:30 And this article talks about auto PEP 8 and YAPF. Now, auto PEP 8 is essentially just, it just does

21:38 PEP 8. It's, or uses PyCodeStyle, the utility, to detect PEP 8 violations and just change the code.

21:45 You can do both with it. It does less than Black, but it doesn't do much more. So, if that, if really

21:51 you're just trying to stick to PEP 8, maybe that's, yeah, that'd be better to use. And the other end of

21:57 it is YAPF, which is a tool out of, and I don't know how to say that, YAPF? It's probably yet another

22:04 Python formatter. Yeah, it probably is. It's a Google tool. I think it's Google. I think it's good if you

22:09 want, it's got a lot of knobs and dials, a lot of customization. So, if Black doesn't have enough

22:14 controls for you and you really want to tweak it to be your personal company's code style, this might be

22:20 great for you. In the documentation, it says it takes away some of the drudgery in maintaining your code

22:27 code. And what, just ultimate goal is to code, is that it produces as good a code as that a programmer

22:33 would write if they were following the style guide. That sounds pretty good, honestly. One of the

22:37 interesting things, I was researching this story, is I didn't know this about Black. After it's changed

22:44 your code, it does a check to see if the reformatted code still produces a valid abstract syntax tree

22:52 that is equivalent to the original. That's pretty cool. I didn't know it did that.

22:57 Yeah, that is cool. So, run it through the Python parser and turn it into

23:01 bytecode and then just see if the essence is the same, which yeah, I mean, because you don't actually want to change

23:07 the meaning of the way the code actually gets interpreted. It's just formatting, right?

23:10 So, the meaning change is like, well, that might be a problem.

23:13 Yeah. The other thing I wanted to highlight this article for is it took a few code examples and just did the, what does Black change it into? And what does

23:21 Yapf change it into? And what does Autopepate change it into?

23:24 Oh, that's sweet. I like that.

23:25 Yeah.

23:26 Very, very cool. All right. Well, that's all of our main items. You got anything else you want to throw out there while we're here?

23:30 No. You?

23:32 Yes. A couple things. I'm getting excited for PyCon US. It's earlier this year, in April at some point, I'm guessing. But the announcement I want to make is that the applications for financial aid are open and they'll be accepting them through January 31st, 2020. So, 30 days into a world with only Python 3. The Python Software Foundation and PyLadies are making this financial aid possible and check it out.

23:56 Yeah. So, like PSF is contributing $130,000 towards that. And yeah, it's pretty good. So, if you're thinking, hey, I would really love to go to PyCon and make some connections, kind of new to this world, use some networking and learn more about it, but I just can't justify the expense or afford it, maybe do that.

24:13 Yeah. Nice. Indeed. Indeed. And I'm working on some new courses. I got one that's all done and recorded, just getting edited. Another one, I spent like six contiguous hours recording videos yesterday. That doesn't sound like a lot of time if you haven't done it, but six straight.

24:26 Eight hours recording, that's a lot. So, I'm really, really excited about what's coming out. We'll share more with it when I can.

24:31 Very exciting.

24:32 Oh, yeah. Now, sometimes we have really short jokes. I see that you have one.

24:36 We got a short joke that was contributed by Eric Nelson. Thanks, Eric. It is a math joke. The joke is, I is as complex as it gets. JK.

24:47 The letter I. Yeah.

24:48 I love it. I love it. I studied a bunch of complex analysis and things like that when I was doing math and yeah, I like it.

24:55 Yeah. We have another one too that it's long.

24:57 It's long and I'm not going to be able to do justice to it, so you have to check this out. So, you know the song American Pie, right?

25:04 Yes.

25:04 I drove my Chevy to the levee, but the levee was dry. That sort of song?

25:08 Yeah. You can sing it.

25:09 No, no, I can't sing it. I could recite it.

25:11 If I sing it, it's not going to be singing. It's going to be something else. There's another one. One of our listeners, I only know his username on Reddit, I'm afraid. I can't find the tweet in time.

25:22 Wait. It said, hey, you inspired me to write this song called American Pie, American P-Y, and it's basically the story of like Legacy Python done to American Pie, the song.

25:35 Yeah, it's pretty awesome.

25:36 It's really, really well done. I'll just like recite a little bit here one of the verses.

25:41 So, bye-bye to your Legacy Pies. Made decisions about divisions, so you'll have to revise. And you decode official, it's not a bunch of bite lies. Singing, that'll be the day it dies. That'll be the day it dies. It's really good. Yeah, people should check it out. If somebody can perform this and give it to us, he's given us permission to take that and put it on the air. If it's good enough, man, we'd love it. That'd be awesome. I cannot do this.

26:03 I want somebody to sing it because it includes the phrase, I was a crusty old fart coding guy.

26:09 Yes, I know. You could be a YouTube sensation if you just take this chance here. Jump on it.

26:16 Yes, and if you do, let us know.

26:18 Yeah, for sure. Let us know. That'd be awesome. All right. Well, yeah, this really nice song and a nice job there. Well done on that. And Brian, thanks for everything. Thanks for being here.

26:30 Thank you. Yep, you bet. Bye. Bye.

26:32 Bye. Thank you for listening to Python Bytes. Follow the show on Twitter at Python Bytes. That's Python Bytes as in B-Y-T-E-S.

26:39 And get the full show notes at pythonbytes.fm. If you have a news item you want featured, just visit pythonbytes.fm and send it our way.

26:47 We're always on the lookout for sharing something cool. This is Brian Okken, and on behalf of myself and Michael Kennedy,

26:52 thank you for listening and sharing this podcast with your friends and colleagues.

Want to go deeper? Check our projects

Course: Python for the Absolute Beginner course

Beginners

HTMX + Flask

FastAPI

pytest book

Full transcript