Transcript #146: Slay the dragon, learn the Python
Return to episode page view on github00:00 Hello and welcome to Python Bytes, where we deliver Python news and headlines directly to earbuds.
00:04 This is episode 146, recorded September 4th, 2019. I'm Michael Kennedy.
00:10 And I'm Brian Okken.
00:11 And we have a special guest joining us this time. Welcome back, Trey Hunter.
00:14 Thank you. It's nice to be here.
00:16 Yeah, it's great to have you here. And it's always nice to get a new perspective. And so
00:21 it'll be fun to see what you're into. We've got some cool Django stuff lined up because of that,
00:25 I know. But Brian, why don't you start off by telling us why there might be a forward slash
00:30 in our Python function definition?
00:31 I sometimes forget that this is that 3.8 isn't out already, because I've been using Python 3.8 for a
00:38 while now. But it's coming soon. And one of the features for 3.8 is positional only arguments.
00:45 So positional only arguments are a way we have keyword only arguments that say you have to include
00:51 the keyword in the argument. We also have normal arguments that you can include the keyword or not.
00:58 In 3.8, they also add positional only, which means you can't provide the keyword and they have to just be
01:05 in the correct order, kind of like C functions.
01:08 functions. So my first thought for this was that the reason for this is probably to, you know, when you're integrating with a DLL or something, it might be just better for integration. But we ran across this argument by a guy named Sanket. It's called positional only arguments in Python. And it's just a nice little overview of what it is. But I like the example. So the example, how you specify it is that you specify your normal arguments,
01:35 and then you put a slash as one of the arguments, like a normal division slash that isn't one of your variables. It just denotes the difference between the where the positional only stop.
01:46 And the example he gave was a power function. And this is like a math type function or something like that makes a lot of sense where the names of the arguments really don't matter. And the order kind of really does matter. Because if you, you know, like if you're thinking that the power function was x to the y, but it's really y to the x, that would be weird.
02:07 So specifying them in the correct order is kind of necessary. So anyway, I just wanted to throw this out there because it was a good example of a place where positional only arguments makes a lot of sense.
02:18 Yeah, it's interesting. You know, when I first looked at it, I thought, well, what is the value of this? But there are a couple interesting things. One, functions that are called positional only, without the keywords have some kind of significant speed up.
02:32 I can't remember if it's in 3.7 or if it's coming in 3.8 or something like that, like 20% quicker. So there's some kind of performance optimization for this. I also heard that it might make other implementations like PyPy and other things easier to have compatibility with some of the lower level stuff in there if you can have these positional only arguments.
02:54 So it's interesting. It's a feature coming in 3.8. I'm curious to see how it gets used, right? Like how many people actually go and use it. We've had, like you said, the keyword only arguments, which is the star, not star args, but star comma arguments, where it says you have to do the keyword argument style. And I don't see that very often, but I'd kind of like that as a feature for sure. So maybe this one will grow on me.
03:17 Trey?
03:17 This is an interesting one because I'm pretty sure this was discussed when they were planning the feature that this is actually already possible at the C layer in Python. And in fact, I brought this up while you were talking about pow, Brian. If you look at help on the built-in pow function Python, Python actually has the slash in the documentation.
03:35 Because that slash is already there. It means something. You just can't use it as an actual Python syntax right now. So you already cannot call the pow function that's built in with anything but positional arguments. This just kind of reveals or allows this feature to be used by actual Python programmers outside of that C level.
03:52 So, I mean, as a teacher, I kind of feel like it's yet another thing to teach, but it's also kind of consistency, right? I can now explain the help documentation with an actual Python feature.
04:02 Yeah, it brings it up for all the regular developers, not just the core developers. Okay, interesting. Trey, I promised something about Django. What you got for us?
04:12 Yeah, Django stubs. So this is type checking in Django. So type annotations, which are a cool, I'd say new Python 3 thing. I guess they're not so new anymore, although they keep adding little edges to them.
04:26 It's a pretty new library. I think there's a blog post that I saw on this recently. Let me see when it was written. Yeah, it was just last week, I guess. And because of that, I guess it's in beta in the sense that, you know, without actual users giving you feedback, it's only going to work for your use case, maybe, especially because Django is used so widely.
04:43 To be clear, I don't use type annotations in my Django code. But you know, this library might allow me to eventually and I figure more people knowing about it means more users, more people may be fixing little edges in there because there's so much magic kind of, I mean, as much as they say there was a big magic removal in Django, there's just a lot of corners in Django to kind of fix with annotations.
05:01 Yeah, I think annotations are great. They can definitely overwhelm the code and go crazy. But judicious use here and there, where you're crossing boundaries and stuff like what the heck is that supposed to be? Or even something as simple as like putting a type annotation on a request object. So you can say request dot, how do I get to headers again? And what exactly is headers? Is it a dictionary? Is it a multi dict is like, what is it? Right? So those kinds of things I think are super valuable. And I'm happy to see this.
05:31 Right. Well, and the edges between the code, you know, if Django or a third party library is expecting something that what you're sending it works 80% of the time, but there's that one edge case that technically you're not using yet. And no one's testing that thing you don't even know about. You know, it could break, but a type annotation might actually hint at a test you haven't written yet.
05:49 So are the Django type checking, is that going to extend to like APIs as well?
05:56 Ooh, that's a good question. Yeah, I wonder about that. Because it says in the article, it talks about Django rest framework.
06:01 And I didn't even think about the fact that you could probably piggyback on top of this for some kind of other kind of type checking on top of it. But I don't know. I think this is kind of more geared toward the actual programmer side of things, but I'm not certain.
06:15 Okay. Yeah, that's a good one. I'm sure a lot of Django programmers out there will like, especially if you're already using type annotations. Be cool.
06:21 So Trey, you talked about as a teacher earlier, and I've done a lot of training myself. I also have kids who I would like to have some programming literacy, not necessarily make them into little programmers, but have them be able to use programming like they might in math or something or statistics, right?
06:41 In some other aspects. So there's this cool thing I found a while ago, but it's in the news again, called Code Combat. Have either of you heard of this?
06:48 I don't think so.
06:49 Yeah. So Code Combat is a place that you can go and it's aimed at teachers and educators. But honestly, anybody who wants to help somebody who's really early stage, get involved in coding with Python. This is a great opportunity.
07:03 Let me tell you what it is. So it's basically like this Dungeons and Dragons game or something along those lines. And you go into these different worlds and you open them up and each world has maybe 40 or 50 puzzles or challenges you've got across to get through visually, right?
07:21 So you might go in there. It might be a dungeon. It's got like a maze and then a door and then an enemy. And you might have to write some code that'll say, you know, take my hero and move them over here.
07:31 Have them go down, then have them pick up this thing and then find the nearest enemy and attack them and then open the door and whatnot, right?
07:38 Maybe you do that just imperatively. Maybe you write a while loop because like there's some kind of repeating pattern in the maze they've got to do.
07:45 But what's really unique about this is it's very visual. It's got like all the gamification. You've won a badge. You've got a new skill. Like you have to earn the while loop skill.
07:57 So you get like a little, I don't know, special boots that you run really fast that they call like, wow, I don't remember exactly what it is.
08:02 But it's really like quick feedback for little kids. What I like about it, though, is unlike many of these things, you write real Python code.
08:11 There's a hero and you say hero dot move right or speed up or whatever it is. You actually type it.
08:18 Now that sounds bad for like younger kids who are not great at typing or frustrating, but the autocomplete is insane for this.
08:26 So if you're in the editor and you want to say hero find next enemy, you could just type E and hero find next enemy comes up with the enemy variables.
08:36 Like it's crazy helpful on how much autocomplete that it'll give you. And there's not that much that you can do.
08:42 And so it's super interesting. My daughter at the time, I think this was 10. She was all about it.
08:48 She made me get her a subscription and everything. And she just played it probably for a month.
08:52 Then she kind of got tired of it, but she was really engaged. And I just thought I'd throw that out there.
08:56 The reason it's back in the news now is they just got six million dollars in VC funding to keep up this mission.
09:02 Oh, nice. That's awesome.
09:03 Yeah. So just a free resource. Anyone out there who's a teacher or wants to help someone else learn or maybe you want to learn yourself.
09:09 Right. Like it's really constrained and simple Python, but it's real Python code.
09:14 You write to like go through the journey. It's pretty cool.
09:17 Right. And that style of learning really shouldn't be written off because, I mean, you're anchoring,
09:21 you know, some kind of thing in the game that's exciting and fun with the knowledge of that little tidbit of some Python that you learned.
09:28 Yeah. And it lets you work in a like a real way. You don't have to go, well, I know you draggy, you did some draggy droppy stuff and that was sort of programming.
09:36 But now it's time to open up a blank text editor and you have no idea what you're doing.
09:39 Like you've been doing that the whole way.
09:42 So it's like it really introduces you to what professional programming is.
09:47 It doesn't sugarcoat it too much for you, which is what I think is cool about it.
09:51 Awesome.
09:51 Yeah. Worth checking out. All right. Now, before we get to the next one, I want to tell you quickly about our sponsor.
09:57 So this episode is brought to you by DigitalOcean. Now, DigitalOcean has previously had Postgres as a service.
10:03 That's something they announced not too long ago. So if you're using Postgres, you can go sign up, say, hey, just take care of my database for me.
10:10 Back it up. Make sure it's running. All those kinds of things.
10:12 Well, now they're adding a few more things that go along with that.
10:16 They just announced that they have MySQL and Redis hosting as well.
10:20 So if you want to use Redis for a cache and make things go faster, check a box.
10:24 You've got your own Redis cluster.
10:26 Or if you want to use MySQL instead of Postgres, also an option now.
10:29 So check them out. Pythonbytes.fm slash DigitalOcean.
10:33 $50 credit for new users and lots of cool services coming online.
10:37 Speaking of stuff, services and background things like Redis, Celery is definitely one that might land in that realm, Brian.
10:44 I picked this topic. So there was an article by Nick Janatakis called Four Use Cases for When to Use Celery in a Flask Application.
10:52 However, I think it's just really any web framework. I'm not sure if this is specific to Flask.
10:57 But I like this sort of, okay, so this tool, in this case, it's Celery. When would you want to use it?
11:04 I picked this because I'm not sure if he hit the nail on the head or not.
11:08 And I wanted to pick you and Trey's brains on this.
11:12 So Celery, people don't know, it's a module that you can run with your web application that helps you run asynchronous or periodic schedules code in the background.
11:23 So mostly just extra task multitasking, I guess.
11:27 Sometimes it's super, it's very much the wrong thing to try to run that code right in the view handler, right?
11:33 Like if I've got some code that is going to take 15 seconds, you don't want to just have the user sit there and spin for 15 seconds.
11:40 You want to say, awesome, we're working on it. We kicked it off. Enjoy your day, right?
11:46 The first example, I'm going to go through the four here.
11:49 The first example is to send out an email.
11:51 And this totally makes sense because all the work of actually sending it out, you don't actually care.
11:57 As long as you've, as the central task, collected enough information, you can push that onto some background task to actually get the email sent out.
12:05 The user doesn't have to wait for that, right?
12:07 That totally makes sense.
12:09 The other three are connecting to third-party APIs, performing long-running tasks, and running tasks on a schedule.
12:17 And the third-party API one, I don't actually quite get.
12:22 So I was hoping that one of you could explain if this is a good idea or a bad idea.
12:26 Any thoughts?
12:27 Yeah, I use it for all four of these things, actually.
12:30 I mean, in Python, I use it for all four because sending emails, that's the thing.
12:35 Every Monday, 7 a.m. Pacific time, I have to send hundreds of emails.
12:40 And I actually was just wrestling with Celery the last couple of days to get the email sending down from 10 minutes to 23 seconds, which was only possible because I realized I could make it a little bit more concurrent because you can kind of just throw more Celery at the issue sometimes.
12:53 And then the third-party APIs, if you've got a thing like maybe not a credit card processing, but something that could happen, it could happen in the background.
13:01 You just have to go notify some analytics thing or something.
13:04 The user doesn't need to wait for that.
13:06 So you can just spin that off on a Celery task on a schedule.
13:09 Sending emails, in my case, was on a schedule as well.
13:12 And long-running tasks, sending emails is actually a pretty long-running task for me.
13:16 It used to be 10 minutes.
13:18 Is there cases where, like, the user's waiting to see something coming back, but you still want to use Celery?
13:24 Well, I would say sometimes.
13:26 And I'm pretty much with Trey on all of these things.
13:29 I don't use Celery.
13:30 I just have, like, a real simple, like, background thread that I kick stuff off to because the work that I'm doing is 10, 20 seconds, whatever.
13:38 Like, that's good enough to just not have more servers talking to each other where there's more places they could fail.
13:44 But when you're talking to a third-party API, if somebody comes and says, buys a course on my website, I absolutely want to wait until I get a response.
13:53 Say, yes, that was approved.
13:54 No, it was denied or whatever, right?
13:57 So that one, I would never kick purchasing, like, a purchasing call or something like that off to a background queue.
14:03 But what I may well do is, hey, I want to add this person to my mailing list.
14:08 There's no reason that they don't want a response.
14:11 I think the thing is if they do not expect a response in the web page, it's a totally good candidate for this background work, right?
14:19 Like, if you say, I can't log in.
14:21 I need to reset my email.
14:22 They don't expect that to happen.
14:23 You can just say, hey, yeah, great.
14:25 We emailed you.
14:26 Go check your email.
14:26 Right.
14:27 There's always, like, a five-second delay anyway before it makes it through the email pipes.
14:31 So it's totally easy to kick that to something like Celery, send off that email, but get right back.
14:37 On my course site, if I put a new office hour event, there's a bunch of people, like, thousands of people have signed up to get an email when I put that in there.
14:46 At first, I tried to just send it out.
14:49 But then it started timing out because it was taking so long.
14:51 So now that's a background job and things like that.
14:53 Okay, cool.
14:54 I guess it's a great article then if all these hurt.
14:58 I think the third-party one needs a big caveat, right?
15:01 Like, do they expect a response from that third party, like a credit card purchase?
15:05 Or do they not care at all about that response, like, they're now on your mailing list, right?
15:09 I think that's the dividing line there.
15:11 Trey, what do you think?
15:12 I'd agree with that.
15:12 I'm kind of in the same camp in the sense of when I add someone to my mailing list, that API call, they don't even know about that API call.
15:19 They don't need to wait for it if that checkbox fails.
15:22 Whereas charging your credit card, like, that's got to be in the request response, in my opinion.
15:25 Right.
15:26 That's the primary thing they're trying to do.
15:28 And, like, it'll make them really nervous if they don't get some kind of feedback from that.
15:31 Yeah.
15:32 So you probably should test it, huh, Trey?
15:33 Yes, you should.
15:34 The next thing we're going to talk about is pytest, which is, you know, one of Brian's favorite things, obviously.
15:39 I feel like I stole this one from Brian because it's about pytest.
15:42 But so this is pytest steps is what I'm sharing, which is I didn't really know what it was when I first looked at it.
15:49 Then I kind of scrolled through and saw the code.
15:51 And personally, I would use this kind of coming from a Django perspective of my functional test.
15:56 If I have a test where, by necessity, I've got a lot of steps in it.
16:00 You know, I have to go to the login page, log in, click here, fill out another form, do a thing.
16:04 It's not a unit test.
16:06 It's this, like, multi-step process.
16:08 It would be nice to kind of put little checks in there and say, did it pass this far?
16:12 Or if it didn't, don't show the rest of it as failed, show it as skipped because the failure was really at this part here.
16:18 We don't even know if the rest of it failed.
16:19 And so this is a way to kind of break up one longer test into steps.
16:24 And there's a whole bunch of different syntaxes for it.
16:26 I really like the generator syntax that's on this page.
16:29 The other ones I find a little bit less readable.
16:31 But you're going to really have to read through the page because there's kind of a lot of things buried in it as far as different ways you can use it.
16:37 Yeah, I think it's very interesting.
16:39 Yeah, you're always told, well, many people tell you that you should have one assert per test.
16:44 You should be testing one thing.
16:45 But also, a lot of times, you've kind of got to build it up, right?
16:48 So this seems like a nice way to have the infrastructure break it apart.
16:52 And doesn't it do reporting based on like what step it's at?
16:55 I feel like it kind of gives you better information when it fails.
17:00 Yeah, I think Brian would know more about this.
17:02 But I think it just integrates with kind of the way pytest does its reporting.
17:05 And like it pretends as if it has run multiple tests in a sense, kind of like how parameterization works, I think.
17:12 Yeah, it looks like the output just it looks just like a parameterization.
17:16 So you have the test name is the same, but each step will be like listed in a bracket after the test name.
17:24 Looks like it is piggybacking off the parameterization stuff.
17:27 I think it's a nice implementation.
17:28 I'm going to have to play with it.
17:29 My fear in working with teams is it is helpful to try to focus in and have a test that really focuses on something.
17:37 And workflow tests are a red flag, but we do have workflow tests.
17:42 Those are real things.
17:44 And so this is a way to have workflow tests be in place.
17:47 I would just say probably don't abuse it too much.
17:50 But in some cases, it might be the perfect hammer.
17:52 Yeah, if you've got to have it, you might as well do it with a nice tool.
17:56 Yeah, right, right.
17:57 I feel like this is, you know, it's a smaller use case in terms of those very few workflow tests, but it's still useful.
18:02 Yeah.
18:02 All right.
18:03 Let me wrap us up with this last main topic here.
18:06 And now before I tell you all what it is, let me tell you the scenario which it makes sense to use.
18:10 Because it came out of law and you might think, well, I'm not a lawyer, so this is irrelevant to me.
18:14 And it may be, but it might not be.
18:16 So the idea is with this thing I'm about to tell you about, if you want to conduct a survey, think SurveyMonkey or something like that, or you want to create kind of a self-guided interview process.
18:29 But you want a lot of workflow and control.
18:32 There's this really creative project called Doc Assemble.
18:35 And it comes out of the legal space.
18:37 There's a guy named Jonathan Pyle.
18:39 I actually interviewed him for Talk Python on this, and that's coming out in a couple weeks, something like that.
18:44 Probably about the same time this episode ships, actually.
18:47 And the idea is he worked for a, like, public defendant type place where it wasn't a big fancy law firm, but they needed to talk to lots of people and do, like, legally valid interviews and gathering up information.
19:00 So he wrote this open source thing called Doc Assemble.
19:04 And it can be used anytime you kind of want to conduct interviews or surveys that are way better than, say, SurveyMonkey, where you have lots of flow.
19:13 The way you can create the workflow is wild.
19:16 You write basic Python code, and then it evaluates the Python, the execution path.
19:24 So I could say, if user.isUScitizen, then something else, something else.
19:31 And it will actually realize, okay, I have to first determine if they're a U.S. citizen.
19:36 And only if they say yes, do I have to ask them this other question that's inside that if statement.
19:42 And the way it does it is really wild.
19:44 So this thing is packed with all sorts of features.
19:46 It has sends SMS, email, does OCR.
19:50 It has, like, plugins.
19:52 It has background tasks.
19:54 We just talked about Celery.
19:55 It does a bunch of background stuff.
19:57 I think actually using Celery, you define the interviews in YAML, and you write the flow in basic questions in Python.
20:04 And it's just a really interesting way to make, like, advanced interview workflow survey type things.
20:12 So people might check that out.
20:13 It's open source, and it looks pretty cool.
20:15 When I looked at this, I definitely want to try to come up with a reason to use it because it looks kind of fun.
20:20 You can even go, and there's, like, a demo, like, try it or take a survey or whatever.
20:24 And when you do that, there's at the top, there's something that shows the source for each question.
20:28 It's pretty cool.
20:29 It'll show you the actual YAML, and I think the Python code that's making it go.
20:33 But it also shows you things like the reading level.
20:37 There's a bunch of different measures for how hard or complex are these questions and terms.
20:41 And so you can get it like, oh, this question's too complicated.
20:44 How do we rephrase it?
20:45 There's, like, just tons of little edge cases that are super well polished.
20:49 Like, there's a plugin for Word docs, so you can write them there instead of an EAML.
20:53 All kinds of stuff.
20:54 I'm actually going to have to use this.
20:55 This is, or at least look at it, because this is something I've needed in asking, like, surveys for my Python Morsels customers.
21:02 If you're using it professionally versus if you're a hobbyist, you've got a different set of concerns and different questions.
21:07 Yeah, exactly.
21:07 In Python, it'd be like, if user.ishobbyist, colon.
21:11 And, like, that's how you write the workflow.
21:13 It's really cool, this thing.
21:14 Awesome.
21:14 All right, cool.
21:15 Well, that's our main topics.
21:16 Do you guys have any extra stuff you want to just throw out there real quick?
21:19 I don't.
21:19 All right, well, good thing I brought some then.
21:21 Yeah.
21:22 Actually, so really quickly, we are at 194,740 projects on PyPI, which means we are on the edge of 200,000 packages on PyPI, which is, I think, pretty awesome.
21:36 So very exciting to have that growing.
21:40 I remember when I was blown away that there were, like, 70,000 packages.
21:42 That's awesome.
21:43 Yeah.
21:44 Super cool.
21:44 So I'm linking to a tweet that Raymond Hedinger put out around that.
21:48 Also, really quick, NumPy 1.0 was released.
21:52 There's a bunch of mathematical stuff that if you care about, cool new features and improvements, probably too specific to go into.
21:59 And also, Brian, you talked about Python 3.8.
22:01 Python 3.8 beta 4 is now available for those who want to test on the slightly closer to finished version.
22:07 Yeah, I love 3.8.
22:08 All right.
22:09 All right.
22:09 I've got a joke for you.
22:09 We haven't told this one yet, have we, Brian?
22:11 It seems familiar here.
22:12 Well.
22:13 But let's do it anyway.
22:14 We're doing it anyway because it was fun to have it go through your office as well.
22:17 All right.
22:18 Knock, knock.
22:19 Who's there?
22:19 Recursive function.
22:20 Recursive function who?
22:21 Knock, knock.
22:22 Yeah.
22:24 How long can we go for this?
22:25 So it's a good joke, but you only understand it if you know recursion, right?
22:29 So how do you learn about it?
22:30 Do you Google it, Brian?
22:31 Yeah, you can Google recursion and then you'll see a link that says, did you mean recursion?
22:35 That's awesome.
22:38 All right.
22:39 I don't know who put this next one here, but I really like it.
22:41 You have to do it though.
22:42 Okay.
22:42 Yeah, I put it down.
22:43 So, hey, what's your address?
22:45 192.7.7.3.
22:47 No, your local address.
22:49 127.0.0.1.
22:50 No, your physical address.
22:53 A-C-5-E-C.
22:56 Give us a Mac address.
22:58 I love this.
23:00 Yeah, that's my Mac address.
23:01 Yeah.
23:01 Yeah.
23:02 Yeah.
23:03 So, thanks.
23:04 That's a good one.
23:04 All right.
23:05 Well, those were some good laughs and Trey and Brian, thank you both for being here today.
23:09 Thanks for having me.
23:10 Thank you for listening to Python Bytes.
23:12 Follow the show on Twitter via at Python Bytes.
23:14 That's Python Bytes as in B-Y-T-E-S.
23:17 And get the full show notes at pythonbytes.fm.
23:21 If you have a news item you want featured, just visit pythonbytes.fm and send it our way.
23:25 We're always on the lookout for sharing something cool.
23:27 On behalf of myself and Brian Okken, this is Michael Kennedy.
23:31 Thank you for listening and sharing this podcast with your friends and colleagues.