Transcript #13: Python making the move to GitHub and Dropbox is stepping back from Pyston
Return to episode page view on github00:00 This is Python Bytes. Python headlines and news delivered directly to your earbud.
00:04 It's episode 13 on Monday the 13th. Oh my god. Recorded February 13th, 2017.
00:11 Hey Brian, I want to start by saying thanks and hello. I want to say thanks to a new sponsor of
00:17 the show. We have a new sponsor. That's wonderful. We do. And I want to say thank you to Metis.
00:21 They're a data science training company. We have some boot camp type stuff that's really excellent
00:25 and I'll tell everyone about that later. But I kind of like to learn about alternate Python
00:31 implementations and where they are these days. We had heard about Piston, which is spelled P-Y-S-T-O-N,
00:38 developed by Dropbox and an announcement came out at the end of last month saying that Piston is no
00:45 longer to be supported by Dropbox. Yeah, so this is a really interesting development. It's a little
00:52 unfortunate. I don't really know how to interpret as a positive or negative. I mean, it could be just
00:59 there was an experiment. We're moving on to the next experiment. But what it was is an alternate
01:03 version implementation. I have a hard time calling an interpreter when it's actually JIT compiled, but
01:09 alternate interpreter for Python. And the idea was it supported Python 2.7 and it did JIT compilation
01:17 instead of interpreting. So its main goal was to be a very fast but highly compatible alternate
01:26 implementation of Python 2.7. Yeah. And so there's a couple of blog posts, one on the Piston blog and
01:33 one on by Kevin Modzilewski and one on his own site, kevmod.com. And it looks like there's lots of
01:43 reasons around it. But the general flow of it is that the cost and benefit, it cost more than they were hoping
01:50 and it benefited them not as much as they were hoping. Yeah, so we look at the different implementations of
01:56 Python. We've got obviously CPython and its versions, but we've got Piston, we've got Unladen Swallow,
02:04 we've got Cython, we've got PyPy, P-Y-P-Y, IronPython, Jython. There's a bunch of different ones and they all have these
02:13 trade-offs. And so they were actually able to make Piston pretty quick in the benchmark story. So they got it up to 95%
02:20 faster than CPython 2.7. But they also ran into other drawbacks and they found like actually when it ran the Dropbox
02:29 code itself, it was only 10% faster, but it put a lot more memory pressure on it because it does GC instead of reference counting.
02:37 I've got mixed feelings about it. I like that the project was out in the open, open source, and people could watch this experiment.
02:46 I wasn't really thrilled that they were targeting 2.7 to start with. The outcome of them just, I guess, not working on it anymore.
02:54 It totally makes sense, but it is, I guess, it's just a thing. It's interesting.
02:58 Yeah, I think it's a case of a couple of things. I mean, nobody said it, but I feel like, you know,
03:04 moving to Python 3 is also causing like a sort of a rethinking of this as well. Like if they're going to be moving to Python 3,
03:10 then they'd have to redo this work and so on. One of the problems or criticisms, mild criticisms, let's say,
03:16 was that this was like a from scratch thing. It didn't build on existing stuff. So it didn't build on PyPy
03:22 because PyPy wants to be fast at the cost of compatibility and to run a multi-million line Python application that is Dropbox.
03:30 Compatibility is like super, super critical. So they're like, well, we can't have any incompatibility,
03:36 but it wasn't directly based on CPython, even though it took a bunch of pieces in. And so I don't know,
03:42 it was like this sort of other experiment where the thing that gives me the most hope right now is what
03:47 Brett Cannon and Dino Veland are working on with Pigeon, P-Y-J-I-O-N at Microsoft, where they're trying
03:53 to add like JIT plugin capabilities to the existing CPython.
03:57 Yeah. There is some more CPython news.
04:00 There's big CPython news. CPython has just as of three days ago moved. It used to be
04:10 long, long ago in 2006, it moved to Subversion, SVN. And from 2006 to 2011, it was on Subversion.
04:19 Then it moved to Mercurial.
04:21 What's kind of interesting because that's when Git came out and Mercurial, they both, all this whole
04:25 distributed version control system story kind of happened around then. And Mercurial is interesting
04:31 because it's written in Python. Like for example, Facebook does some really interesting things
04:36 with Mercurial. And so it seemed like a natural home for Python to live on Mercurial.
04:41 But, you know, GitHub has really been the place where people go to do open source.
04:47 Not just a container for open source stuff like SourceForge used to be or something,
04:52 but there's like a whole social ecosystem where everybody comes and collaborates in GitHub.
04:58 And so the big news is as of Friday, Python, CPython and a few of its pieces have moved
05:04 to GitHub.
05:05 I think that's great. And I think that one of the great things about it is the culture around
05:10 GitHub. Whether you're a pro GitHub person or not, that's where a lot of people look for
05:15 opportunities to help out in open source projects. It's a higher incentive for people to get involved
05:19 with helping to maintain CPython.
05:21 Yeah, absolutely. Yeah, I think that's actually the main motivation, which is really interesting.
05:25 So there's a link to the mailing list where they talk about it. We'll put that in the show notes.
05:30 There's obviously the link to the repository in GitHub.
05:32 There's a Reddit thread where people are talking about this.
05:35 And a lot of people are bemoaning the fact that it's not on something totally open source.
05:41 It's not on something that is based on Python. Like it used to be on a source control system
05:48 built on top of Python.
05:49 And while that's cool, like you shouldn't, my opinion is you shouldn't do that at the cost of people loving to work on the
05:57 project. So the real thing that you want to read to get the deep insight on this is at snarky.ca.
06:04 So this is Brett Cannon's blog, and it's the history behind the decision to move Python to GitHub.
06:08 And he talks about how he worked with a bunch of people to come to this decision.
06:13 And he says, you know, back in 2007, it became really obvious that working on the source code and doing code reviews and collaboration was actually a burden.
06:23 There were people submitting, you know, patches and stuff.
06:28 And they would sit there for maybe years before somebody actually reviewed it.
06:33 Like how excited would you be to work on a project to contribute to open source if you submit a patch and it doesn't get looked at for a year, right?
06:39 And so the workflow on GitHub is just so much easier for pull requests and reviewing it.
06:47 And so basically, Brett said, like, I want it to be so easy that I could take my tablet and sit on the beach where I have some kind of internet connectivity
06:53 and review pull requests and chill out.
06:56 Like, that's just not how it works now.
06:58 Like, you've got to have a machine with SSH keys and all sorts of stuff.
07:00 So the main driver behind this is trying to make it easier for core developers to be more responsive to people who contribute to the project,
07:08 which I think is super positive.
07:09 Yeah, I think I'm really excited about it.
07:12 And actually, I had never really considered contributing myself, but I definitely consider it more now.
07:17 Yeah, I definitely consider it more accessible now.
07:21 And I think this is absolutely the right move, even if it's breaking some conventions.
07:25 Like, it's moving off of Python as the thing that supports it.
07:28 It's moving off a fully, fully open source thing to, like, this somewhat closed thing, which is GitHub.
07:35 But still, I think it's great.
07:36 So remember last time when we talked about the top 10 Python articles of 2016?
07:44 Yes.
07:45 Yeah, so I said, well, let's just take a survey of the kind of category of these things.
07:49 And I would say, like, a full two-thirds or more of those were data science.
07:54 Data science is definitely important in Python.
07:56 So I want to tell you about our sponsor, Metis.
07:59 So Metis is a data science training company based in New York, San Francisco, Chicago, and Seattle.
08:04 And they provide full-time immersive boot camps and evening courses that you can come.
08:10 So if you have a full-time job, you can go, like, two nights a week for six weeks and take classes on all sorts of data science things.
08:17 Machine learning and things like that.
08:19 If you are wanting to learn more about data science, especially in a hands-on way, you live in New York or San Francisco, or you're willing to spend some time there, check out thisismetis.com slash talkpython.
08:32 And maybe we'll feature your article next year.
08:34 Yeah, that actually sounds pretty interesting.
08:37 Yeah, I would love to go do more data science.
08:40 So TensorFlow, they've got classes on that, all sorts of cool stuff.
08:43 So check it out, thisismetis.com slash talkpython.
08:45 All right.
08:47 You know what they do a lot of in data science, a lot of functional programming, a lot of expressions, things like that, right?
08:53 Yeah.
08:53 Actually, I don't know.
08:55 I'm not a data science person.
08:57 The ones I've hung around seem to.
08:59 There's an article called Using Functional Programming in Python Like a Boss, Generators, Iterators, and Decorators.
09:05 And these are generators and iterators.
09:07 Okay, all three of those, generators, iterators, and decorators are things that trip up intermediate Python people.
09:12 Beginner people just ignore them for now.
09:16 But this article is a Jupyter Notebook, which I like this trend of people just publishing Jupyter Notebooks as articles.
09:25 It actually is good.
09:27 And there's some good diagrams in here.
09:29 Although the use of really large images, it's kind of annoying to me, but it's okay.
09:37 Anyway, this article talks about, it also talks about just basically what a function is and what things can act like functions.
09:47 And then talks about generators, iterators, and containers and how they all work together to make your code more expressive.
09:53 And it seems like kind of an esoteric sort of thing to think about, but I think it's good.
09:59 And this is a good, I like this article about thinking about this stuff.
10:02 Yeah, I really think this is one of those things that when you come to Python from other languages, it takes you a while to get into these, right?
10:10 Like the concept of a generator is quite different to somebody coming from C or I think Java, I don't think Java has this, things like that.
10:20 But once you get the Zen of them, they're amazing.
10:23 And they make such a big difference.
10:25 But it's really a nice mental shift.
10:29 Yeah, so I really think it's great that he's covering it.
10:31 Also one of those topics that I don't think I remember when I finally grokked what people were talking about, but it took like three or four different articles talking about it for it to sink in for me.
10:43 So here's another one.
10:44 Yeah, excellent.
10:44 Yeah, so we'll link to that one.
10:45 And by some strange coincidence, just so everybody knows, the way that we pick things for this show is I go find a couple of interesting things.
10:54 Brian goes and finds a couple of interesting things.
10:56 We throw them together and then we just talk about them.
10:59 And it happens to be the very next one is the same person in a Jupyter Notebook, but it's actually a totally different topic.
11:06 But it's presented in a similar way.
11:08 So this one is called It's Metaclasses All the Way Down.
11:12 This is a quick but fairly deep dive into what metaprogramming is.
11:18 And it starts out by saying, look, metaprogramming is a technique for writing computer programs that can treat themselves as data.
11:24 So you can introspect, generate, and or modify them while they're running.
11:28 And that's definitely Python.
11:29 You can, at runtime, create types.
11:32 You can add functions or even remove features from classes and objects, right?
11:39 But mostly metaprogramming comes down to metaclasses where you can actually control what the class does.
11:48 So there's all sorts of interesting examples of how you might use them.
11:51 It starts out with an interesting quote saying, look, metaclasses are deeper magic than 99% of the users should ever worry about.
12:00 If you wonder whether you need them, you don't.
12:02 So if you don't, like, that was a quote by Tim Peters.
12:06 If you don't deeply understand, like, yeah, this is totally, I'm going to solve this with a metaclass, like, you probably shouldn't mess with it.
12:11 But they are the power that makes a lot of the magic inside Python happen.
12:16 So a couple of examples.
12:18 One is Django RRM.
12:20 We could substitute SQLAlchemy ORM or PeeWee ORM models, right?
12:24 So when you define, like, a model in Django or SQLAlchemy, you put what looks like column types with database constraints as the fields of the class.
12:32 But then when you interact with them at runtime, they look like strings and integers and dates, right?
12:36 And that transition, that sort of two-faced behavior of these things that they can be used in a database query in one way but then act as normal objects as others, that's metaclasses.
12:45 Another example, there's a cool example about using them to make abstract classes and abstract functions that have to be implemented by things that inherit it and so on.
12:55 Again, this is a nice Jupyter notebook.
12:58 It has some cool pictures, but it really does help you understand this stuff pretty well with nice examples.
13:03 Yeah.
13:03 Yeah.
13:04 I like this, too.
13:05 It's funny that we both picked the same person's article.
13:07 It's a small world.
13:08 Yeah.
13:09 I think we're kind of going on the stuff that most people don't need train right now.
13:16 But next up is lambdas.
13:18 And speaking of another thing that I think newbies get a little confused by, but there are lambda functions.
13:27 Lambda functions are just unnamed functions that are, you can put the entire expression on one line.
13:36 And so they're convenient in a lot of places.
13:39 So here's an article from Dan Bader titled, Lambda functions in Python, what are they good for?
13:45 And what I like about this article is it's just a fairly quick jump into it.
13:50 I think everybody that thinks of themselves as a little bit more than just a beginner in Python should at least understand lambda functions.
13:58 And the thing I really like about this article is it talks about places to not use them.
14:03 Because when you finally get your head around lambdas, all of a sudden it's your hammer and everything looks like a nail.
14:10 He talks about a couple of places like creating class methods and other places where maybe you shouldn't use a lambda for that.
14:19 You can, but don't.
14:20 And then also there's some places where list comprehensions and dictionary comprehensions are wonderful.
14:27 And they're way more expressive, but you could use lambdas in those two cases.
14:32 Yeah, absolutely.
14:33 I think the trick with lambdas that make them so wonderful is like there's little places in your code.
14:38 You're like, if I could just pass a behavior here, I don't want to go out of where I am and write a function and then like give it a name and then say the name here.
14:46 I just want to go, yeah, sort by this reversed or, you know, grab this value out and transform it or something.
14:53 And lambda functions are beautiful for that.
14:57 They're not so beautiful for writing your whole program.
14:58 Yeah, actually sort is one of those, one of those places where one of the parameters is you can pass in a key or how to find the key.
15:07 And it's, I've used lambdas a lot there because I don't really need a named function.
15:13 I just need a function to pass in and do the work.
15:15 Yeah, I've got a list of customers.
15:16 I want to sort by last name.
15:17 So lambda, C goes to C dot, last name, done.
15:20 Beautiful.
15:21 Yeah, it's great.
15:22 So you wanted to do a quick follow-up on Jopronto.
15:25 That's our super, super fast web framework, right?
15:28 Theoretically.
15:28 Yep.
15:29 Yeah, that's what we were talking about that last week.
15:31 But I ran across a Reddit thread that we'll post in the show notes.
15:38 And it is titled, sticking with Flask versus switching to one of the new faster frameworks.
15:47 And the Jopronto author actually posted on this and said, yeah, if you're looking at Jopronto, please read the read me there because I've noted that it's not ready for production, basically.
16:03 So there's, he's not even, he's something about, I'll just read it.
16:08 He's, as stated in the read me, you should not build anything serious with Jopronto now because it's going to probably eat your laundry.
16:14 Remember, it's hand coded in C and this needs a lot of testing.
16:18 On top of that, I plan to do several iterations of API changes in a largely incompatible way.
16:24 I hope, though, it's going to make a serious player in the field one day.
16:28 I applaud the author for putting that out there.
16:30 He's basically developing in the open, but he's not promising to stick with the same model or the same API.
16:37 And I think that's completely fair.
16:39 And he's pretty open about it.
16:40 Yeah, that's really great of him.
16:42 And it's, you know, he seriously is optimizing this stuff in some really interesting but out there ways.
16:50 And so, and all that part is done in C, right?
16:53 And so, it probably needs a little bit of careful validation before it becomes a thing that you really, you know, depend on for your e-commerce site or whatever.
17:02 But it's definitely, it's got so much promise, that project.
17:04 So, I hope he gets there.
17:06 Oh, we're already done.
17:07 So, there we go.
17:08 That's it.
17:09 That's the news we got for this week.
17:11 Anything you want to share with everyone while we're wrapping things up, Brian?
17:14 No, nothing, nothing right now.
17:16 All right.
17:17 Awesome.
17:17 Well, how about you?
17:19 I'm cranking out some classes.
17:20 I just am pretty much finished up my Consuming Restful Services course.
17:25 So, stay tight.
17:26 You know, hang tight for that.
17:27 It's going to be out in maybe a week or so.
17:30 All right.
17:30 Cool.
17:31 Yeah.
17:31 Fun to be building new things for everyone.
17:34 All right.
17:34 Well, thank you everyone for listening.
17:36 Brian, thanks for chatting with me today.
17:38 Thank you.
17:39 Yep.
17:39 See you all later.
17:41 Thank you for listening to Python Bytes.
17:43 Follow the show on Twitter via at Python Bytes.
17:46 That's Python Bytes as in B-Y-T-E-S.
17:49 And get the full show notes at pythonbytes.fm.
17:52 If you have a news item you want featured, just visit pythonbytes.fm and send it our way.
17:56 We're always on the lookout for sharing something cool.
17:59 On behalf of myself and Brian Okken, this is Michael Kennedy.
18:02 Thank you for listening and sharing this podcast with your friends and colleagues.