Brought to you by Michael and Brian - take a Talk Python course or get Brian's pytest book


Transcript #1: Intro to the show and pip 9 is out!

Return to episode page view on github
Recorded on Monday, Nov 7, 2016.

00:00 This is Python Bytes, episode number one, recorded Monday, November 7th, 2016.

00:05 Hello, everybody, and welcome to Python Bytes, where we deliver Python headlines directly to your earbuds.

00:13 This is one of your hosts, Michael Kennedy, along here with Brian Okken.

00:17 Hey, Brian, how's it going?

00:18 It's going great.

00:20 I'm super excited to be doing this podcast with you.

00:23 I'm really thrilled to be bringing these little tidbits to everyone each week or biweekly,

00:29 whatever we would decide to go with.

00:30 Yeah, I'm really excited, too.

00:32 I think this is definitely something that's missing out in the community right now.

00:36 Yeah, the goal is you subscribe to Python Bytes.

00:39 And if you listen every week, you'll basically get a lot of the top headlines that you need to know about.

00:46 So let's jump right into the headlines.

00:47 The first one that I want to talk about is PyData, specifically PyData from Washington, D.C.

00:53 The videos for this conference are out.

00:56 So if you didn't make it to PyData, D.C.

00:59 And you want to check them out, there are 64 videos, which is about 30, 35 hours worth of content on YouTube right now.

01:07 And so a couple of the videos that I thought were really great, you know, looking through them, where one of them was Elasticsearch and Redis, how and when to use them.

01:16 And I don't really know too much about Elasticsearch.

01:18 So I'm really excited to watch this video and learn more about it.

01:21 Another one is the one by Renee Tate.

01:23 She hosts a podcast and a website called Becoming a Data Scientist.

01:27 And she did a talk on advice from her podcast guests about becoming a data scientist.

01:33 So I think that's really cool.

01:34 One that was also cool was visual diagnostics for more information about machine learning and basically tuning your machine learning.

01:42 And then one about data reproducibility and scientific computing called You Got Your Engineering and My Data Science Addressing the Reproducibility Crisis.

01:51 Yeah, I was really impressed with these videos.

01:53 And there's a couple of things that I really liked about them.

01:56 I like the format that shows you can see both the slides and the presenter on a little kind of a picture in picture sort of thing of the presenter talking as well.

02:05 I think that was done well.

02:06 I also really like the, man, these are like 35 minute or half an hour presentations and they're just packed with data.

02:14 Yeah, it's not like people are trying to fill an hour, hour and a half.

02:18 They're like, here's the essence, let's go.

02:19 And it's really great.

02:21 And, you know, I definitely agree with you on the quality.

02:23 I'm really, really impressed with all the Python conferences about how high quality their AV work is and how good of an effort they make to basically record and share, not just with the attendees, but with everybody, whatever went on there.

02:39 Yeah, it's pretty impressive.

02:40 All right, what do you got for us, Brian?

02:41 Oh, all right.

02:42 One of the things that came up is another tutorial about EasyArgs.

02:48 It's a command line, command line to library for making arguments better.

02:54 I actually didn't take too much of a look at it.

02:56 I just think it's one of the things I wanted to point out was it's kind of interesting that we have yet another way to make command line arguments easier to parse.

03:07 I've lost track now of how many extra libraries there are.

03:11 Yeah, let's see.

03:12 Yeah.

03:12 Let's see.

03:12 Arg pars, click, doc opt, and a bunch more, right?

03:16 Yeah.

03:18 It seemed, I was just thinking that it seems like a rite of passage now that, so if you're serious about being a Python developer and open source and blogging about it, you have to do two things.

03:28 You have to write a tutorial about how generators work, and you have to make your own command line library.

03:33 That's awesome.

03:34 I think you might be right there.

03:36 I haven't done either of those, but I actually do like, I'm making fun of it a little bit, but I'm glad that people are doing it.

03:45 If everybody's trying to solve the problem, it means there's a lot of people not happy with the current solutions, and I'm one of those people that likes to write command line tools, so I say more power to them to write another.

03:58 Yeah, absolutely.

03:59 More power to them.

04:00 That's awesome.

04:00 So the next one that I wanted to talk about is something called safety-db, which is on GitHub, and of course, we'll put all the links to everything we're talking about in the show notes.

04:09 You can check them out in your podcast player.

04:11 But safety-db is interesting because when we write and deploy our applications, especially web applications because they have the biggest surface area and they touch the most people, right?

04:21 They're publicly out on a server.

04:22 We don't just deploy our code.

04:24 We deploy all the packages that we're using in our code and then the packages that those packages depend upon and so on.

04:31 So, for example, on my training website, I've got a mailing list, and I use MailChimp for that.

04:39 MailChimp itself depends upon DocOpt, speaking of command line parsers and so on.

04:43 How do I keep track of all these things and know if there's a security vulnerability?

04:49 Like, what if there's a problem in DocOpt that would let people hack into my server?

04:53 I didn't install DocOpt.

04:54 Maybe it's not even on my mind.

04:55 I don't even realize that it got installed.

04:58 This GitHub set of data keeps that information current so I can check, hey, does my project have any vulnerabilities that I may or may not even be aware of through this sort of hierarchy of dependencies I'm using?

05:11 Wow.

05:12 I didn't realize it was that in full.

05:16 Yeah, so I'm just starting to get into more of the web development aspect of it.

05:20 But I do, I mean, just even with what I do now, I depend on MailChimp and other services.

05:27 I didn't even think about that, of looking at their vulnerabilities.

05:30 Yeah, isn't that crazy?

05:32 There was something with one of the libraries that Flask is built upon a while ago that caused some big hoopla.

05:40 And I'm sure it got patched right away and it was no big deal.

05:43 But, you know, if you don't know that you need to go patch your system, that's a problem.

05:48 So, SafetyDB.

05:50 Yeah, great.

05:51 Cool.

05:51 I'll check it out.

05:52 Check it out.

05:52 All right.

05:53 So, next up, we've got a GitHub project that came to my attention this week called Fast Style Transfer.

06:00 And it's an interesting library that uses TensorFlow, which I've never used TensorFlow.

06:07 Have you used TensorFlow, Mike?

06:08 No, I'm just starting to work on a project, a data science project that is probably using TensorFlow.

06:17 But I've only heard good things.

06:19 I have not yet used it.

06:20 Okay.

06:20 Well, this one is something that makes me want to give it a try because the demo on it looks really cool.

06:27 You take a painting or a picture of some artist's style and you can apply that style to another picture, like your own picture.

06:36 And I've seen something like that before.

06:38 But what I'm really excited about is this will apply it to videos as well.

06:42 And I've always, I guess, ever since the AHA's Take On Me video, I've liked that notion of like an artistic style over a short video.

06:53 It just seems like fun.

06:54 That sounds really fun.

06:55 So, I could like take my training videos and turn them into like a Van Gogh type of thing?

07:01 Yeah.

07:02 Yeah, that'd be great.

07:03 That'd be awesome.

07:04 I don't know if people could read your slides.

07:05 But they would feel very creative while they're watching it.

07:09 No, I can't think of actually any commercial reason why I would try this.

07:15 But the artist in me wants to give it a shot.

07:17 No, it sounds really cool.

07:18 And a chance to play a TensorFlow.

07:19 Also cool.

07:20 Yeah.

07:20 So, another big piece of news that came out this week is PIP.

07:25 So, pip has a full new major version.

07:28 pip 9 is out.

07:29 And it comes with a couple of new features.

07:32 One is the ability to check the installed package dependencies to see if everything is set up correctly there.

07:37 You can use pip show in a less verbose way.

07:41 You can also say pip list and give it a not required option, which will show you all the packages that aren't there because they're a dependency on a thing.

07:51 They're sort of top level, which is really cool.

07:53 There's a ton of fixes that came as part of this release.

07:56 And as you might expect with any major release of some new thing, four days later, pip 901 was released with five fixes for bugs that were introduced in pip 9.

08:06 Yeah.

08:07 And actually, the list of what's in pip 9 is big.

08:12 It is big.

08:12 And one of the things that is cool is there's features in there that I didn't even know existed.

08:18 So, like, the pip check is, like, that's really cool.

08:22 I'm excited about that.

08:23 But also, they're changing the list format, or they're going to.

08:28 So, if you do pip list now on 9, it'll give you a warning that in the future, the default format will be a column format.

08:35 And the column format, you can add now with a --format equals column.

08:40 And it's really easy to read.

08:41 So, I'm excited about that.

08:43 Yeah.

08:43 I'm really excited that this is here as well.

08:45 Packaging is such a foundational, important thing in the whole ecosystem.

08:49 And so, anything that makes that better, I'm for it.

08:53 Yeah.

08:53 And I use, like most people, I'm using virtual environments all over the place.

08:59 And keeping track of what's in every one of them is pretty cool.

09:03 Yeah, it definitely is.

09:05 Okay.

09:05 So, I'm going to jump to one, the next topic that you brought up, which is a Reddit.

09:12 Actually, it's a sort of a blog post and a Jupyter project about the new MacBook Pro and what Reddit users think of the MacBook Pro.

09:24 And I, at first, you brought this up and I'm like, this is ridiculous.

09:28 I know that everybody's upset, like half the people are upset and half the people are excited.

09:33 But most of the people I talk to are developers and they're kind of ticked about all the changes of the MacBook Pro.

09:38 But this Reddit article or this article about incorporating Reddit, it's not about the specifics of that.

09:48 It's using sentiment.

09:49 It's talking about sentiment analysis to analyze all the words used in these Reddit threads and try to determine from the words if people are excited or upset about it.

10:03 And I had never even heard of sentiment analysis.

10:06 So, I think it's neat to show a data analysis project that is topical on something that people are talking about right now.

10:14 Yeah, because we all have a thing that we care about.

10:17 Either we work for a company that makes a product or has some sort of public persona or we have an open source project.

10:23 And it would be nice to know, are people liking my work?

10:26 And this ability to apply sentiment analysis to it, it sounds really cool.

10:30 And I think the MacBook Pro is a great thing to just grab because it's timely.

10:33 Yeah, and I think it'd be kind of neat to – it was interesting to apply it to that and probably safer than applying it to something like current politics because, you know, you don't want to get in the middle of that right now.

10:47 No, no, no, no.

10:48 It's T minus 28 hours or something.

10:52 Nobody wants to get in the middle of that.

10:53 Yeah, cool.

10:54 So, I really thought that was neat as well.

10:56 Personally, I have a MacBook Pro I'm holding out.

10:58 I'm not buying this new one for a while.

11:00 I'm going to see where this whole thing shakes out.

11:02 Yeah, I guess I didn't want to get too much into it, but I was curious about that, if you were going to get a new one or not.

11:07 No, I was ready to, you know, go customize it, turn all the knobs to 11 and say, ship it now.

11:14 But it's just, you know, I think there's too many things that are kind of weird and there's not enough of a benefit, so I'm just going to stick with what I got.

11:21 Yeah, I guess.

11:22 And maybe I'm just not doing anything complicated because I'm on a couple-year-old MacBook Air.

11:28 I just don't need very much power, I guess.

11:31 Yeah, that's good news.

11:33 Okay.

11:34 All right, well, that rounds out our news items for the week.

11:36 But, you know, what else?

11:37 What do you got going on?

11:38 I know you've got the Test of Code podcast going.

11:41 You're working on a book.

11:43 You just did some presentations at a conference.

11:46 Why don't you tell people, like, what's the news around your life?

11:49 Yeah, well, I'm very grateful for November to roll around because there's – I know that – I know a lot of people in startups and I guess I've heard that people are working in San Francisco and stuff, work like crazy hours.

12:02 But I'm not used to that.

12:03 I'm an older engineer and I like my 40-hour work weeks.

12:08 But October has been insane.

12:11 I've been doing our projects at work, which I can't talk about and most of the people listening wouldn't care about.

12:16 But we've been doing, like, 60-hour weeks lately.

12:19 And on top of that, I was doing the Pacific Northwest Software Quality Conference.

12:25 And I got to talk about – it was a couple weeks ago.

12:28 I got to talk at that conference.

12:30 And that was a 90-minute talk.

12:32 And that's actually when I was looking at the PyData videos.

12:35 I was impressed with all the information people got out in 30 minutes because it's, like, three times the amount of content that I got out in, like, 90 minutes.

12:44 So – but it was a good experience to talk.

12:49 I'm actually kind of glad that my first conference experience was a long one because I sure got my jitters out of the way.

12:57 And I'm excited to do a future one.

13:00 And the book's coming along good.

13:03 I'm working with an editor.

13:05 And this upcoming book is focused on pytest alone.

13:11 Yeah, that's excellent.

13:12 What's the title so people know what to look for?

13:14 We don't have a title yet.

13:16 The book about pytest.

13:19 Yeah, the book about pytest.

13:20 But I will definitely share more information with this podcast as it becomes available.

13:27 Yeah, yeah, beautiful.

13:28 How about you?

13:29 What's going on with you?

13:30 Well, I've been on an absolute terror of recording Talk Python in me episodes.

13:35 I recorded six weeks' worth last week.

13:38 So that's – given that I spend about 10 hours of research and prep time on each, that meant a very long week.

13:44 But I'm really, really happy with what I got out.

13:47 The next three that I have coming, you know, we talked about PIP.

13:50 And there's actually a bit of a crisis in the Python ecosystem's core open source infrastructure.

13:56 And so I'm doing a panel session, including the guy who works on PyPI and PIP, Donald Suft.

14:02 And that's called Are We Failing to Fund Python's Core Infrastructure?

14:06 So that comes out this week and also did something which I love the title,

14:10 parsing horrible things with Python with Eric Rose.

14:12 And then Martin Peters, who is either the top or one of the top guys at Stack Overflow as a user,

14:20 a contributor, an answerer, has over half a million reputation in Python.

14:25 And I talked to him about a bunch of the things going on with questions and answers

14:30 and hard problems that they found on Stack Overflow.

14:33 And that's just super revealing.

14:35 I've learned a ton by doing that.

14:36 Wow.

14:37 I can't wait to talk about these.

14:38 Now, is Donald still on PIP?

14:40 Donald is still on PIP.

14:43 But the short, short, short version is he was working at HP.

14:47 HP went through a huge bunch of layoffs, cut his whole division.

14:51 And by way of doing that, basically defunded PyPI.

14:55 Wow.

14:56 So there's other companies like Rackspace and a couple of others.

15:01 I'm sorry, I'm forgetting the names that contribute tons of resources in terms of computing and server and bandwidth.

15:08 Like the bandwidth and infrastructure charges for PyPI are $40,000 a month.

15:13 Wow.

15:13 But there's no funding for people to keep that infrastructure running.

15:17 So anyway, that's what that show is about.

15:19 I can't wait for that.

15:20 Yeah, yeah.

15:21 Check it out if you guys are interested.

15:22 All right.

15:22 Oh, yeah.

15:23 I forgot to mention, I do have, I did talk with Rafael Pierzina.

15:27 I'm probably getting that wrong.

15:29 But that's a podcast coming up for testing code focused on pytest and Cookie Cutter.

15:34 So that's coming up soon.

15:36 Absolutely.

15:37 All right.

15:38 Well, thank you, everybody, for listening to this very first episode of Python Bytes.

15:43 Brian, thank you for launching a new podcast with me.

15:46 I think it's going to be a lot of fun.

15:47 I hope people enjoy it.

15:48 I think it'll be fun, too.

15:49 Thank you, Michael.

15:50 Yeah, absolutely.

15:51 So if you are out there listening and you're like, oh, I have a great news item I want to send you guys,

15:56 just visit pythonbytes.fm.

15:58 There's a way right in the menu bar to click and actually send us your news.

16:02 So if you find something that's cool and you want us to cover it in the next episode,

16:06 be sure to send it our way so that it's on our radar.

16:08 All right.

16:09 Thank you, everyone.

16:10 Talk to you later.

16:11 Bye, Brian.

16:11 Bye.

16:12 Thank you for listening to Python Bytes.

16:14 Follow the show on Twitter via at Python Bytes.

16:17 That's Python Bytes as in B-Y-T-E-S.

16:20 And get the full show notes at pythonbytes.fm.

16:23 If you have a news item you want featured, just visit pythonbytes.fm and send it our way.

16:28 We're always on the lookout for sharing something cool.

16:30 On behalf of myself and Brian Okken, this is Michael Kennedy.

16:34 Thank you for listening and sharing this podcast with your friends and colleagues.

Back to show page