Transcript #40: Packet Manipulation with Scapy
Return to episode page view on github00:00 Hello and welcome to Python Bytes, where we deliver Python news and headlines directly to your earbuds.
00:05 This is episode 40, recorded August 21st, 2017.
00:09 I'm Brian Okken. We have a guest host again, filling in for Michael while he's still on vacation.
00:15 This week we have Eric Chu. Did I pronounce that right? Is it Eric Chu?
00:20 Yeah, so I would usually pronounce Eric Cho, but it's all good.
00:23 Eric Cho. Yeah, okay. Well, Eric has agreed to do it with us.
00:27 He's the author of the book Mastering Python Networking and a self-proclaimed network automation nerd.
00:33 That sounds great. Yeah. Let's jump in with Eric's first topic.
00:37 Yeah, sure. Hey, Brian. So it's good to be here.
00:40 As you mentioned, I first recently finished my book.
00:42 And in the book, I introduced about a dozen or so Python tools and modules.
00:48 And I thought this would be a great chance to kind of introduce three of the top ones, my go-to modules that I mentioned in the book,
00:56 so that I could share with everybody if they're proved to be helpful for other people.
00:59 These topics also show up. We have links on the show notes, but you have information about these in your book as well?
01:05 Yeah, totally. So these are actually...
01:07 So for example, the first tool that I will mention a bit, Ansible, you know, I actually spent two chapters in the book on Ansible.
01:14 So first, just kind of introducing, and then the second would be like more advanced topics, and so on and so forth.
01:19 So these are the ones that I kind of live by. The book is a reflection of the experience I have.
01:25 Okay, great. So let's talk about Ansible.
01:28 Yeah, sure. So if you're, you know, I know we chatted a little bit about ops and so on, but, you know, if you're involved in operations of your company, you probably heard of Ansible.
01:39 So it's a general purpose automation framework.
01:42 And so not just for networking equipment, but it could automate both, you know, servers, cloud operations like Azure and AWS through what they call the modules, like, you know, in their quote unquote playbooks.
01:55 It's a good framework because I think especially for network engineering, because it has the most vendor support.
02:01 So in the network engineering world that it's always the tool kind of lives and dies by vendor support.
02:09 Ansible has the most network support because they're agentless and they could execute locally, calls the APIs on the remote network devices, and therefore allows a lot of flexibility, allows the vendors to shine and so on.
02:22 That's cool. And Ansible is a, it's written in Python? That's neat.
02:26 Yeah, it is written in Python. So that's another thing that's great. It's that you obviously start out as a user of the tool.
02:32 But if you know a little bit about Python, you could actually start contributing to the modules rather quickly.
02:39 So I'm a network engineer by nature, and I don't write a lot of code. But you know, I was able to write modules for it rather quickly.
02:47 But you can drive it, you can also drive it then obviously from if you've got Python scripts or Python modules to control your Ansible tool chain?
02:56 Yeah.
02:56 Or is it something you use outside of that?
03:00 The heart of the tool is actually revolve around playbooks. So they have this playbook, and that is in a YAML format. And in the YAML format, you can actually specify the modules that you want to use. For example, you want to do Cisco iOS for NTP configuration, for example.
03:16 So you use the iOS command module. And within your playbook, you only have to specify the parameters for your like NTP server IP address, for example, that allows you to quickly spin up and be a user of Ansible. But the iOS command module that is written in Python.
03:34 So, you know, if there's a module that you're, you want to use, but does not exist, you could just go ahead and write that module. So for example, for me, I work for A10 networks, and A10 networks didn't really have a lot of modules. So I went ahead and ported one of the modules into our latest API and so on.
03:52 Okay, cool.
03:53 Yeah.
03:53 Well, thanks.
03:54 Yeah, thanks.
03:54 Well, I'm going to jump into a article that I found. And I like the topics. The article is Python practices for efficient code, performance, memory and usability. And I got to like preface this that I've got a lot of opinions about some of this stuff. But I don't want to it's not meant as a disrespect to the author. And I'm going to probably get his name wrong. But it's I think it's Satwick Kensal. I wanted to say it because it's a cool name.
04:23 But it's got six different practices that he talks about. I'll just list them off. Try not to blow off memory, whether you use Python 2 or 3, write beautiful code, speed up performance, analyze your code and testing and continuous integration. I like that these are it's a pretty short article. But these are it's a really good overview of a lot of things that you should care about when you're getting serious about code. So the first off the the memory bit, I think it is important to talk to people about using generators.
04:52 When they're dealing with large data sets, because that's something that by default, my brain doesn't think generators right off the bat. So making sure that generators are used when you can for big things, big number crunching, use NumPy. And a little bit on using classes and strings.
05:10 Actually, I don't see a lot of things. Actually, I don't see a lot of things. Actually, I don't see a lot of people using the mentions don't use plus for formatting large strings. And I just don't see that that much.
05:18 And then his recommendation for classes, if you're going to use classes, use slots. And I say just use adders if you're going to use classes, because adders is awesome. Python 2 versus Python 3 kind of goes on this a lot. But I guess I just say use Python 3 unless you really have to.
05:36 Writing beautiful code. He has a lot of good reasons, I think, for why you should, because you're writing for other people as well as not just, not just yourself.
05:46 But I'm more solid for why I like, I used to be like, everybody should just learn how to read other people's code. But now, I think it's a maintenance cost thing. And maybe it's my foray into management.
05:57 But just picking a style, sticking with it, using static analysis, it's a cost thing. It's going to save you money.
06:04 The next couple topics I can't really talk about too much. Speed up your performance, multiprocessing versus multithreading.
06:12 I guess I got to admit that I don't really use either in most of my Python code so far. And then analyzing your code with cprofile and memory profiler and stuff. I haven't done enough of that to really comment.
06:25 And then the last bit, I have to testing and continuous integration. I really appreciate that he links to pythontesting.net for some of these tutorials. That's awesome.
06:35 But I think for testing, the short answer is use pytest. I didn't really tell you all about what his thoughts are, but it's a well written article. So go read it.
06:44 Yeah, I agree. I read it. And I totally agree with your point about Python 2 and Python 3. I think it's almost I've read studies on how much, you know, users were on Python 3 and Python 2.
06:56 It seems like last year we reached the inflection point where most users are starting to write their new code in Python 3 as opposed to Python 2. I don't know if you agree or not.
07:04 Yeah, I think so. And also, even a couple years ago, there was this, there was a problem with some really crucial projects and modules not and packages not being available for Python 3. But I think that's totally changed. I don't, I think it's just some of the little corner case things that may not even be being used on new projects anymore that are Python 2 only.
07:27 It seems like once Django moved over, it's like, the world's wide open now.
07:32 Yeah, and I think that has a lot to do with just people don't, a lot of new people didn't, they're just following the tutorial and doing the default.
07:39 I think the next hurdle will be making, trying to get all of the default distributions and even Mac to ship with Python 3 instead of Python 2. That'd be good.
07:50 Yeah, good luck with that.
07:52 Because I think, yeah, I think, what is it, Ubuntu 16.04, it shifts with 3.4. But default, you know, if you, I forgot, when you type in Python, does it come out with 3.4 or 2.7?
08:02 On my Mac, it was 2.7. But I've had this Mac for a while. So I'm not sure. So that's all I got on that one. I'm actually pretty interested in this next tool that you're going to talk about. So let's jump into that.
08:15 Yeah, so the second tool I want to mention is the SCAPI program. So it's a Python-based interactive packet manipulation and library. So you could use it both in an interactive mode where you could just kind of fuzzing and trying stuff out.
08:29 And once you're happy with it, you can actually write a program and just reuse it. And so the biggest difference between SCAPI and other tools, like, and I'm just going to be mainly security focused. So like, I say HPing 3, for example.
08:44 So the biggest difference between SCAPI and HPing 3 is that for SCAPI, you're crafting a packet from the ground up. So you're not really limited to the features and functions that was provided by the program itself.
08:55 So for example, you know, using SCAPI, I was able to write a mail form packet, IP version 3, for example, in the packet and see if it passes through my network, or if it crashes my infrastructure, my servers or anything like that.
09:10 So mainly very much security focused tool for me, but I could imagine other people have other use for it.
09:18 So on the tutorial for SCAPI, and it's in the show notes that it shows how to, you know, use ping of death where you have a large payload of the packet and that's a ping.
09:28 See if it crashes your host.
09:30 It's pretty much an outdated vulnerability, but who knows, right?
09:34 Like some new service comes out, we still have this vulnerability, you have your land attacks, and so on and so forth.
09:41 So there's a bunch of useful things that you could test with, or even just fuzzing or, you know, TCP scanning and stuff like that.
09:47 This is actually awesome.
09:49 I can't wait to play with this.
09:50 And in testing and stuff, like being able to muck with 8 or 211 frames.
09:55 Exactly.
09:55 And yeah, this is great.
09:59 Yes.
09:59 And it's been around for a while, you know, a lot of people, I think if you're running just say something quick and something that's already included in say HP 3, go ahead and just use HP 3, right?
10:10 Because it's quick.
10:11 You don't have to spend all these time crafting your packet.
10:13 But for the 20% of stuff that you really need to clamp down on what you want inside of that packet, either through the header or the payload, SCAPI is kind of your go-to tool for that.
10:25 And it's based in Python.
10:26 And use the Python shell for interactive to play with it.
10:29 So that's even better.
10:30 Oh, that's great.
10:31 Cool.
10:31 You definitely have to pass that along with some of the people I know.
10:34 Yeah, let me know how it goes, you know.
10:36 I wanted to talk about Headless Chrome, which I haven't actually played with, but I just heard about it because Miguel Grinberg wrote a post on using Headless Chrome with Selenium.
10:48 And with testing web applications, it's a common practice to use Selenium and Python.
10:54 But usually a lot of people are using Firefox as their default browser to interact with a website.
11:02 And this is a little, he starts off with basically an example that does use, I think he's using Firefox, but if not, it's using a normal.
11:13 Yeah, starts off the Firefox example and then shows how to change it.
11:18 He's using unit test.
11:19 I got to get on him about using pytest more.
11:21 But using a Headless Chrome.
11:24 And the nice thing about that is you don't get the, with a Headless browser, you don't have the dialogue window or the window pop up all the time when you're running tests.
11:33 So the magic trick, I guess, is that there's an option to Selenium with the web driver.
11:40 You pass it a Headless option.
11:42 But like I said, all this, we got a link to Miguel's post in the show notes and he just shows you how to do it.
11:49 Yeah.
11:49 I'm a big fan of Miguel's work.
11:52 You know, obviously he did the mega tutorial on Flask and I learned a lot from that and also backed his Kickstarter campaign that's going to come out next year when he's revising his mega tutorial.
12:04 So I would definitely recommend anything that, anything he published, I would just recommend go read it.
12:09 Yeah.
12:10 He's a super nice guy too.
12:12 He was the only person in Portland that I knew that was going to Europython.
12:16 He took a bunch of rocket stickers for the Pi Test book.
12:21 He just like carried them over there with him so I could have him handed out there.
12:25 So that was cool.
12:26 Yeah.
12:27 I saw him on the last year's Pi Con in Portland, but I was just too shy to go up and say hi.
12:32 Right.
12:32 So, you know, maybe next time I would just say, hey, Miguel, like your work.
12:36 You should.
12:37 He's a super nice guy.
12:38 He's pretty underspoken.
12:40 He won't go out and tell you how awesome he is right off the bat.
12:44 We all know that's funny.
12:46 That's right.
12:46 So again, I'm excited looking at this list.
12:49 It's kind of fun getting new people in to talk about things.
12:52 I love graph visualization tools.
12:54 So tell me about GraphViz.
12:56 Yeah.
12:57 So, you know, as a network engineer, it seems like we spent a large chunk of our time and rightfully so.
13:03 It's part of our job to document the network.
13:06 And you spend a lot of time drawing the graph and using, you know, whether your choice of tool is a Visio or it's Omnigraffle.
13:13 You spend a lot of time diagramming your network.
13:16 But by the time you're done, it's almost the case that your graph is outdated.
13:21 Right.
13:21 Like the network actually just grows.
13:23 You either add a new VLAN, you add a new box or just whatever it is that it's almost because you take so long and it's not automated.
13:32 You're actually manually graphing these network diagrams.
13:35 So this is especially a big problem with the data center being bigger and bigger.
13:40 So now you have what they call the factory network where the middle is huge, where you have server-to-server communication as opposed to server-to-client communication.
13:50 So this huge network presents a big problem because you're having hundreds of links between each of the network layers and doing it manually is just not an option anymore.
14:01 So I think a couple of years ago, I went out and started thinking about trying to come up with a solution.
14:06 And GraphVis is what I saw where you could actually basically describe your network in terms of a text file.
14:13 So I would say in terms of nodes and edges and say these two nodes should be placed together at the very top tier.
14:20 And these four nodes would be in the middle tier and they should line up and so on and so forth.
14:26 So GraphVis is a great tool for doing that.
14:28 And there's a Python module that's also named GraphVis that allows you to come up with this what they call like a .file where you use the Python module, you generate this .file, you run it through GraphVis and voila, you have this beautiful graph that's up to your liking.
14:45 So the only thing that I guess it's me not being didn't learn it enough that, you know, like coloring the graph, like say if I wanted to color all the spine boxes or if I want to color all the leaf devices in a certain color, I still haven't learned how to do that.
15:01 So sometimes when I do need that colorization, I kind of launch OmniGraphle Pro that you could import that graph and then color it to my liking.
15:10 But it's an awesome tool.
15:12 It allows you to automatically generate your graph.
15:16 So it allows you to have this less of a drift between your actual network versus your diagram.
15:22 Okay.
15:22 Nice.
15:24 Actually, I didn't know about this, the Python module for it or the Python package to use it because GraphVis has been around for a long time.
15:35 That's right.
15:35 But working with the .files, even though the format's pretty easy, it still is something else to learn.
15:42 And yeah, I'll have to check out the Python package because that's pretty cool.
15:46 I like that.
15:47 Yeah.
15:48 So I think just the biggest, I mean, if you want a quick graph, you could use PyGal or Matplotlib, even.
15:55 And you just, you know, quickly draw that graph.
15:57 But GraphVis, to me, is the most powerful.
15:59 And I know, you know, a lot of companies use that for drawing their kind of eye candy kind of graph.
16:06 Like, you know, you've seen the Facebook graph where you just have lines all over the globe and so on.
16:10 I mean, that's the kind of graph that GraphVis is capable of doing really, really quickly.
16:14 Well, yeah.
16:15 I don't know if those are helpful, but yeah.
16:17 Those are helpful for marketing materials.
16:20 Yeah.
16:20 Some of the automated code processors use it for drawing, like, class hierarchies and stuff.
16:26 Right, right.
16:27 I mean, and in a shameless plug for my book, I also use the GraphVis tool.
16:31 I use an example of where you're inside of a...
16:35 So use your Python script to go query your network devices and say, basically to say, tell me who your neighbors are.
16:41 And you just kind of walk layer by layer by layer.
16:44 And you construct a GraphVis file that represents your network in real time.
16:49 So every time you could just execute that script and it will give you your most up-to-date network graph automatically.
16:56 So I think that's kind of a good use case for the tool where, like I said, like the biggest problem that I face with network graphs is there's always this drift between the actual network topology versus the graph that you present.
17:08 And if the graph is wrong, nobody's going to use it.
17:10 Yeah.
17:10 Oh, hey, I just remembered a cool example of the .files.
17:15 One of the projects I work on, one of the things that we log is basically a running state machine example.
17:23 An internal state machine that is basically a state machine as it's running and where we are currently.
17:32 That's being output as a log file in the .format so that if whenever we get like a crash or something, we can take that log file and just throw it through GraphVis and see exactly where we're at visually.
17:48 So even a manager could understand it.
17:52 Yeah, like me.
17:53 Yeah.
17:53 So cool.
17:54 Yeah.
17:55 Last topic I'd like to bring up is PyCascades.
17:58 So this is a new thing this year.
18:01 I think it's, I'm going to get it wrong, but I think it's a whole bunch of different Python groups in the area coming together to try to do a regional Northwest Python conference.
18:12 It is in January.
18:13 It's January 22nd, 23rd.
18:15 And then they're going to do sprints the 24th.
18:18 But I like the format.
18:20 I like what they're doing.
18:21 And the tickets are now very expensive.
18:22 I think they're like 100 bucks or something.
18:25 Okay.
18:25 They're not bad.
18:26 And it's in Vancouver, BC, which I, even though I live in Portland, I've never been there.
18:31 So I'm looking forward to go and seeing that.
18:33 And the call for proposals for talks is the 28th.
18:37 So that's like Monday or something like that.
18:39 So when this comes out, people only have a couple days, but I think there's still slots open.
18:44 So please check it out.
18:46 And if you want to do a talk, local talk, I think this is good.
18:48 The talk slots are 25 minutes too, which is a nice length.
18:51 I think that anybody that has a, if you've got expertise on really anything in your job, 25 minutes shouldn't be too hard to fill.
18:59 So I'm going to submit a couple talks and encourage anybody else to as well.
19:04 Yeah.
19:04 So Vancouver is about three hours away from me as well.
19:07 So I have to check it out.
19:08 I mean, I've never been to the Py Cascades, but it seems like a good format.
19:13 And like you said, tickets are cheap.
19:15 Yeah.
19:15 Well, yeah, it's the first one this year.
19:17 So.
19:18 Oh, okay.
19:18 Got it.
19:19 Yeah.
19:19 Yeah.
19:19 That's our six topics that went by really fast.
19:22 It did.
19:23 It did.
19:23 So tell me a little bit about what's up with you.
19:26 How long has Mastering Python Networking been out?
19:29 So it's been out since June 28th, but only the digital version was out.
19:34 So I think you could start ordering the physical book like mid-July.
19:37 So it's been out for about a month and a half.
19:39 I have the link in the show notes.
19:41 Go check it out if you could.
19:43 I really appreciate any kind of feedbacks or just any criticism, good, bad, ugly, whatever
19:49 it is.
19:49 You know, I'm open to it.
19:51 So I appreciate that.
19:53 Nice.
19:53 Yeah.
19:53 So what's happening with you, Brian?
19:54 Well, this last week was the fixing up the copy.
19:58 I got the copy editing changes for the Pyth test book back.
20:02 Oh, that's right.
20:03 And I just got those in yesterday.
20:05 That's going to be out.
20:07 I'm so excited to get the physical copy out.
20:10 So the physical form.
20:11 Yay!
20:11 Yeah.
20:11 The physical form will be like in October.
20:13 But I'm like, as of like, there's like one minor tweak I got to fix tonight.
20:19 Other than that, my hands are completely off.
20:21 Hopefully.
20:22 I keep saying this every week.
20:23 But once it goes in past copy editing and through the rest of it, it's really an author
20:29 doesn't get to interfere process until it comes out as a hard copy.
20:33 So from start of this project to actually this point, how long did it take for you?
20:37 That's a long answer.
20:39 But my interaction with Pragmatic started, I think, June of last year.
20:45 Right.
20:45 June of 2016.
20:46 Okay.
20:46 So a little over a year.
20:48 But I think the first four or five chapters, I think I rewrote those like four times.
20:53 Yeah.
20:54 To try to get it right.
20:55 Yeah.
20:56 Initially, I was kind of coming along the same lines as the tutorials I have on the
21:01 pythontesting.net.
21:03 But getting a lot of trying to understand how to write in a book format instead of a blog
21:08 format, it was a learning experience for me.
21:11 But still, even after doing all that, I think that, I don't think, looking back, a year isn't
21:17 that terrible.
21:18 But there's one of the other authors, Python authors, is Luciano Romalo.
21:22 I wish I knew which book he wrote.
21:25 It's like, I'm going to get it wrong.
21:26 He was one of the reviewers and his feedback was really influential, but added like two
21:31 or three weeks of my work.
21:33 Yeah.
21:34 For me, I think the biggest issue was always the, not on myself, but on the people around
21:39 me.
21:39 It's almost like, you know, I have to be concentrated and focus on the book.
21:44 And then, so the family has to, you know, kind of go away.
21:47 My wife has taken the burden of taking the kids away from me for the time duration I was
21:51 writing the book.
21:52 So, yeah.
21:53 So, it's definitely both a journey experience, but I'm very excited for you.
21:57 You know, I think when you see that paper copy, you're just going to be like, yay, you know,
22:00 makes it real.
22:02 Yeah.
22:03 It definitely is going to be cool.
22:05 The, like you said, trying to find time within your life and trying to get distractions out
22:10 because like you, I've got a full-time job and kids and family.
22:14 And so, it was quite a few late nights at the coffee shop trying to get away from everything,
22:18 but it'll be out.
22:20 So, that's all we got for today.
22:22 And thanks a ton for helping out, Eric.
22:25 And we'll keep in touch.
22:27 Yeah, absolutely.
22:28 Thanks, Brian.
22:28 This has been fun.
22:30 Thank you for listening to Python Bytes.
22:32 Follow the show on Twitter via at Python Bytes.
22:35 That's Python Bytes as in B-Y-T-E-S.
22:39 Get the full show notes, including links at pythonbytes.fm.
22:43 If you have a news story you'd like featured, visit pythonbytes.fm and send it our way.
22:48 We're always on the lookout for sharing something cool.
22:51 This is Brian Okken.
22:52 On behalf of myself and Michael Kennedy, thank you for listening and sharing this podcast with
22:57 your friends and colleagues.