Academic New Years Resolutions

What are your academic weaknesses? What would you like to improve? And in 2012, how will you resolve (see what I did there) to improve them?

I suspect that many of my Academic New Years Resolutions are the same as everyone else’s: write more papers, get grants, teach better, engage with the publics better. To this, other academics might also add: do the work/life balance thing better, go for promotion, and, if many of them are honest, get a big grant and farm out all their teaching to graduate students and RAs – but these aren’t concerns for me at the moment. If we get into more detail, we start to see different sorts of academics at different career stages have quite diverse short-term goals; for some, it might be publishing their first paper (for PhD students); for others, time management or getting more students.

It can be quite difficult to talk about weaknesses in the competitive world of academia, especially if we view those weaknesses as being core to our work (and, let’s be honest, academics have a diverse and difficult to master range of skills which are held to be core to our work). However, I thought I would share the areas where I really want to get better in 2012 -  I’m interested in hearing from others what they think their weaknesses as an academic are, and how they go about improving…

As an academic, I think I have a fairly acute sense of what my strengths and weaknesses are. I’ve had a fair bit of teaching and public engagement experience; on the minus side I’ve led a fairly peripatetic academic existence, and so my publication record is not the jewel in my crown (especially in social sciences) and neither is my substantive grounding. This is sort of the opposite position that most new lecturers find themselves in – typically they will have a very strong research record but perhaps will have had fewer teaching and PE opportunities.

1: Read more and better
I’m still reading around my new subject (only 18 months in). Finding time to do it can be hard, but committing time to regular reading during the working week is really important. I do read (academic!) papers on the commute sometimes, but I’m not someone who will get home and start reading a treatise on subgraph centrality over their steak dinner. Contextualising knowledge, retaining it through note-taking – these all happen differently in social physics compared to medical physicis or a quantum physicis, and I am still learning how to do that in this new field.

Summary: Protect reading time and learn new study habits for organising knowledge systematically

2: Write more
I can be a bit of a perfectionist when it comes to writing, and I am completely aware that this comes from knowing how savaged things get at the review process. As a musician and writer I taught myself early on that work I share with the world will meet criticism, hatred and indifference as well as interest and praise, and taught myself not to care. That’s not a reasonable outlook for academic work, as people’s criticisms have impact on (e.g.) whether the work is published and are often (but not always) useful for improving the work. I personally think that the writing process will become easier as I have more confidence in what I’m presenting, and view criticism as “suggestions for improvement” rather than “an indictment of my poor scholarship”. All of this might seem terribly thin-skinned of me, but being an itinerant academic (I’ve changed fields twice since my PhD) means that there are plenty of times when I don’t know what I’m talking about.

Summary: Learn to be capable and confident in my scholarship and so to respond positively to criticism

3: Get grants
This seems pretty important. I have only a Co-I on a small grant to my name.

Summary: Start applying for grants (duh)

4: Improve teaching
I think I’m a decentish lecturer, so now is the time to build on what I view as a reasonably solid foundation and try to make my teaching better. As hinted above, I’m not someone who especially wants to get some jackpot grant and give all my teaching to a research associate – while I think that it’s useful and important for grad students and RAs to do some teaching, I want to teach and I want to teach well. And a good course will attract more students, so there are cynical as well as idealistic reasons for this, too.

How will I teach better? With the small group we have, class-led activities have worked really well, and I want to continue those and expand them into formative assessment exercises – giving students feedback about their progress and encouraging them to assess themselves and collaborate.

Summary: Improve course content and use group-led assessment

There are lots of ways I want to improve as an academic, but I suspect these will be the ones I focus on most over the next 12 months. If there are other academics and researchers out there who want to share their improvement plans and resolutions for 2012, please leave your comments below the line…. I would suggest the twitter hashtag #acNYR12 but it’s long and incomprehensible.

The loneliness of the long-distance cyclist

One of the big concerns in the use open and public data lies around privacy – whether the information you provide and is collected about could be used to identify you personally. While this might be an issue with respect to governmental or commercial entities, where I work we are very rarely interested! It’s the patterns that arise from groups of people that are interesting, and knowing that the datum I’m observing is Oliver O’Brien and he lives in Chadwick Road, Peckham* does little to add to my analysis. Now, knowing that that a data point lives in SE15, has above median salary and reads the telegraph* might be useful for some sort of analysis – but at no point do I need his actual name, and while useful, his address is not necessary. With all this data from overlapping, geographically-coded data, it’s been argued that it’s relatively easy to identify individuals, especially those in a minority (whether ethnic, fiscal, or other). While this isn’t meant to dismiss people’s concerns, particularly wrt to governmental and political organisations and businesses, I thought it worth stating the counter-example. To wit: at CASA, knowing someone’s name and address is useless – but we are interested in information about groups of people’s income, lifestyle etc.

As an example, this is a visualisation of the journey of one London Bikeshare bike on one day last year. As noted previously, we don’t have GPS data (and as far as I know, it doesn’t exist) so the routes we assign are reasonable guesses** – only the start and end points and timings are known. Secondly, we don’t know who was using the bike – that’s also hidden to us. And seeing the “path” of one bike is (I hope you’ll agree) rather interesting, but doesn’t tell us much about the system as a whole, which is what we actually care about.

And because it’s Christmas, this is what Xmas *last* year looked like for the bike scheme:

Some very slow cyclists there, making their way home after too much turkey and Christmas cheer. Merry Xmas, readers!

*none of this is supposed to reflect the actual @oobr. He’s much too cool to live in Peckham, for a start

** by Ollie O’Brien, Open Street Map and Routino

Clouds across the moon

This movie shows a heatmap of London Bikeshare activity over the course of an average day – red indicates the density of arrivals, cyan the density of departures – and so white areas are where arrival and departures match. Animation by Martin Zaltz Austwick (@sociablePhysics) with help from Oliver O’Brien (@oobr) of UCL-CASA.

This animation scales the intensity of colour to the all-time maximum – which is why the brightest colours occur at rush hour(s). Those two big dots are King’s Cross and Waterloo. This visualisation us better for comparing activity at different timeperiods, but is pretty useless for examining spatial patterns at the quieter times.

This animation scales the intensity of colour to the most intense activity at each time point. This leads to the strange paradox of the animation getting brighter as a whole outside rush hour. This is because many areas are similarly busy and no one area stands out – so many areas appear bright. This visualisation is more useful for understanding geographical patterns at each time point and is useless for comparing total activity at different timeperiods.

So how was this produced? From a network map, surprisingly. I looked at the Transport for London data of bike journeys (covering November 2010-May 2011) and, based on an average of all the data falling on weekdays, constructed a network which told me, minute by minute, how many bikes were on each route. By “route” I mean “edge” as in “it’s 10.33 – how many bikes are travelling between London Bridge and Gower Place”. Then I summed those up – so “At 10.33, how many bikes in total are on journeys that started from London Bridge” and “at 10.33, how many bikes are travelling towards Gower Place”. Network Theorists – this is broadly like in- and out-degree.*

Bear in mind that this is not the same as the number of bikes leaving (arriving) at that time point – it is the number of bikes on the road at that time point that originated (will end up) at that source (destination). The former analysis is easier to do, in fact, but my code was set up for the latter.

That yields a set of points with data about bikes which have left it, and bikes which will arrive at it. The colour scheme could easily be applied to point data, so let’s. Data is scaled to some maximum (the maximum in or out value (whichever’s bigger) either for all time or at the current time, depending on the vis). The colours are overlaid and chosen to be complementary (in this case, red and cyan) – so if the in and out activity is equal, we get White (bright white for strong in, strong out, dimmer grey for weak but equal in and out).

That’s the conceptually tricky part, if you know what Gaussian convolution is – that’s what I did next. I played around with the window until it covered the space reasonably. To speed up the process, I created two Gaussian images (one red, one cyan) with a 3sd extent and used the intensity point data to create a mask which could be used to scale the intensity of each Gaussian. Then the “new” Gaussian could be drawn, centred on the point position, and using the blend() function, the total intensity of the overlapping Gaussians added to create the heatmap. This was repeated for all the points and both the “in” and “out” point data, and when rescaling at each timepoint, a final rescaling was carried out to ensure that the full dynamic range was being used. Using Processing’s built in graphics methods seemed to be faster than “by hand” Gaussian convolution, but there are probably even faster ways to do it. Thanks to Jon Reades for hints on speeding up the calls to the MySql database where the journey data sits.

Possible extensions: cartographers would probably like to see maps. That’s fairly easily done and would enhance readability whilst sacrificing the rather abstract nature, which I like. I would also have to work a bit harder on using graphics methods for the GC if I did that. Another simple extension would be to use actual arrival/departure data rather than the proxy I describe (I suspect this proxy leads to a certain amount of time-smoothing, which has certain advantages and does not massively skew the results, I suspect).

*I divide each bike’s contribution to edge weight by its journey time so a bike on a long journey does not have undue weight on the system over all time just by appearing in multiple time windows. If I did not do this, long journeys would be more important than short ones over the course of the day. I don’t want to dwell on this but thought it important to mention – I will no doubt write about this again in the future.

Academic Podcasting 101

This post was written for the LSE “Impact of Social Sciences” Blog:

At the beginning of November I ran a rather fun workshop in Cardiff called “Podwhating?” (not my title) – dedicated to academic podcasting. Several years of podcasting and talking to people about podcasts (including the brilliant participants at the podcast workshop) has given me a lot of ideas about what a podcast is and isn’t, which I’d like to share with you.

But before I do that, it seems appropriate to explain why an ex-physicist lecturing at UCL’s Centre for Advanced Spatial Analysis is doing telling people about podcasts. Well, I am or have been involved in several. The first, Answer Me This!, is a decidedly unacademic podcast (although one which relies on public engagement) – an independent comedy podcast based on listener questions. We’ve been going for five years and in that time won a Sony Gold and accumulated tens of thousands of regular listeners, all from our living room in Crystal Palace. On the university side, I worked on Bright Club podcast in its first year, and more recently co-founded The Global Lab, UCL-CASA’s in-house podcast focussed on cities, global complexity and the impacts of technology. All of these have taught me different things about format, editing, community-building and how to balance making the damn episodes with all the other responsibilities and obligations a modern researcher/human being has. So without further ado, let me introduce you to the wonderful world of podcasting…

1- Podcasts are not “sexy”. They are no longer the buzz word they were five years ago, or that “blog” was two years before that. People will not be impressed by “your new podcast” any more than they would be by “your new automobile” or “your new Teflon pan”. But podcasts aren’t unsexy in the way MySpace or Google Wave is; they haven’t (yet) been superseded, they’re just not new. They are, however, as effective a way of delivering speech content to the largest number for the smallest budget as you will find – as effective, perhaps, as blogs are for text.

2- Podcasting is cheap. You can basically do it for free. You can do very good ones with access to cheap equipment (microphones, etc). The editing software you are likely to want to use is free (audacity on a pc, garageband on a mac); the Internet hosting services (podbean, libsyn) are cheap or free. This means you can be as niche in your subject matter as you like. This does not mean being niche in your presentation – if you intend to engage non-specialists, you will need to make your work interesting and accessible. But it does mean you can focus on something more specific to your subject area than “the social sciences” if you want to. At the workshop last week, we had people interested in talking about illegal drugs, shipping, university events, Thai tourism, and science policy, among a wide variety of topics.

3- With this is mind, start now. Get podcasting. The sooner you start, the sooner you’ll get better, build an audience, overcome technical hurdles, create a back catalogue of work, learn, improve, enjoy. No-one starts good – so start now.

4- There are some technical hurdles to overcome. Don’t worry about those. Every step of the process is, by now, set up to have the user in mind. GarageBand was designed by Apple to allow every spotty suburban US teenager to be Phil Spector – you’ll manage fine. I’m not going to dwell on those aspects here: I’ve linked to articles that provide some of that information, at the end of this post.

5- Regularity and persistence are important. Sure, you can get a podcast on iTunes and only ever produce one episode a year, or ever. But the strength of a listener having a new show delivered to their Internet-enabled POrtable Device (I use the handy acronym IPOD – which I use to mean iPhone or Samsung or Zune and whatever Android is currently a la mode) is that they come to expect the new show every day, or week, or fortnight – and this is one way podcasting helps you to build an engaged audience.

6- Podcasting is a bit like broadcasting. In the sense that, you’re probably producing speech-based content for other people to listen to. Hopefully a few of them. If you want to make the project two-way, you have to build in mechanisms to do that – in Answer Me This!, listener questions and other listeners’ responses to their questions make it a genuinely two-way (and three-way) experience. By default, podcasting is one-way.

7- Podcasting is not a lot like broadcasting. Your show will not be beamed to the goggle box in the centre of every home, so you have to find audiences. Or create them. Word of mouth is valuable, but think about using social media, cross-promotion and reaching out to existing communities. On the Global Lab, we try to discuss and connect with events, researchers and initiatives in the field – to bring their work to our audience and hopefully, in the process, attract some of theirs. In the world of indie music, this is sometimes referred to as “spreading the love”.

8- Podcasting can be as time-consuming as you want to make it. If you intend to make a daily discussion-based podcast where you do everything and edit an hour of raw audio down to 30 minutes -  good luck. You’d be better advised to work within your limitations. I have a very supportive Head of Department that sees Global Lab as a key component in CASA’s impact and communication strategy (thanks, Andy!) -  but if your HoD is less stellar, you may not find much time in working hours to complete it. Even if you do, you need to balance your input to the project with things like teaching, research and the others elements of your job. At this point, make your life easier – I’m a big believer that creativity flourishes within constraints, so work out how to achieve your goal in a simpler way, or scale it back a bit. You’ll find yourself asking questions like: could it be shorter? Could it be once a week rather than every day? What about once a fortnight? What if other researchers did it every other week? What if they did the editing for me, or we alternated? What if someone else could be responsible for the website and social media? Could we record for 30 minutes rather than an hour? Could it be more scripted so the editing time is less (although the writing/performance time might be more)? Can I develop a streamlined workflow which gets it out the door faster once it’s edited and converted to an mp3? Do I know people with existing skills or equipment that could make this easier and better?

9- Make content -  make it good. Once you’ve got going, have dealt with technical issues, and start to connect with a community/audience, it’s essential to do good stuff. In your early days, if you’re a bit rubbish, people will ignore you -  so don’t be afraid to start out. Equally well, if you’re still a bit rubbish a year later, people will do the same. You won’t generate positive word-of-mouth, and people who do find you will wonder what the fuss is about. So try to get better all the time, remembering that everyone starts off a bit rubbish.

10- Podcasting is not like academic work, where you spend a long time figuring out what some very clever people have said and done, trying to get your head around that, and then tentatively start to add incremental value to their body of knowledge. The best podcasters have a decent idea, some decent microphones, and enough application that they’ve learned to be good at talking into them. That should be well within your abilities.

For more detailed and technical advice, you can visit my posts on the topic, starting with http://sociablephysics.wordpress.com/2011/05/30/get-podcasting/ and Elizabeth Hauke also has some very useful advice here: http://engagingtalk.wordpress.com/2011/06/05/the-5-ps-of-podcasting/ .

Bright Club: Stars – the aftermath

This article originally appeared on the UCL events blog here.

Last Friday, I was lucky enough to be small glimmer amongst a constellation of researchers as Bright Club : Stars took to the stage of the Bloomsbury Theatre. For the uninitiated, Bright Club was originated by Steve Cross at UCL, and is a night where researchers and academics perform ten-minute “sets” about their work. The spots have to be funny, engaging and entertaining – Bright Club is not a conference, and the sets aren’t lectures – so not for nothing has it been called “research stand-up”. Of course, a researcher doing mother-in-law gags would be no funnier than any other new comedian doing mother-in-law gags – what makes it come alive for me is the way the researchers instead create stories, jokes and explorations of their subjects, with all the passion and absurdity that comes with them.

I’ve performed at Bright Club before (in its monthly home at the Wilmington arms pub) but never at (gasp) the Bloomsbury (capacity: 550 people). If the others were as nervous as I was, they certainly didn’t show it. Geographer Jason Dittmer looked like an old hand as compere Lloyd Langford ushered him onstage – and his tales of culture shock for an American in London got the evening off to a great start. Solar Scientist and sometime guest of the BBC’s Infinite Monkey Cage Lucie Green was up next, illuminating us (sorry) with stories of the sun, followed by Jen Gupta, who had had travelled all the way down from Jodrell Bank to tell us about astronomers and their ridiculously large telescopes. Planetary scientist Sheila Kunani had even come dressed to impress – in a child’s star costume. The image of a lady dressed as a star waving around a fluorescent lightbulb (lit using a plasma ball!) like a tiny garish Jedi may be permanently burned onto my retina.

Much of the evening passed by in a blur as I stood quaking backstage – archaeologist Sarah Dhanjal’s outlandish TV show ideas, James Kneale’s warnings of small birds of prey trashing New York, and Nic Canty’s arsenal of punishing publishing puns all flew by. And that’s not to mention The Professional Entertainers – musician Colin Hazel and his desire for flying cars, Helen Keen’s frankly amazing etymology for the acronym NASA (it’s one that you won’t find in any official histories) and songwriter Gavin Osbourne, who delighted the audience with his song about Carl Sagan’s love affair with Ann Druyan – as well as making some comments to the effect that I looked like I’d been working out (as flattering as it was, I haven’t and I don’t and it was a bit baffling). I should explain that Gavin and I are old friends… but not like *that*.

All of this ramped up the pressure significantly when I finally took my first steps onto the stage. The audience felt more like 5000 than 500, but luckily they were gentle with me (especially after overcoming their initial disappointment that I wouldn’t be lifting any weights). After explaining that there is such a thing as a social physicist (I am one), what steam engines have to do with CASA (my mothership in the Bartlett) and conflating Boris Johnson with a blue tadpole, I left the stage shaken and a little stirred.

Backstage, everyone looked relieved, happy and maybe a little drunk. And who could blame us? We’d brought outer space, geography, publishing, music and yes, social physics to the Bloomsbury in a night of 1,000 stars – well, more like ten, but who’s counting?. Next stop, Wembley?

Complex complexity

I’ve only just had the good fortune to read Warren Weaver’s mission statement for the sciences (original paper; transcription), published in American Scientist in 1948, but I’m glad I did. Since I entered academia, I’ve felt that the standard of writing in research papers is really rather mediocre – not poor (writers do generally try to be precise and explanatory) but mediocre as in overlong, over-reliant on previous results (jargon, shorthand and techniques – and this makes very few papers “self-contained” in any meaningful sense), and a bit dull*. No-one expects a technical article to be a page-turner, but there are limits…

Reading Warren Weaver’s publication in American Scientist was like a breath of fresh air, not least because of the tone of postwar optimism but mainly because of it’s readability and accessibility. American Scientist (unlike the symmetrically titled Scientific American) is not aimed at “the general publics” but scientists, so although its audience is not as specialised as a typical journal article (even in Science or Nature), it’s not as broad as mainstream journalism would be. And it is a speculative paper, all of which afford him freedoms not present in normal academic paper-writing.

The paper focusses on what Weaver perceived as the future challenges of science, and separated scientific problems into three broad categories:

Class 1: simple problems (ones which have few constituent elements, even if their description is complicated e.g. the billiard ball on a smooth table),

Class 2: problems of disorganised complexity (many, many elements which may be treated statistically or probabilistically as a result – e.g. the atoms in a gas) and

Class 3: problems of organised complexity (systems composed of a large number of elements, but less than the second category; where these elements’ interaction does not appear to be “random”).

Weaver’s contention is that traditionally physics has done very well with class 1 (the planets, the hydrogen atom, the earth’s gravity) and latterly class 2 (statistical mechanics, kinetic theory) and at the time he was writing, class 3 very much belonged to biologists, sociologists and so on – the messy sciences (my words, not his).

This is a fascinating way in which we can think about categorising scientific problems, but I confess that I’m not clear where many lie. His contention seemed to be that class 2 problems are tractable – you can rely on statistical properties of your system and neglect the influence of an individual; class 1 problems are clearly all about the individual (or small collections thereof), and so class 3 problems are ones where there are many individuals, but the behaviour of each must be considered.

(This recalls to me the realm of mesoscopic physics – on an individual atoms level, we can make reasonable observations and predictions; likewise for a big block of metal or a canister of gas; but what about an intermediate number of atoms – a protein, a buckyball, a DNA strand?)

Are Class 3 problems defined by interactivity? If a single atom in a small canister of gas behaves unexpectedly (i.e. contrary to statistical predictions), it will not have much effect on its neighbours, at least compared to a liquid, or a grain of sand in a sand dune; it will just be an outlier. But this distinction does not seem significant: in fluid dynamics, researchers use techniques to treat liquids as continuous rather than as made up of discrete atoms and molecules; its strong interaction can be regarded as a strength. As in crystalline solids and statistical gases, scientists have found ways of finding parameters that describe macroscopic order which also relate to molecular and/or microscopic quantities. Perhaps, in time, many more of these complex systems will be describable in terms of a small number of variables.

How does one classify these systems? One imagines human beings to be “organized”, as in co-ordinated and with individual behaviours; at the opposite extreme we don’t imagine a canister of gas to have “complex” behaviour. But where is the distinction? Is a liquid a complex system? If we succeed in simplifying our model of a system does it become less complex? Is Complexity, like Entropy, another name for our ignorance?

*really, I’m not claiming my scientific writing is gripping either.