Posts tagged ‘web development’
jsoup HTML parser launches
Today, I am announcing the public beta launch of jsoup, an open source Java HTML parser for dealing with real-world HTML.
Event-driven webserver Tornado is now open source »
FriendFeed has released Tornado, a Python non-blocking event-driven webserver and framework, as open source.
The framework is distinct from most mainstream web server frameworks (and certainly most Python frameworks) because it is non-blocking and reasonably fast. Because it is non-blocking and uses epoll, it can handle thousands of simultaneous standing connections, which means it is ideal for real-time web services. We built the web server specifically to handle FriendFeed’s real-time features — every active user maintains an open connection to the FriendFeed servers.
Lucene 2.9 Release Imminent »
Mark Miller reports that:
The third release candidate for Lucene 2.9 is about to hit and the final release is likely to be only days behind. Almost one year in the making, Lucene 2.9 is feature packed and progressively faster. With Solr 1.4 planning to release very shortly after 2.9, things are shaping up very nicely in Lucene land.
In anticipation of the Solr 1.4 release, Eric Pugh has announced that the first book on Solr, Solr 1.4 Enterprise Search Server, has been published and is available for purchase.
Announcing Unicode Lookup
Over the weekend I built Unicode Lookup, a tool that lets you search for any Unicode character by name, or by codepoint number. A table of the characters with their decimal, octal, hex, and HTML entity representations is shown as results.
IE 6 and 7 to auto-update to IE8 »
Starting on or about the third week of April, users still running IE6 or IE7 on Windows XP, Windows Vista, Windows Server 2003, or Windows Server 2008 will get will get a notification through Automatic Update about IE8. This rollout will start with a narrow audience and expand over time to the entire user base. On Windows XP and Server 2003, the update will be High-Priority.
Users can decline the update, and Corporate IT groups can block it, but this is a promising move to bring users up to date, and so to increase web-development efficiency.
Amazon adds sort feature to SimpleDB »
Amazon AWS SimpleDB now supports sortable query result sets. Previously query results came back in insertion order only, but now you can sort on (only) one attribute. This makes a lot of standard relational DB use-cases more feasible for implementation in SimpleDB, as it makes for less data post-processing.
Sorting on only one attribute is still quite limiting, though, and queries still only return object IDs, which forces many further queries to retrieve the full data-set.
Review: Programming Collective Intelligence
Programming Collective Intelligence is a book about applying data mining techniques to analyse collections of data. There is submerged information in Ebay prices, in Facebook profile networks, in collections of movie reviews, in news sites, in the stockmarket; this book by Toby Segaran shows ways to extract, visualise, understand, and predict that information.
Amazon adds persistent storage to EC2 »
Amazon is adding persistent storage as an option to EC2 — currently it’s in private beta.
Previously, disk storage on an EC2 was transient:- when the machine was shut down or crashed, it felt like a hard drive crash. (And you’d lose your IP address too, but Amazon added static IPs a little while ago too.) The path to reliability was to use S3, but that can’t be mounted as a native file system.
The persistent storage appears as a raw, mountable filesystem that needs to be formatted. You’ll be able to make a quick snapshot of the data, for backup. No word on pricing or its performance, but you’d expect it to be aligned with S3.
There’s been the option of mounting S3 in EC2 using davfs, which mounts with WebDAV, but that’s a bit of a hack and one wonders what the performance would be like.
Scalr: auto-scaling web app hosting in EC2 »
Scalr is a fully redundant, self-curing and self-scaling hosting environment utilizing Amazon’s EC2.
It allows you to create server farms through a web-based interface using prebuilt AMI’s for load balancers (pound or nginx), app servers (apache, others), databases (mysql master-slave, others), and a generic AMI to build on top of.
The project is still very young, but we’re hoping that by open sourcing it the AWS development community can turn this into a robust hosting platform and give users an alternative to the current fee based services available.
This looks like it could be great when it develops. I kind of think that Amazon themselves should be providing this kind of executive service to auto-scale and -heal an application deployed in their grid (and wouldn’t be surprised if they add it as their service matures).
“We” being Intridea, a web dev shop
How-to: Optimize your site for speed
Does your website load as quickly as you — and your users — would like? If not, here’s a detailed set of proven guidelines aimed at improving the speed of your site.
