WMATA’s half-hearted open data hurts everyone

I’ve written before about WMATA’s API for train positions and API for bus route information. This time, it’s WMATA’s API for elevator and escalator status that is cause for concern. It’s good that WMATA provides this data in a machine-readable format—in fact, they’re one of only a handful of agencies to do so—but as with WMATA’s other APIs, the implementation is half-hearted at best.

Inconsistent data, the absence of a formal developer relations mechanism, and unexplained, unannounced outages are bad for everyone. They make WMATA look bad, obviously. But more importantly, they make developers look bad, and reduce the incentive for local developers to build applications using WMATA’s data. When someone finds that an app doesn’t work, or that they’re getting stale, incomplete, or inconsistent data, their first instinct is usually to blame the app or the app’s developer, not WMATA.

What’s specifically wrong with the ELES API?

  • 11-day outage, made worse by non-existant developer relations:
    From March 28 to April 9, 2012, the ELES feed returned static data. This outage was never acknowledged publicly by WMATA, in any medium.

    Because WMATA does not provide any public point of contact for developer relations, there was no way for developers to formally report the problem, nor any way for developers to get useful information like an estimated time to resolution.

    An API outage such as this may seem like the sort of thing that would only impact a handful of transit data nerds, but rest assured, there were absolutely real-world impacts: Elevator-dependent Metrorail users who relied on mobile applications which used data from the API found themselves trapped at stations where the stale data led them to erroneously believe that an elevator was in service.

    While this may have been a one-time problem, the underlying issue remains: how could a critical service have gone down for 11 days with no public notice?

  • Feed missing information from the Web site:
    Like much of the information in WMATA’s open data initiative, the ELES API presents the same data as is presented on WMATA’s Web site…or at least that’s how it’s supposed to be.

    In reality, while the Web site lists “estimated return to service” dates for each elevator/escalator, that information is omitted from the API. In addition, others have observed that the API feed and Web site don’t always seem to be in sync. This could create considerable confusion for riders who sometimes check the Web site directly and sometimes use an app which gets data from the API.

  • Feed missing information necessary for maximum usefulness:
    Before presenting this point, it’s important to explain how the elevator outage information is used by elevator-dependent riders. When an elevator-dependent rider sees that there’s an elevator outage at a transfer station that will affect them, they generally avoid the outage by transferring at another station (for example, at Fort Totten rather than Gallery Place).

    But if it’s at their origin or destination station, then they can either use another nearby station (like Judiciary Square rather than Gallery Place), or they can call for a shuttle.

    Calling for a shuttle is a difficult, time-consuming process, but in many cases, especially for outlying stations, it’s a necessity.

    Neither WMATA’s Web site nor the API contain a key piece of information needed by elevator-dependent riders: where to go to get a shuttle—which station, which exit at that station, etc. This information is displayed on the PIDS, but is simply not available on the Web in any format.

  • No master list of units:
    As I explained when I wrote about WMATA’s performance monitoring program, including the agency’s Vital Signs Report, only summary statistics are available for WMATA’s elevators and escalators. Want to know which specific units have the best or worst track records? Want to know if a major overhaul has improved a unit’s availability? Want to know how the units at transfer stations hold up, compared to their peers at less-trafficked stations? You can’t, at least not with the data in the Vital Signs Report.

    But, that doesn’t mean it’s absolutely impossible to compute those statistics; it just takes more work. For one thing, you can forget about getting historical data. However, if you’re willing to archive data from the ELES API, you can actually create your own statistics. Store that in a database, and over time you’ll build up a record of which units were out of service, and when. Transfer the result of that into an OLAP cube, and you can slice and dice to your heart’s content. Want a report on units at transfer stations? Done. Want stats on outages specifically at peak hours? Done. Want a report just on your home station? Done.

    There’s only one piece missing: a list of all elevators and escalators in the Metrorail system. Why is this necessary? In order to compute statistics with the outage data, we have to know how many units there are—in statistical terms, the universe. Of course, we can find out from WMATA’s Web site that there are a total of 588 escalators, and 239 elevators, but that’s only good enough for computing the same system-wide metric that the Vital Signs Report provides. Any more detailed analysis—like at a per-station level, or a per-line level, or any of the examples given above, requires knowing not just how many units there are, but the IDs of those units, and their locations (so statistics can be computed on a per-station, or even per-unit level).

    If WMATA had made a real commitment to transparency and open data, and if there were a developer liaison appointed, I’d imagine it might take a day or two to get such a master list of units made available as a CSV or XML file—I would have to imagine that somewhere in the 100 TB of data managed by WMATA, there must be a list of these 827 units.

    But there isn’t even anyone to ask for the data. And, to make matters worse, every such request is treated with suspicion and mistrust. There’s no sense of developers working cooperatively with WMATA; it is, from the outset, combative. Yes, some of these data will make WMATA look bad, but some will make the agency look good—especially when it can be shown that a major overhaul, such as is taking place now at Dupont Circle and will soon take place at Bethesda, improves the reliability of the overhauled units. Besides, transparency isn’t about releasing the data that make you look good, it’s about releasing data, period.

What’s the point of all this, then? When General Manager Sarles says that he “[doesn't] want to hide problems”, or that the Metro Forward campaign is making tangible improvements for riders, I expect to see data to back up those assertions.

When elevator-dependent riders have to cope with yet another outage, I don’t want for them to find out for the first time when they get to their destination and the only notice they have is a cone in front of the elevator door. I want for there to be timely (and, more importantly, meaningful) information available, in a wide variety of formats, including a high-quality API that encourages app developers to build tools that further increase the accessibility and further widen the dissemination of that information.

Why do I expect these things? I expect these things because Metrorail is supposed to be “America’s Subway”, a world-class system at the forefront of technological innovation and operational excellence. Right now, it is neither of those things. Instead, it is a system where riders climb up and down stopped escalators in dimly-lit stations and hope that their train does not pass over another poorly-maintained track circuit which shall fail to detect that it has become occupied and engender yet another fatal collision. It is a system where secrecy and the maintenance of fiefdoms are the norm, not transparency and cooperation for the good of the riding public.

I don’t claim that open data (and better still, open data that is timely and meaningful) will solve all of those problems, but it is a small step forward, and a step that WMATA could easily take using its existing infrastructure.

1 Comment

Filed under Articles

WMATA to bidders: don’t innovate with real-time signs

So who will #WMATA pick? A conventional over-priced solution from a big-name transit IT firm, or an emerging open solution?
@kurtraschke
Kurt Raschke

Last week, I asked if WMATA would entertain the idea of using a modern, open-architecture, open source system to power real-time signs at Metrobus stops. Unfortunately, I can now conclusively say that that will not be the case. When I first read the RFP, WMATA had yet to post the complete technical specifications. They’ve now updated the RFP with the complete technical specifications (albeit in an inaccessible, untagged PDF which is itself a scan of a printout).

The specifications state, in part:

Because of the criticality of meeting the firm deadlines, WMATA wants only to procure standard, proven products, from a Vendor with successful existing and fully operational implementations, in multiple transit agencies.

WMATA specifically does not want to assist a Vendor in developing new products for its product line for this RFP.

Furthermore, WMATA wants to select a Vendor that has clearly made a long-term commitment to providing contemporary customer information systems to the transit industry.

So, that’s that. I could go on at great length about how MTA New York City Transit’s success with OneBusAway, and how we could do the same for real-time signs, but I’ve already made that argument.

I’ve long known that WMATA is an agency which is deeply suspicious of change and innovation, and it pains me to see the agency continue to behave in ways that are manifestly counterproductive and which will in all likelihood only deepen the agency’s budget deficit.

Continue reading

7 Comments

Filed under News

Vancouver’s Compass Card and open payment

Translink, the regional transit authority in Metro Vancouver, is in the process of deploying a new fare collection system by Cubic. Central to the system is a new contactless farecard, the Compass Card. The Compass Card will give transit riders in Vancouver an easy way to pay their fares across all of the modes of transit that Translink operates.

As Translink’s FAQ explains, though, there’s more to the system than just the Compass Card:

Q: Will people still be able to pay for fares in cash or by credit/debit card?
A: Customers who still want to use cash can purchase a Compass Card or Compass Ticket at vending machines in stations. Vending machines will also accept debit and credit cards as payment for cash fares. Buses will also still accept cash. Customers can also use their contactless credit card on buses or at the faregates by simply tapping their credit card in the same manner as the Compass Card (don’t forget to tap off). However, customers will be strongly encouraged to use the Compass card for the pricing discounts, convenience and flexibility they offer.
from FAQs – Compass Card and Faregates

This is somewhat of a sharp contrast with the practice in other North American cities who have been contemplating open payment, where there has been a very explicit objective of getting the transit agency out of the business of issuing fare media, and making contactless credit and debit cards the primary means of fare payment. In these proposals, the needs of unbanked and underbanked riders are often addressed through third-party vending of closed-loop (or, in some cases, open-loop GPR) cards.

But in Vancouver, the roles seem to be reversed. The Compass Card and Compass Ticket (presumably either a magnetic strip ticket or limited-use contactless smart card) are being marketed as the primary means of fare payment under the new system, with greater emphasis on the reusable Compass Card. Open payment isn’t even mentioned as such; there’s just a passing mention that riders “can also use their contactless credit card on buses or at the faregates by simply tapping their credit card in the same manner as the Compass Card”, and that’s it.

Conceptually, what Translink is doing with the Compass Card and open payment is similar to what I suggested when Transport for London launched its open payment initiative: open payment may be a boon for the many infrequent riders who will flood the system during the Olympics, but for commuters who ride every day, an Oyster card will remain the best fare medium.

Of course, this approach is also cheaper, at least from a per-transaction perspective—when a rider in Vancouver uses their Compass Card or Compass Ticket, there are no credit/debit interchange fees to pay as there would be if they’d used a contactless credit or debit card.

2 Comments

Filed under Articles

No trip = no fare; it’s only fair

On Friday, the Washington Post’s Dr. Gridlock column featured a complaint from a Metrorail rider whose SmarTrip card was charged even though they’d entered and exited at the same station—in fact, they never went anywhere.

The rider entered the system at Foggy Bottom, intending to travel to Friendship Heights. With Metrorail delays mounting, due to both regularly-scheduled track work and a sick passenger, they decided to give up and seek alternate transportation, and that’s when they found that they were charged, even though they hadn’t actually gone anywhere.

Here’s what the Post’s Robert Thompson had to say about the rider’s plight:

For riders to get their money back when they give up in disgust and leave the same station, Metro officials would have to declare that extraordinary circumstances existed and authorize free exits at the affected stations. If free exits were routinely available, it would be easier for fare evaders to cheat the system.

Local bloggers and transit advocates routinely accuse Thompson (and the Post’s other transportation writers) of being shills for WMATA. I’d rather not use such strong language, but this is one case in which Dr. Gridlock should have stuck up for Metro riders.

There’s no need for WMATA to “authorize free exits”, or do anything that would encourage rampant fare evasion. On the contrary, all that is required is a simple change that ensures the fairness of the fare collection system for all: if a passenger enters and exits at the same station within a reasonable period of time, they should not be charged.

Riders should be free to change their mind and exit the system without being charged even if no “extraordinary circumstances” exist. There might be a major disruption, or there might not—either way, if a rider changes their mind, in a reasonable period of time, then let them out without charging them.
Continue reading

Leave a Comment

Filed under News

An oddity with NextBus short stop titles

The NextBus public API supports “short titles” for routes and stops, which are designed to be formatted for display on mobile devices, and other environments where space is at a premium. Stop and route definitions in the XML returned by the API both contain a title attribute with the full title, and a shortTitle attribute with the short title, if one exists. Unfortunately, there seems to be a bug in the output returned by NextBus which limits the usefulness of this feature.
Continue reading

2 Comments

Filed under Articles

To be successful, open data must be quality data

The listing above is the output of WMATA’s API method for getting a list of bus routes. Over a year ago, a developer reported corruption of the data. The corruption is suggestive of the data being transferred by way of Microsoft Excel, which is something of a frightening prospect in its own right (as Excel is no substitute for a real database, and well-known for corrupting data), but worse, more than a year after the report was made, the problem remains unresolved.

In fact, the problems with the list of bus routes returned are so severe as to make the data unusable without an inordinate amount of post-processing.
Continue reading

1 Comment

Filed under Articles

Journey history as open data, and cooperation with developers

Last April, I blogged about Chromaroma and its use of Oyster journey data from Transport for London. Since then, I’ve continued to hold up Chromaroma as one of the best examples of what can be done with journey data, in spite of a lack of cooperation from the transit authority.

When I first covered Chromaroma, I pointed out that they were screen-scraping the Transport for London site in order to retrieve Oyster journey histories, and I discussed some potential options for avoiding what is an inherently inelegant process, including the use of OAuth for authentication, and the definition of a common format for journey history data interchange.

Now that I’ve proposed the development of a system based on journey data, I’d like to revisit how Chromaroma has been doing.
Continue reading

Leave a Comment

Filed under Articles

On WMATA’s RFP for real-time signs

WMATA recently released an RFP for real-time signs for bus passenger information. Since I don’t want to completely bore people with a series of thousand-word posts as I did for WMATA’s New Electronic Payments Program, I decided to package up my earlier tweets about the RFP, along with a few notes, into a post on Storify.

I’ve also embedded the post below:

Continue reading

1 Comment

Filed under News