[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Blogging needs a whole new BAG...




Hi Bob,
I just read the "chat" minutes and thought it (and you especially ;-) raised some interesting points. Good of you to summarise here.


Bob Wyman wrote:

Just as "the web" has a TAG (Technical Architecture Group) I think Blogging needs a BAG (Blogging Architecture Group.) I believe that the charter for either the W3C or IETF Atom group should include a task to form a BAG and produce a Blogging Architecture.

Though I see where you are coming from, I'd argue that Blogging Architecture and Web Architecture are one and the same. I believe it would be most productive for the Atom WG to have a relatively narrow scope, covering the details of the format and API specifications and best practices relating to them. I do however think it would be in scope to carry out some discovery relating to the bigger picture, and if necessary recommending the creation of other groups to take on those tasks. At present there much speculation but little hard evidence of the wider requirements, or at least what there is isn't in a form from which clear courses of action can be decided. Whether the requirements are specific to the blogosphere or the web as a whole certainly isn't clear. But whatever, an Atom WG could act as a foot in the door for future, broader work if necessary.


* Push vs. Pull: Currently, blogging is all about pull and polling. However, it is clear to many that the current approach simply won't scale. We can't have millions of aggregators polling and pulling from millions of Atom files without introducing massive resource consumption issues.

Speculation. I happen to agree that this does look to be the case, but I don't believe there is enough concrete evidence on which to make concrete decisions. I certainly hope and expect there to be considerable work done on Push and P2P with Atom. I suspect this kind of work is essential for the evolution of the web as a whole. But I don't think being alarmist about it is in anyone's interests.


Will W3C be willing to support push-based Atom?

I don't see why not. The Atom format is agnostic. The Atom API may have arisen to fulfil the immediate needs of the current polled-HTTP approach to syndication, but there is every reason for it to enable other approaches.


* Pinging: Blogging benefits from the various "pinging protocols" which currently depend on XML-RPC (a non-W3C protocol).

Pings can be implemented using RESTful approaches, there is no dependency on XML-RPC. But I do agree that lightweight calls (or callbacks) are likely to be extremely useful. The Atom API should provide a good starting point for a consistent approach.


* Trackback: The "trackback protocol" is popular, provides benefit, etc. however, there are a number of major issues that should be addressed. These issues include the same authentication and signature issues that relate to pings. There are also important architectural issues that should be addressed in order to deal with the problem of trackback-spam. Is it possible/desirable to provide for "trackback aggregators" or trackback services on behalf of blogs? Is Trackback really just a special case of Pinging?

The answer to the last question sums this up as far as I can see: "yes". I believe the Atom API can provide a general purpose system that is applicable for all kinds of ping.


Trackback is a generally useful function that should be supported outside the realm of blogging. A major "architectural" issue with the web has always been the fact that links are uni-directional. Explicit trackbacks are one way of providing the backlinks needed to construct bi-directional links. Presumably, bi-directional links would be generally useful. What, if any, modifications would need to be made to trackback in order to make the facility generally useful? Are there extensions to HTML that should be made in order to record "in-bound" links or to provide a means of pointing to where one would discover in-bound links?

The kind of links you are talking about, HTML hypertext links, are only really a special case. There are considerably more versatile forms of linking already well-specified, and to varying extents deployed (RDF, Topics Maps, XLink). Probably the main reason linking on the web is currently unidirectional is that's all the common user agents (HTML browsers) support.


* OPML: Many blogging applications rely on OPML files as a means of recording and exchanging lists of blogs, metadata about them, etc. There is not, however, a well-accepted definition of how one builds an OPML file (although Danny Ayers, among others, has worked on this issue.) and OPML is not the subject of any serious standardization effort. If OPML use is to grow, we really must see some standardization here. Also, we're seeing the growth of applications that rely on the sharing of OPML files. Thus, do we need to extend the concepts of pinging, trackback, push, etc. to include API and update notification support to OPML file maintenance?

I haven't seen anything done with OPML that could be done just as easily and almost certainly in a more interoperable fashion using other established standards such as XHTML. The "big tree" ideas sound fun, but the tree is an unnecessary restriction. The hierarchical approach is one reason why the WWW succeeded and Gopher faded, about ten years ago. That quick and interesting applications can be put together using OPML is interesting, and suggests that the barrier to implementation of proper specifications can be lowered. I've attempted to get something more generally useful out of OPML, but its lack of standardisation results very quickly in diminishing returns. Frustration ensues...


* The role of proxies, synthetic feed generators, systems like PubSub.com, etc. is neither well understood nor defined, however, it is clear that there are today quite a number of these things and there will be more in the future. Effort should be put into at least clarifying the issues related to these intermediary processors and distributors of Atom files.

Yes, yes, yes. In no small part because if RSS gridlock is a likely scenario, the use of intermediaries may offer a route around it.


* Comment support is provided by many blogs and we're beginning to see issues related to comment spamming, etc. Is there something we can do from an architectural point of view to prevent comment spamming? (i.e. do we need something like an IRTF spamming group?) How do we handle remote commenting and what is the place/role of services like SixApart's "TypeKey" service? Should a "TypeKey" standard be developed? How does the W3C Annotea effort relate to commenting and/or blogs in general?

These are issues which call for examination by a Working Group.


* HTTP has limitations that are a true burden in blogging. For instance, there is no server-side support for identifying and retrieving of "fragments" of a resource served by an HTTP server. Thus, you can't say things like "Give me only the entries that have been updated since time XXXX'". Should HTTP be extended to address better the needs of Atom? Should RFC-3229 be extended to define an ATOM specific mechanism for retrieving Atom Fragments?

Did you see the responses on the TAG list relating to Patrick Stickler's proposed additional HTTP verbs? The general feeling was that this probably wasn't such a good idea. Essentially the same question arose as a possible requirement for Semantic Web systems. This isn't a coincidence.


* As Atom and blogging becomes more popular, there will be some sites that generate large quantities of Atom formatted data and others that will consume large quantities of it. This raises the issue of binary or compressed formats for Atom data.

One for the Atom WG or the Binary XML folks - but personally I'd say gzip is your friend.


* Many commercial publishers have expressed an interest in syndication but also express great concern about the various issues related to IPR and usage rights. Is there anything we can do to provide a means to support digital rights management in Atom feeds or systems?

Good question. Another for the WG.


* PICS (Platform for Internet Content Selection) provides a means to tag content to facilitate content selection, rating services, filtering, signing and privacy. We need to consider how PICS relates to the world of blogging.

Ditto.


* P3P (Platform for Privacy Preferences) was designed with web sites in mind. Does P3P work for the kind of systems that would be developed using Atom, the Atom API, and other blogging components? Does P3P need to be extended to address blogging related issues?

Probably.
(Forgive me for bringing this up, but aren't PICS and P3P already available through RDF?)


* Various efforts have been made to support "categories" as meta-data in blogs and blog entries. Most have failed. It seems like these efforts should be formally linked to similar efforts such as the ISO Topic Map or the XTM standards. Ideally, any solution for blogs would also work well in non-blog environments. Do we need standardization of things like "subject indicators" or do the existing standards do the job?

Here I would suggest that discussion with Semantic Web people could really help. This kind of thing is currently on the table for the SW Best Practices group.


These are just some of the issues that rise out of the general problem of blogging. Frankly, I think what is really going on here is that in the blogging world, we're experimenting with and defining much more than just blogs. What we're really doing is working out the details of a whole new way of interacting with the web. Now, as we are passing from the stage of initial experimentation to a stage of consolidation behind an initial set of formal standards, I think we need to expand our vision to include the full range of issues that have been raised. We also need to think about how the web-at-large can benefit from what we've learned in the laboratory of blogging.

I agree entirely, but I think it would be better to set clearly defined short-term goals than attempt to take on everything at once. The best bet IMHO is to work alongside people that are already trying to figure out the bigger picture from other directions.


As said before, blogging is much more than just a file format and an API. Atom and its API should only be considered the first of many steps in defining how blogging, and this style of web use, are to be pursued in the future. Whatever forum we use to define the first steps on this road should be a forum which will be appropriate for resolution and discussion of the other issues as well.
Is the W3C willing and able to deal with more than just the immediate issue of format and API? Is the W3C willing, able, and the best forum to deal with the overall problem of bloggging?


I think the answer to all these questions is probably yes.

Cheers,
Danny.

--

Raw
http://dannyayers.com