On the AJAX-driven Web, Plain Old Semantic HTML Can Help

For those of us working in specialized industries, putting most of the data we trade in through the strict microformat standardization process is a lofty goal with little reward. Microformats are standardized so that browsers and other software can apply behavior to the marked-up data (say, extracting a vCard and adding it to your email client). Furthermore, they’re generally, if not necessarily, used to mark up very common data (what customer-facing website doesn’t have contact information?) rather than industry- or application-specific data formats.

But surely we’re already using lovely Plain Old Semantic HTML (POSH), and I think it can solve a long-standing problem: getting server-side data into JavaScript data structures on pageload.

The Problem

Any website that remotely resembles an application (blogs included) is working with page templates, which are populated with data from a database on pageload (at least if a responsible, progressively-enhancing web developer is behind it). If and when AJAX functionality is added to these pages, the application’s JavaScript code needs to know about these data.

The goal is to get a chunk of data into the markup and into some sort of JavaScript data storage without making multiple database queries for the same data, repeating myself, or violating any other rules of good coding.

Common Approaches

(Please note that these examples all assume Ruby on Rails and jQuery and are wildly simplified.)

Insert JavaScript into the body of the document:
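As a rough sketch, assuming a Rails view that serializes a hypothetical `@player` object to JSON, the rendered page might contain an inline script along these lines. The variable name, field names, and sample values are illustrative only, borrowed from the baseball example later in this article.

```javascript
// Roughly what the browser receives when the server prints data into an
// inline <script> in the page body. In a Rails view, the object literal would
// be emitted by serializing a hypothetical @player object to JSON.
var playerData = {
  id: "1234567",
  name: "Casey Atthebat",
  photo: "/images/players/casey-atthebat.jpg", // illustrative path
  vitals: { position: "Outfield", bats: "Left", throws: "Right" }
};

// Application code elsewhere on the page can then read the data directly.
function describePlayer(player) {
  return player.name + " (" + player.vitals.position + ")";
}
```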

Not bad, but thou shalt not include JavaScript in the body of the page, right?

Moreover, I would have already spit my data into the view (HTML) file when I generated the markup. I’m only making one database query, but I’m printing my data to the template twice. I’d like to avoid this redundancy.

Make an AJAX call after pageload:
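A minimal sketch of this approach, assuming a hypothetical JSON endpoint at `/players/<id>.json` (the article does not name one). The article assumes jQuery, where `$.getJSON` would typically do this job; the sketch uses the standard `fetch` API instead so that it has no dependencies. The `fetchImpl` parameter is an addition for illustration, so the function can be exercised without a network.

```javascript
// Build the URL for one player's JSON representation.
// The route is an assumption, not something the article specifies.
function playerUrl(id) {
  return '/players/' + encodeURIComponent(id) + '.json';
}

// Request the player's data after the page has loaded.
// `fetchImpl` is optional and defaults to the global fetch.
function loadPlayer(id, fetchImpl) {
  const doFetch = fetchImpl || fetch;
  return doFetch(playerUrl(id)).then(function (response) {
    if (!response.ok) {
      throw new Error('Request failed with status ' + response.status);
    }
    return response.json();
  });
}
```

Calling `loadPlayer('1234567')` once the DOM is ready would return a promise for the player object, at the cost of a second round trip to the server.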

This method eliminates the problem of putting JavaScript in the body of the document, but it means that I’m fetching data that I already fetched when I generated the markup. That is, I’m making the same database query twice.

Generate JavaScript on the server side

There’s another solution, which involves generating JS files on the server side, but since the data I’m trying to get from the server may be different on a request-by-request basis, I’d have to generate the file on every request, rather than compile it once at build or deploy time. This would inhibit or eliminate the possibility of caching, so we can give a pretty definitive thumbs down to this approach. I’m only mentioning it for the sake of thoroughness.

Enter poshformats

According to microformats.org:

the term “poshformats” distinguishes these one-off, ad-hoc or more informal class-name based formats efforts (based on long-standing modern web design POSH practices) from the more formally researched and documented microformats.

A poshformat looks like a microformat, but it hasn’t been defined and vetted with as much rigor.

For example: The markup (poshformat)

Say I work for a website that analyzes baseball statistics. I’m always displaying players’ stats and information on my pages, then doing all kinds of fancy AJAX-y things to these player representations. The POSH markup for one of these player representations might look like this:
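As an illustration only, here is one way such markup could look, held in a JavaScript string so it can be dropped into a test page. The `section.player` element and the name "Casey Atthebat" come from later in this article; the remaining class names (`name`, `photo`, `vitals`, `position`, `bats`, `throws`), the values, and the image path are assumptions.

```javascript
// Hypothetical POSH markup for one player. Every piece of data sits in an
// element whose class name says what the data means.
const playerMarkup = [
  '<section class="player">',
  '  <h2 class="name">Casey Atthebat</h2>',
  '  <img class="photo" src="/images/players/casey-atthebat.jpg" alt="Casey Atthebat">',
  '  <dl class="vitals">',
  '    <dt>Position</dt> <dd class="position">Outfield</dd>',
  '    <dt>Bats</dt> <dd class="bats">Left</dd>',
  '    <dt>Throws</dt> <dd class="throws">Right</dd>',
  '  </dl>',
  '</section>'
].join('\n');
```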

What we have here is a “poshformat.” According to microformats.org, “When the author of that POSH declares it to be a ‘format’ of some sort, then they’ve created a poshformat.” Well, I just did.

For the sake of simplicity, the “player” instance above only includes a name, a photo, and some “vitals.” But it could easily include basic stats, like batting average, hits, and runs, plus advanced stats, like OPS and VORP. Good poshformatting is perfectly extensible.

“But I’ve got the HTML5 itch!”

The HTML5 data-* attributes are commonly used to provide data relevant to JavaScript hook-ins.

“That’s what this is, right?”

Well, sort of.

Regarding data-* attributes, the HTML5 specification says this:

Custom data attributes are intended to store custom data private to the page or application, for which there are no more appropriate attributes or elements.

These player data will appear in search engine results; they may appear in an RSS feed. In other words, they aren’t private. However, what I have isn’t a microformat or RDFa; the labels I provide to it are not for the benefit of these or any other applications outside my page.

A microformats.org article, Microformats in HTML5, says this:

Note that the data-* stuff is explicitly not for microformats. […] They are intended for script authors to have a space in which they can play without ever clashing with anything the browser does. There may be some cases of private poshformats that are never intended for interchange that may be used in data-* attributes.

We’re getting some mixed signals, but they seem to be pointing towards sticking with good old classnames.

Sure, I could use CSS attribute selectors to style elements by their data-* attributes, but classes were always perfectly good for this. And, if I’m bearing in mind the tenets of progressive enhancement, as I should be, I’ll ignore my intention to eventually add JavaScript functionality when marking up the data. Surely, then, I’ll mark it up with good POSH classnames from the beginning, and only add data-* attributes if and when I need them for my scripting.

That said, some data is not displayed but is vital for any JavaScript code that will be making AJAX calls to the server. For instance, it doesn’t do Google or my RSS reader or any average reader of the site any good to know what unique ID Casey Atthebat’s database entry has been assigned, but I need it if I’m telling the server which database row to update. This is where data-* attributes are perfect. I can just add data-player-id="1234567" to the section.player tag, and make sure I look for it when scraping my poshformatted markup. I could also consider using the itemid microdata attribute.

JSONifying our poshformat

I know I’m going to be manipulating this data and the corresponding DOM (Document Object Model) representation via JavaScript, so I want to get it into everyone’s favorite data exchange format, JSON, as soon as the page loads:
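A sketch of the shape I am aiming for, assuming the player carries the ID from the `data-player-id` attribute plus a name, a photo, and some vitals. The property names and values are illustrative, not a fixed schema.

```javascript
// Target structure for one player once the markup has been read into
// JavaScript. Property names are assumptions chosen for this example.
const player = {
  id: "1234567",
  name: "Casey Atthebat",
  photo: "/images/players/casey-atthebat.jpg",
  vitals: {
    position: "Outfield",
    bats: "Left",
    throws: "Right"
  }
};

// The same data serialized as JSON, ready to send back to the server.
const playerJson = JSON.stringify(player, null, 2);
```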

I’ve already reviewed the old ways of doing this and decided I can do better. Here’s where JavaScript and poshformats play nicely.

The POSH way

When my page loads, and I initialize my JavaScript, I can say something like this (I’m assuming jQuery, for simplicity’s sake):
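A minimal sketch of the idea, written against the standard DOM API rather than jQuery so that it runs without dependencies. It assumes each player is an element with class `player`, a `data-player-id` attribute, and descendants with the hypothetical classes `name`, `photo`, `position`, `bats`, and `throws`; the function names are illustrative.

```javascript
// Return the trimmed text of the first descendant matching `selector`,
// or null when no such element exists.
function textOf(el, selector) {
  const node = el.querySelector(selector);
  return node ? node.textContent.trim() : null;
}

// Build a plain object from one .player element.
function extractPlayer(el) {
  const photo = el.querySelector('.photo');
  return {
    id: el.getAttribute('data-player-id'),
    name: textOf(el, '.name'),
    photo: photo ? photo.getAttribute('src') : null,
    vitals: {
      position: textOf(el, '.vitals .position'),
      bats: textOf(el, '.vitals .bats'),
      throws: textOf(el, '.vitals .throws')
    }
  };
}

// Collect every player found under `root` (for example, `document`).
function extractPlayers(root) {
  return Array.from(root.querySelectorAll('.player'), function (el) {
    return extractPlayer(el);
  });
}
```

In a browser, `extractPlayers(document)` called once the DOM is ready would return an array of player objects read straight from the markup, with no extra request and no second copy of the data in the template.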

Problem solved!

A note on Skytap’s use of this technique: SmartClient presents a tricky case. It is possible to open SmartClient for an environment that has no VMs, but to which VMs could later be added by another user or from another page. If SmartClient loads with an empty environment, there is no poshformatted data from which to gather the structure of VM data. In such cases, we reload the page after the first VM is added.

Next steps

The astute reader will point out that the above code can be extended only by adding more lines of code, and that it is not very generic either. That’s true. The next step is to develop a generic way to define a model and automatically extract instances of that model from the poshformatted DOM.
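One possible direction, offered as a sketch rather than a finished design: describe the model as a map from field names to selectors, and let a single function walk that map. The model format, `readField`, and `extractAll` are all hypothetical names invented for this example, and the class names are the same assumed ones as before.

```javascript
// A model names the root selector and maps each field to either a CSS
// selector (read the element's text) or a { selector, attr } rule (read an
// attribute). A rule without a selector reads from the root element itself.
const playerModel = {
  root: '.player',
  fields: {
    id: { attr: 'data-player-id' },
    name: '.name',
    photo: { selector: '.photo', attr: 'src' },
    position: '.vitals .position'
  }
};

// Read one field from `el` according to its rule; null if nothing matches.
function readField(el, spec) {
  const rule = typeof spec === 'string' ? { selector: spec } : spec;
  const node = rule.selector ? el.querySelector(rule.selector) : el;
  if (!node) return null;
  return rule.attr ? node.getAttribute(rule.attr) : node.textContent.trim();
}

// Extract every instance of `model` found under `root`.
function extractAll(root, model) {
  return Array.from(root.querySelectorAll(model.root), function (el) {
    const instance = {};
    Object.keys(model.fields).forEach(function (key) {
      instance[key] = readField(el, model.fields[key]);
    });
    return instance;
  });
}
```

With this shape, supporting a new stat means adding one line to the model rather than another line of extraction code.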
