Conference planning is starting to ramp up. See our first group of sponsors, and don’t forget the call for papers!
Read MoreThe mobile platform landscape has changed dramatically in the last few months. So much so that organizations who even recently reached decisions on a mobile development strategy should re-visit their decisions. I’m not talking about HTML5 vs app development issues – though those decisions are just as important and directly related because of continued innovation in device and operating system capabilities combined with the need to protect content development and management investments – but about which platforms will be viable, or meet your level of risk tolerance.
What has changed? To over simplify: Apple’s dominance continues to increase and is unassailable in tablets; RIM is not a contender; Microsoft is looking like an up-and-comer; and most surprising to many, Android is looking iffy and is a flop in tablets with the exception of the very Amazon-ized version in the Kindle Fire. These are pretty general statements, but if you are in charge of your company’s mobile development strategy considering their impact is a good place to start a check-up for a possible course correction.
Another place to start is to read the excellent post by Tim Bajarin Why Google Will Use Motorola To Become Vertically Integrated. I won’t summarize because the entire post and the comments are really a must-read.
Read MoreAjay Agarwal from Bain Capital Ventures predicts that because of the confluence of big data and marketing Marketing is the next big money sector in technology and will lead to several new multi-billion dollar companies. His post is succinct and convincing, but there are additional reasons to believe he is correct.
Marketing spending more on IT than IT
Ajay opens his post with a quote from Gartner Group: “By 2017, a CMO will spend more on IT than the CIO”. It is difficult to judge this prediction without evaluating the supporting research, but it doesn’t sound unreasonable and the trend is unmistakable. Our own experience as conference organizers and consultants offers strong support for the trend. We cover the use of web, mobile, and content technologies for enterprise applications, and our audience has historically been 50% IT and 50% line of business or departmental. Since at least 2008 there has been a pronounced and steady increase in the percentage of marketers in our audience, so that 40% or more of attendees are now either in marketing, or in IT but assigned to marketing projects – this is about double what it was in earlier years. While web content management vendors have moved aggressively to incorporate marketing-focused capabilities and are now broadly positioned as hubs for customer engagement, the real driver is the success of the web. Corporate web sites have become the organizations’ new front door; companies have recognized this; and marketers are demanding tools to manage the visitor experience. Even during the peak of the recession spending on web content management, especially for marketing applications, was strong.
“Cloud” computing and workforce demographics have also beefed up marketers’ mojo. The increased ability to experiment and deploy applications without the administrative overhead and cost of IT or of software licenses has encouraged marketers to learn more about the technology tools they need to perform and helped instill the confidence necessary to take more control over technology purchases. A younger more tech-savvy workforce adds additional assertiveness to marketing (and all) departments. Now if only marketers had more data scientists and statisticians to work with…
Big data and big analytics
Big data has not caused, or contributed very much, to the increase in marketing spending to-date. Certainly there are very large companies spending lots of money on analyzing vast amounts of customer data from multiple sources, but most companies still don’t have enough data to warrant the effort of implementing big data technologies and most technology vendors don’t yet support big data technologies at all, or sufficiently. I agree with Ajay though that the “several multi-billion dollar” marketing technology companies that may emerge will have to have core big data processing and analytic strengths.
And not just because of the volume. One of the main reasons for the enterprise software bias for back office applications was that front office applications beyond simple process automation and contact data collection were just too difficult because they required processing unstructured, or semi-structured, data. Big data technologies don’t address all the challenges of processing unstructured data, but they take us a long way as tools to manage it.
The level of investment in this space is much greater than most realize. Ajay is right to invest in it, but he is not alone.
Read MoreThe call for papers for this year’s conference is now open. See information on the topics and instructions.
Read MoreIn keynote sessions at the recent Gilbane Boston Conference, three speakers in a row made points about content management solutions that are also significant to selection and implementation of enterprise search. Here is a list of paraphrased comments.
- From Forrester analyst, Stephen Powers were these observations: 1. The promise has been there for years for an E (enterprise)CM suite to do everything but the reality is that no one vendor, even when they have all the pieces, integrates them well. 2. Be cautious about promises from vendors who claim to do it all; instead, focus on those who know how to do integration.
- Tony Byrne of the Real Story Group observed about Google in the enterprise that they frequently fail because Google doesn’t really understand “how work gets done in the enterprise.”
- Finally, Scott Liewehr of the Gilbane Group stated that a services firm selection is more important than the content management system application selection.
Taken together these statements may not substantiate the current state of the content management industry but they do point to a trend. Evidence is accruing that products and product suppliers must focus on playing nice together and work for the enterprise. Most tend not to do well, out-of-the-box, without the help of expertise and experts.
Nominally, vendors themselves have a service division to perform this function but the burden falls on the buyer to make the “big” decisions about integration and deployment.
The real solution is waiting in the wings and I am increasingly talking to these experts, system integrators. They come in all sizes and configurations; perhaps they don’t even self-identify as system integrators, but what they offer is deep expertise in a number of content software applications, including search.
Generally, the larger the operation the more substantial the number and types of products with which they have experience. They may have expertise in a number of web content management products or e-commerce offerings. A couple of large operations that I have encountered in Gilbane engagements are Avalon Consulting, and Search Technologies, which have divisions each specializing in a facet of content management including search. You need to explore whether their strengths and expertise are a good fit with your needs.
The smaller companies specialize, such as working with several search engines plus tools to improve metadata and vocabulary management so content is more findable. Specialists in enterprise search must still have an understanding of content management systems (CMS) because those are usually the source of metadata that feed high quality search. I’ve recently spoken with several small service providers whose commentaries and case work illustrate a solid and practical approach. Those you might want to look into are: Applied Relevance, Contegra Systems, Findwise, KAPS Group, Lucid Imagination, New Idea Engineering, and TNR Global.
Each of these companies has a specialty and niche, and I am not making explicit recommendations. The simple reason is that what you need and what you are already working on is unique to your enterprise. Without knowledge of your resources, special needs and goals my recommendations would be guesses. What I am sharing is the idea that you need experts who can give value when they are the right experts for your requirements.
The guidance here is to choose a search services firm that will move you efficiently and effectively along the path of systems integration. Expertise is available and you do not need to struggle alone knitting together best-of-breed components. Do your research and understand the differentiators among the companies. High touch, high integrity and commitment for the long haul should be high on your list of requirements – and of course, look for experience and expertise in deploying the technology solutions you want to use and integrate.
Next month I’ll share some tips on evaluating possible service organizations starting with techniques for doing research on the Web.
Read MoreWhy isn’t “search” the logical end-point in any content and information management activity. If we don’t care about being able to find valued and valuable information, why bother with any of the myriad technologies employed to capture, organize, categorize, store, and analyze content. What on earth is the point of having our knowledge workers document the results of their business, science, engineering and marketing endeavors, if we never aspire to having it retrieved, leveraged or re-purposed by others?
However, in Information Week, an article in the September 5, 2011 issue entitled “HP Transformation: Autonomy is a Modest Start” gave me a jolt with this comment: Autonomy has very sophisticated search capabilities including federation–the ability to search across many repositories and sources–and video and image search. But with all that said, enterprise search isn’t a hot, mission-critical business priority. [NOTE: in the print version the "call-out" box had slightly different phrasing but it jumped off the page, anyway.] This is pretty provocative and disappointing to read in the pages of this particular publication.
Over the past few months, I have been engrossed in working on several client projects related to taxonomy development, vocabulary management and integration with content and search systems. There is no doubt that every one of these institutions is focused with laser intensity on getting the search interface to deliver the highest value for the effort and dollars expended. In each case, the project involved a content management component for capturing metadata with solid uniformity, strong vocabulary control, and rich synonym tables for ensuring findability when a search query has different language than the content or metadata. Every step in each of these projects has come back to the acid test, “will the searcher be able to find what they are looking for.”
In past posts I have commented on the strength of enterprise search technologies, and the breath of offerings that cover a wide array of content findability needs and markets. From embedded search (within content management systems, archive and records management systems, museum systems, etc.), to standalone search engines designed to work well in discrete vertical markets or functional areas of enterprises (e.g., engineering, marketing, healthcare, energy exploration) buyers have a wealth of options from which to choose. Companies that have formerly focused on web site management, business intelligence, data mining, and numerous other content related tools are redefining themselves with additional terminology like e-discovery, 360-degree views (of information), content accessibility, and unified information.
Without the search component, all of the other technologies that have been so hot in the past are worthless. The article goes on to say that the hottest areas (of software growth) are business analytics and big-data analysis. Neither of these contributes business value without search underpinnings.
So, let’s get off this kick of under-rating and marginalizing search as “not mission critical” and think very seriously about the consequences of trying to run any enterprise without being able to find the products of our intellectual work output.
Read MoreHere we are, half way through 2011, and on track for a banner year in the adoption of enterprise search, text mining/text analytics, and their integration with collaborative content platforms. You might ask for evidence; what I can offer is anecdotal observations. Others track industry growth in terms of dollars spent but that makes me leery when, over the past half dozen years, there has been so much disappointment expressed with the failures of legacy software applications to deliver satisfactory results. My antenna tells me we are on the cusp of expectations beginning to match reality as enterprises are finding better ways to select, procure, implement, and deploy applications that meet business needs.
What follows are my happy observations, after attending the 2011 Enterprise Search Summit in New York and 2011 Text Analytics Summit in Boston. Other inputs for me continue to be a varied reading list of information industry publications, business news, vendor press releases and web presentations, and blogs, plus conversations with clients and software vendors. While this blog is normally focused on enterprise search, experiencing and following content management technologies, and system integration tools contribute valuable insights into all applications that contribute to search successes and frustrations.
Collaboration tools and platforms gained early traction in the 1990s as technology offerings to the knowledge management crowd. The idea was that teams and workgroups needed ways to share knowledge through contribution of work products (documents) to “places” for all to view. Document management systems inserted themselves into the landscape for managing the development of work products (creating, editing, collaborative editing, etc.). However, collaboration spaces and document editing and version control activities remained applications more apart than synchronized.
The collaboration space has been redefined largely because SharePoint now dominates current discussions about collaboration platforms and activities. While early collaboration platforms were carefully structured to provide a thoughtfully bounded environment for sharing content, their lack of provision for idiosyncratic and often necessary workflows probably limited market dominance.
SharePoint changed the conversation to one of build-it-to-do-anything-you-want-the way-you-want (BITDAYWTWYW). What IT clearly wants is single vendor architecture that delivers content creation, management, collaboration, and search. What end-users want is workflow efficiency and reliable search results. This introduces another level of collaborative imperative, since the BITDAYWTWYW model requires expertise that few enterprise IT support people carry and fewer end-users would trust to their IT departments. So, third-party developers or software offerings become the collaborative option. SharePoint is not the only collaboration software but, because of its dominance, a large second tier of partner vendors is turning SharePoint adopters on to its potential. Collaboration of this type in the marketplace is ramping wildly.
Convergence of technologies and companies is on the rise, as well. The non-Microsoft platform companies, OpenText, Oracle, and IBM are placing their strategies on tightly integrating their solid cache of acquired mature products. These acquisitions have plugged gaps in text mining, analytics, and vocabulary management areas. Google and Autonomy are also entering this territory although they are still short on the maturity model. The convergence of document management, electronic content management, text and data mining, analytics, e-discovery, a variety of semantic tools, and search technologies are shoring up the “big-platform” vendors to deal with “big-data.”
Sitting on the periphery is the open source movement. It is finding ways to alternatively collaborate with the dominant commercial players, disrupt select application niches (e. g. WCM ), and contribute solutions where neither the SharePoint model nor the big platform, tightly integrated models can win easy adoption. Lucene/Solr is finding acceptance in the government and non-profit sectors but also appeal to SMBs.
All of these factors were actively on display at the two meetings but the most encouraging outcomes that I observed were:
- Rise in attendance at both meetings
- More knowledgeable and experienced attendees
- Significant increase in end-user presentations
The latter brings me back to the adoption issue. Enterprises, which previously sent people to learn about technologies and products to earlier meetings, are now in the implementation and deployment stages. Thus, they are now able to contribute presentations with real experience and commentary about products. Presenters are commenting on adoption issues, usability, governance, successful practices and pitfalls or unresolved issues.
Adoption is what will drive product improvements in the marketplace because experienced adopters are speaking out on their activities. Public presentations of user experiences can and should establish expectations for better tools, better vendor relationship experiences, more collaboration among products and ultimately, reduced complexity in the implementation and deployment of products.
Read MoreIf you have been following recent XML Technologies blog entries, you will notice we have been talking a lot lately about XML Smart Content, what it is and the benefits it can bring to an organization. These include flexible, dynamic assembly for delivery to different audiences, search optimization to improve customer experience, and improvements for distributed collaboration. Great targets to aim for, but you may ask are we ready to pursue these opportunities? It might help to better understand the technology landscape involved in creating and delivering smart content.
The figure below illustrates the technology landscape for smart content. At the center are fundamental XML technologies for creating modular content, managing it as discrete chunks (with or without a formal content management system), and publishing it in an organized fashion. These are the basic technologies for “one source, one output” applications, sometimes referred to as Singe Source Publishing (SSP) systems.

The innermost ring contains capabilities that are needed even when using a dedicated word processor or layout tool, including editing, rendering, and some limited content storage capabilities. In the middle ring are the technologies that enable single-sourcing content components for reuse in multiple outputs. They include a more robust content management environment, often with workflow management tools, as well as multi-channel formatting and delivery capabilities and structured editing tools. The outermost ring includes the technologies for smart content applications, which are described below in more detail.
It is good to note that smart content solutions rely on structured editing, component management, and multi-channel delivery as foundational capabilities, augmented with content enrichment, topic component assembly, and social publishing capabilities across a distributed network. Descriptions of the additional capabilities needed for smart content applications follow.
Content Enrichment / Metadata Management: Once a descriptive metadata taxonomy is created or adopted, its use for content enrichment will depend on tools for analyzing and/or applying the metadata. These can be manual dialogs, automated scripts and crawlers, or a combination of approaches. Automated scripts can be created to interrogate the content to determine what it is about and to extract key information for use as metadata. Automated tools are efficient and scalable, but generally do not apply metadata with the same accuracy as manual processes. Manual processes, while ensuring better enrichment, are labor intensive and not scalable for large volumes of content. A combination of manual and automated processes and tools is the most likely approach in a smart content environment. Taxonomies may be extensible over time and can require administrative tools for editorial control and term management.
Component Discovery / Assembly: Once data has been enriched, tools for searching and selecting content based on the enrichment criteria will enable more precise discovery and access. Search mechanisms can use metadata to improve search results compared to full text searching. Information architects and organizers of content can use smart searching to discover what content exists, and what still needs to be developed to proactively manage and curate the content. These same discovery and searching capabilities can be used to automatically create delivery maps and dynamically assemble content organized using them.
Distributed Collaboration / Social Publishing: Componentized information lends itself to a more granular update and maintenance process, enabling several users to simultaneously access topics that may appear in a single deliverable form to reduce schedules. Subject matter experts, both remote and local, may be included in review and content creation processes at key steps. Users of the information may want to “self-organize” the content of greatest interest to them, and even augment or comment upon specific topics. A distributed social publishing capability will enable a broader range of contributors to participate in the creation, review and updating of content in new ways.
Federated Content Management / Access: Smart content solutions can integrate content without duplicating it in multiple places, rather accessing it across the network in the original storage repository. This federated content approach requires the repositories to have integration capabilities to access content stored in other systems, platforms, and environments. A federated system architecture will rely on interoperability standards (such as CMIS), system agnostic expressions of data models (such as XML Schemas), and a robust network infrastructure (such as the Internet).
These capabilities address a broader range of business activity and, therefore, fulfill more business requirements than single-source content solutions. Assessing your ability to implement these capabilities is essential in evaluating your organizations readiness for a smart content solution.
Read MoreIn our recently completed research on Smart Content in the Enterprise we explored how organizations are taking advantage of benefits from XML throughout the enterprise and not just in the documentation department. Our findings include several key issues that leading edge XML implementers are addressing including new delivery requirements, new ways of creating and managing content, and the use of standards to create rich, interoperable content. In our case studies we examined how some are breaking out of the documentation department silo and enabling others inside or even outside the organization to contribute and collaborate on content. Some are even using crowd sourcing and social publishing to allow consumers of the information to annotate it and participate in its development. We found that expectations for content creation and management have changed significantly and we need to think about how we organize and manage our data to support these new requirements. One key finding of the research is that organizations are taking a different approach to repurposing their content, a more proactive approach that might better be called “multipurposing”.
In the XML world we have been talking about repurposing content for decades. Repurposing content usually means content that is created for one type of use is reorganized, converted, transformed, etc. for another use. Many organizations have successfully deployed XML systems that optimize delivery in multiple formats using what is often referred to as a Single Source Publishing (SSP) process where a single source of content is created and transformed into all desired deliverable formats (e.g., HTML, PDF, etc.).
Traditional delivery of content in the form of documents, whether in HTML or PDF, can be very limiting to users who want to search across multiple documents, reorganize document content into a form that is useful to the particular task at hand, or share portions with collaborators. As the functionality on Web sites and mobile devices becomes more sophisticated, new ways of delivering content are needed to take advantage of these capabilities. Dynamic assembly of content into custom views can be optimized with delivery of content components instead of whole documents. Powerful search features can be enhanced with metadata and other forms of content enrichment.
SSP and repurposing content traditionally focuses on the content creation, authoring, management and workflow steps up to delivery. In order for organizations to keep up with the potential of delivery systems and the emerging expectations of users, it behooves us to take a broader view of requirements for content systems and the underlying data model. Developers need to expand the scope of activities they evaluate and plan for when designing the system and the underlying data model. They should consider what metadata might improve faceted searching or dynamic assembly. In doing so they can identify the multiple purposes the content is destined for throughout the ecosystem in which it is created, managed and consumed.
Multipurpose content is designed with additional functionality in mind including faceted search, distributed collaboration and annotation, localization and translation, indexing, and even provisioning and other supply chain transactions. In short, multipurposing content focuses on the bigger picture to meet a broader set of business drivers throughout the enterprise, and even beyond to the needs of the information consumers.
It is easy to get carried away with data modeling and an overly complex data model usually requires more development, maintenance, and training than would otherwise be required to meet a set of business needs. You definitely want to avoid using specific processing terminology when naming elements (e.g., specific formatting, element names that describe processing actions instead of defining the role of the content). You can still create data models that address the broader range of activities without using specific commands or actions. Knowing a chunk of text is a “definition” instead of an “error message” is useful and far more easy to reinterpret for other uses than an “h2″ element name or an attribute for display=’yes’. Breaking chapters into individual topics eases custom, dynamic assembly. Adding keywords and other enrichment can improve search results and the active management of the content. In short, multipurpose data models can and should be comprehensive and remain device agnostic to meet enterprise requirements for the content.
The difference between repurposing content and multipurpose content is a matter of degree and scope, and requires generic, agnostic components and element names. But most of all, multipurposing requires understanding the requirements of all processes in the desired enterprise environment up front when designing a system to make sure the model is sufficient to deliver designed outcomes and capabilities. Otherwise repurposing content will continue to be done as an afterthought process and possibly limit the usefulness of the content for some applications.
Read More



