As you may recall from my previous post, Knowledge Network (KN) is about unlocking tacit knowledge in the enterprise (versus explicit knowledge). Glen Anderson, Group Product Manager, presented an “under the hood” view of KN, following on the heels of John Hand’s overview presentation yesterday (ref).
The first topic discussed was client profile creation, which is accomplished in four phases: select information (i.e. data sources to analyze such as the primary data source of Outlook folders), run analysis, review profile and publish profile. Client profile creation follows the Notification-Control-Consent (NCC) privacy model I relayed yesterday, which is also shown graphically here:

The initial analysis phase (#2 within the overall client profile creation process) also consists of four phases as follows:
1. Sync
- Read each email. Essentially only the first paragraph of the email body is parsed by KN; attachments are not scanned.
- Capture key “interaction data” in a local Access database.
- Read in contacts from Outlook and IM clients.
- Sync colleagues from SharePoint Portal Server 2007 profile.
2. Contact resolution
- Lookup contacts against the Global Address List (GAL) (MAPI).
- Internal or external?
- Is the contact a distribution list? If so, discard.
- Capture key properties.
3. Update
- Aggregate counts.
- Check thresholds.
- Calculate strength.
4. Recommendation
- “Exclusion lists”
- Special rules
- Limits applied
- Organization name mapping for external contacts
KN analysis is captured locally by the KNClient.log and MDB files located within %UserProfile%\Local Settings\Application Data\Microsoft\Knowledge Network. There are two Access databases on the local client machine: one has the raw data (i.e. parts of emails), and the other reflects your actual profile choices.
After talking about client profile creation, Glen went on to address some of the top questions customers have raised about KN.
Why client-side mining?
- Privacy - Nothing leaves user’s machine until they “publish.”
- Access to information - PSTs, future data sources–I’d certainly like to understand what some of these potential data source might become.
- Distribute the processing.
Why not mine sources other than email?
- Email is by far the richest and most pervasive source today.
- Calculating strength across different data sources adds complexity–I’d like to understand the nature of this complexity in more detail.
How long does the analysis process take?
- Depends on a number of factors (e.g. on the amount of email and unique contacts; on disk performance, RAM, and CPU; on user activity–since analysis process runs at low priority)
- Microsoft has measured anywhere from 5 minutes (3K emails) to 12 hours (120K emails)–further optimization prior to release is planned.
The rest of Glen’s presentation covered SharePoint server integration (e.g. mention of a KN Profile Management Web Service for publishing client-side, Access-based data to the server-side KN store), privacy and anonymous brokering (i.e. a process of connecting seekers and targets anonynously in a manner reminiscent of referrals on LinkedIn), deployment and administration, and extensibility.
On the last subject of extensibility, there is a managed API on the client-side to access the KN database. On the server-side, there are at least two web services: one that exposes a full-fledged query language for expertise/social network people search and another to retrieve (read) and augment (write) profile data (e.g. bootstrap the profile or add custom keywords and contacts). It will be interesting to examine the WSDL of these web services (e.g. via the KN SDK) to understand how to potentially introduce other data sources into the KN system beyond email (e.g. content subscriptions via feeds, authoring trend data from content repositories, discussion thread contributions in collaboration stores, etc.).
Today’s KN session finished with a fair amount of Q&A at the end, indicating clear product interest.
For more KN information, stay tuned here. I also recommend visiting the KN team blog.








4 responses so far ↓
1 Alex // May 29, 2006 at 1:26 am
Hey thanks for your OMPL file… you have an excellent collection of blogs on CMS and WinFS… saved me a lot of time… thanks a lot.
If you are interested I have been creating a sort of mini-WINFS for about the last 5 years… have a look here: http://www.base4.net
Thanks again Craig.
2 Craig Randall // May 29, 2006 at 6:03 am
Glad to be of help to you, Alex.
I did check out your site and given Base4.NET’s raison d’être, your interest in WinFS is crystal clear. I encourage you to engage with the WinFS team, especially given your related domain experience.
Cheers, Craig
3 Alex // Jun 24, 2006 at 4:51 pm
Seems a little pointless now! What with the recent developments huh!
I wonder if MS have given up on the relational file system completely or whether they have just decided to bite the bullet and admit defeat publicly while continuing with less pressure of delivery attached, by publically announcing nothing…
Only time will tell.
Alex
4 Craig Randall // Jun 25, 2006 at 8:16 pm
Your frustration is reasonable, Alex. After all WinFS is not the first (or second or…) attempt Microsoft has made at a relational file system. After more dust settles, though, I still believe it’s worthwhile to digest what occurred, why it happened, what the net net is, and how we can all learn from the exercise. The group behind the “package previously known as WinFS” hasn’t disappeared, and they’re a bright bunch. Stay tuned…
You must log in to post a comment.