Creating Dictionary Files for Entity Extractors from a SharePoint List

This script creates an XML dictionary file that can be uploaded with the Add-FastSearchResource cmdlet.

 
 
 
 
 
SharePoint
8/30/2010
  • Sorted output makes it way more readable...
    4 Posts | Last post July 12, 2010
    • Towards the end, where the results are output, I recommend changing this line:
      
          foreach ($key in $uniqueKeyValue.Keys)
      
      to
      
          foreach ($key in ($uniqueKeyValue.Keys | sort))
      
      Otherwise, the output comes in random (hash-based) order. While FAST Search will not mind, it makes the file almost unreadable for humans. Manual updates are also easier after this fix.
    • Good idea. So let it be said, so let it be done. For the record though, I never intended for the dictionary files to be updated directly.
    • Thanks Keenan, this is great.
      
      Is there any documentation around creating new Entity Extractors/Dictionaries?
      The closest I've found is this: http://www.informationworker.co.za/Meeting Content/Community Meeting - 2010-04-14 - Fast Search for SharePoint/14 April 2010.pptx
      
    • Well, you get a total of 8 entity extractors in FAST Search 2010: 3 custom Whole Words (matches whole word), 2 Word Parts (begins with), and 3 out-of-the-box (OOB) entities (companies, locations, and personnames). If you want to go beyond that, you need to use the pipeline extensibility features to build your own.
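
The sorting suggestion from the first post in this thread can be tried in isolation. The snippet below is only a minimal sketch: the variable name $uniqueKeyValue comes from the thread, while the sample entries and the "key = value" output format are made-up placeholders, not what the actual script writes.

```powershell
# Hypothetical sample data; the real script fills $uniqueKeyValue from a SharePoint list.
$uniqueKeyValue = @{
    'zebra' = 'Zebra'
    'apple' = 'Apple'
    'mango' = 'Mango'
}

# Enumerating .Keys directly gives no guaranteed order.
# Piping the keys through Sort-Object yields a stable, alphabetical order.
$sortedLines = foreach ($key in ($uniqueKeyValue.Keys | Sort-Object)) {
    # Placeholder output format, not the dictionary file schema.
    '{0} = {1}' -f $key, $uniqueKeyValue[$key]
}

$sortedLines
```

Sorting only the keys leaves the hashtable itself untouched, so any lookups elsewhere in the script should behave as before.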