ad1

Friday, November 10, 2006

Open Index of Human Knowledge - Up and Running

Web service at Open Index to Human Knowledge (OIHK) is now available.
Site hopes to create an open source repository of searchable indices of public web pages.
Its goal is to provide a free and open version of the types of indices used by search engine giants to return results so quickly. With permission, below is the description of the service from its web site.

End-point for this service is: http://openindices.conceptexplore.com/scripts/OIHK_XmlRpcServerWithMySql.pl
You must register a user name and password with a valid email address in order to use OIHK Web Services
User name and password are required parameters in every transaction.

Current Data Source Sets (DSS):

  • CE Wiki Snapshot:
    Description - Wikipedia Snapshot of approx 135,000 URLs
    DSS Id - "ce_wiki_snapshot_current"
  • This web service provides one main method named "multisearch" with eight sub-methods having the following names, specific parameters, and return values:
    NOTE:
    All sub-methods take same first four untyped or "struct" type parameters in the following order:
    1 - sub method name
    2 - OIHK registered user name
    3 - OIHK password
    4 - OIHK DSS Id

  • "getMatches" - params specific to this sub method in order =
    1 - word to find matching records for.
    Returns: 1 string representing list of matching words, delimited with "@" character.
  • "getIndexData" - params specific to this sub method in order =
    1 - word to find record for. Returns: 1 string representing record for parameter, see Data Format for description of format of this string.
  • "getFileListNumber" - no extra parameters.
    Returns: 1 string representing number of URLs enumerated in this DSS.
  • "getFileListRange" - params specific to this sub method in order =
    1 - lower URL enumeration
    2 - upper URL enumeration
    Returns: 1 string representing list of file URLs with enumerations from lower URL enumeration to upper URL enumeration, inclusive, that are contained in this DSS.
  • "getFileListSet" - params specific to this sub method in order =
    1 - URL enumeration 1
    2 - URL enumeration 2
    ... n - URL enumeration n
    Returns: 1 string representing list of file URLs with enumerations equal to those listed in parameters.
  • "getMaxFileRangeRequest" - no extra parameters.
    Returns: 1 string representing maximum absolute value difference allowed between two parameters used in "getFileListRange".
  • "getMaxRecLength" - no extra parameters.
    Returns: 1 string representing maximum record length in this DSS.
  • "getSubMethodNames" - no extra parameters.
    Returns: 1 string representing colon delimited list of all sub method names available in this web service.