Class StandardCatalogBuilder

    • Method Detail

      • requestCrawlableDataset

        public CrawlableDataset requestCrawlableDataset​(String path)
                                                 throws IOException
        Description copied from interface: CatalogBuilder
        Return the CrawlableDataset for the given path, null if this CatalogBuilder does not allow the requested CrawlableDataset.

        This method can handle requests for regular datasets and proxy datasets.

        Specified by:
        requestCrawlableDataset in interface CatalogBuilder
        Parameters:
        path - the path of the requested CrawlableDataset
        Returns:
        the CrawlableDataset for the given path or null if the path is not allowed by this CatalogBuilder either due to filtering or due to the path not being a descendant (or self) of the collection level path.
        Throws:
        IOException - if an I/O error occurs while locating the children datasets.
      • generateCatalog

        public InvCatalogImpl generateCatalog​(CrawlableDataset catalogCrDs)
                                       throws IOException
        Description copied from interface: CatalogBuilder
        Return an InvCatalog for the level in the collection hierarchy specified by catalogPath.
        Specified by:
        generateCatalog in interface CatalogBuilder
        Parameters:
        catalogCrDs - the location in the collection at which to generate a catalog
        Returns:
        an InvCatalog for the specified location
        Throws:
        IOException - if problems accessing the dataset collection.
      • generateProxyDsResolverCatalog

        public InvCatalogImpl generateProxyDsResolverCatalog​(CrawlableDataset catalogCrDs,
                                                             ProxyDatasetHandler pdh)
                                                      throws IOException
        Description copied from interface: CatalogBuilder
        Generate the catalog for a resolver request of the given ProxyDatasetHandler. This method is optional, it does not need to be supported by all CatalogBuilder implementations.
        Specified by:
        generateProxyDsResolverCatalog in interface CatalogBuilder
        Parameters:
        catalogCrDs - the location in the collection at which to generate a catalog
        pdh - the ProxyDatasetHandler corresponding to the resolver request.
        Returns:
        the catalog for a resolver request of the given proxy dataset.
        Throws:
        IOException
      • generateCatalogAsDocument

        public org.jdom2.Document generateCatalogAsDocument​(CrawlableDataset catalogCrDs)
                                                     throws IOException
        Description copied from interface: CatalogBuilder
        Return a JDOM Document representation of the catalog for the level in the collection hierarchy specified by catalogPath.
        Specified by:
        generateCatalogAsDocument in interface CatalogBuilder
        Parameters:
        catalogCrDs - the location in the collection at which to generate a catalog
        Returns:
        an org.jdom.Document representing the catalog for the specified location
        Throws:
        IOException - if problems accessing the dataset collection.
      • generateCatalogAsString

        public String generateCatalogAsString​(CrawlableDataset catalogCrDs)
                                       throws IOException
        Description copied from interface: CatalogBuilder
        Return a String containing the XML representation of the catalog for the level in the collection hierarchy specified by catalogPath.
        Specified by:
        generateCatalogAsString in interface CatalogBuilder
        Parameters:
        catalogCrDs - the location in the collection at which to generate a catalog
        Returns:
        a String containing the XML representation of the catalog for the specified location
        Throws:
        IOException - if problems accessing the dataset collection.