edu.unika.aifb.rdf.crawler
Class HTMLInstance
java.lang.Object
|
+--edu.unika.aifb.rdf.crawler.HTMLInstance
- public class HTMLInstance
- extends java.lang.Object
HTMLInstance - process the metainfo extracted from the HTML document.
Initialize either by indicating URI (this results in a cache lookup)
or by passing a StringBuffer with the entire document.
(Probably we might want to implement initialization from Input streams as well).
There are unresolved problems with this class - see comment in function main()
|
Constructor Summary |
HTMLInstance(java.lang.String urlstring,
java.lang.StringBuffer arg1)
Initialize from StringBuffer |
|
Method Summary |
java.util.Vector |
getNs()
|
java.util.Vector |
getRdf()
|
java.util.Vector |
getUri()
|
static void |
main(java.lang.String[] args)
For debugging. |
| Methods inherited from class java.lang.Object |
clone,
equals,
finalize,
getClass,
hashCode,
notify,
notifyAll,
toString,
wait,
wait,
wait |
HTMLInstance
public HTMLInstance(java.lang.String urlstring,
java.lang.StringBuffer arg1)
- Initialize from StringBuffer
getUri
public java.util.Vector getUri()
getNs
public java.util.Vector getNs()
getRdf
public java.util.Vector getRdf()
main
public static void main(java.lang.String[] args)
- For debugging. Should discover all links from aa.html till kk.html