Build stability: All recent builds failed.

Changes

#109 (Feb 21, 2022, 7:42:18 PM)

  1. Fixed build due to moved packages for annotations — Claus Stadler / githubweb
  2. Fixed a broken test case by switching to RDFDataMgrEx.readAsGiven that leaves relative IRIs of the test data untouched. — Claus Stadler / githubweb

#108 (Jan 26, 2022, 11:08:50 AM)

  1. fix also uri creation bug in ML2Graph by inserting https:// — GitHub / githubweb

#107 (Jan 26, 2022, 11:07:32 AM)

  1. add https:// to strings resulting in URI semantification — GitHub / githubweb

#106 (Jan 26, 2022, 10:02:58 AM)

  1. sime4kg use case start draft — GitHub / githubweb

#105 (Jan 21, 2022, 9:26:28 PM)

  1. adjust titles — GitHub / githubweb

#104 (Jan 21, 2022, 9:25:21 PM)

  1. fix strange md layout — GitHub / githubweb

#103 (Jan 21, 2022, 9:14:35 PM)

  1. ml to machine learning as parent — GitHub / githubweb

#102 (Jan 21, 2022, 9:14:16 PM)

  1. ml to machine learning as parent — GitHub / githubweb

#101 (Jan 21, 2022, 9:13:40 PM)

  1. rename file — GitHub / githubweb

#100 (Jan 21, 2022, 9:10:51 PM)

  1. reorder ml — GitHub / githubweb
  2. reorder chaining — GitHub / githubweb

#99 (Jan 21, 2022, 9:10:18 PM)

  1. reorder ml — GitHub / githubweb

#98 (Jan 21, 2022, 9:07:02 PM)

  1. edit nav order — GitHub / githubweb

#97 (Jan 21, 2022, 9:05:31 PM)

  1. ml to machine learning title adjustment — GitHub / githubweb

#96 (Jan 21, 2022, 9:03:21 PM)

  1. no ch2 — GitHub / githubweb

#95 (Jan 21, 2022, 9:02:59 PM)

  1. no ch — GitHub / githubweb

#94 (Jan 21, 2022, 9:01:17 PM)

  1. layout adjustment — GitHub / githubweb

#93 (Jan 21, 2022, 9:00:30 PM)

  1. list and link original papers — GitHub / githubweb

#92 (Jan 21, 2022, 8:54:53 PM)

  1. link readme ml to GitHub pages overview — GitHub / githubweb

#91 (Jan 21, 2022, 8:48:48 PM)

  1. md optimisation — GitHub / githubweb

#90 (Jan 21, 2022, 8:47:48 PM)

  1. list of modules — GitHub / githubweb

#89 (Jan 21, 2022, 6:21:46 PM)

  1. reorganise docs — carsten.draschner / githubweb

#88 (Jan 21, 2022, 6:11:35 PM)

  1. publication information moved to subchapter — carsten.draschner / githubweb

#87 (Jan 21, 2022, 6:01:55 PM)

  1. rename file — carsten.draschner / githubweb

#86 (Jan 21, 2022, 5:56:49 PM)

  1. rename structure — carsten.draschner / githubweb

#85 (Jan 21, 2022, 5:50:47 PM)

  1. structure — carsten.draschner / githubweb

#84 (Jan 21, 2022, 5:43:44 PM)

  1. documentation structure — carsten.draschner / githubweb

#83 (Jan 21, 2022, 5:25:35 PM)

  1. Bump to jena 4.3.1 (probably there are or will be separate spark/hadoop releases with log4j 2.15.0) — Claus Stadler / githubweb
  2. bump to jena 4.3.2 — Claus Stadler / githubweb
  3. started with new documentation for github pages of sansa ml — carsten.draschner / githubweb

#82 (Dec 10, 2021, 7:11:28 PM)

  1. bump to jena 4.3.0 — Claus Stadler / githubweb

#81 (Dec 10, 2021, 3:05:07 PM)

  1. small path change in readme text — GitHub / githubweb

#80 (Dec 10, 2021, 2:59:06 PM)

  1. updated GitHubio for SimE4KG extension to Sansa — GitHub / githubweb

#79 (Dec 9, 2021, 7:56:42 PM)

  1. Fixed missing start patterns in RecordReaderRdfTrigQuad — Claus Stadler / githubweb

#78 (Dec 9, 2021, 7:14:52 PM)

  1. Fixes for the async parser (somewhat hacky though) — Claus Stadler / githubweb

#77 (Dec 9, 2021, 1:09:05 PM)

  1. Bump commons-compress from 1.20 to 1.21 — GitHub / githubweb

#76 (Dec 9, 2021, 1:08:53 PM)

  1. Bump httpclient from 4.5.12 to 4.5.13 — GitHub / githubweb

#75 (Dec 9, 2021, 1:07:40 PM)

  1. Added a comment to AnnotationMapperTests — Claus Stadler / githubweb

#73 (Dec 8, 2021, 8:14:30 PM)

  1. better usage of verbose option — carsten.draschner / githubweb
  2. put all println into verbose clause — carsten.draschner / githubweb
  3. more println into verbose optional handling — carsten.draschner / githubweb
  4. semantification clean up — carsten.draschner / githubweb

#72 (Dec 8, 2021, 3:27:17 PM)

  1. Fixed build — Claus Stadler / githubweb

#71 (Dec 7, 2021, 2:56:20 PM)

  1. introduce option to filter also by predicate — carsten.draschner / githubweb
  2. filter by predicate and fix bug for features with multiple different types — carsten.draschner / githubweb
  3. started with evaluation classes on top of DBpedia — carsten.draschner / githubweb
  4. add handling for lists of doubles as features over mean — carsten.draschner / githubweb
  5. add handling for lists of timestamp over unix time cast as features over mean — carsten.draschner / githubweb
  6. added more literals to cover lists of double and lists of timestamp — carsten.draschner / githubweb
  7. add comment — carsten.draschner / githubweb
  8. do not collapse lists of timestamps by default — carsten.draschner / githubweb
  9. calculate mean of normalized distances as similarity score for lists of double and unix timestamp casted lists/arrays — carsten.draschner / githubweb
  10. added more data to check another new similarity score case — carsten.draschner / githubweb
  11. offer verbose mode — carsten.draschner / githubweb
  12. set verbose mode — carsten.draschner / githubweb
  13. cache in sfe — carsten.draschner / githubweb
  14. execute availability evaluation on lmdb to show distribution — carsten.draschner / githubweb

#70 (Dec 6, 2021, 3:14:37 PM)

  1. Added hadoop-based async parser — Claus Stadler / githubweb

#69 (Nov 24, 2021, 9:09:50 PM)

  1. start drafting classes for new sim feature — carsten.draschner / githubweb
  2. first distsim on lmdb with minhash between movies for promising candidates — carsten.draschner / githubweb
  3. play around with different feature extractors and further use DistRDF2ML modules for feature extraction — carsten.draschner / githubweb
  4. start with pivot-based feature extracting transformer — carsten.draschner / githubweb
  5. automatic cast dataframe to corresponding literal type and split features if needed by their respective datatype — carsten.draschner / githubweb
  6. bring all information to transform — carsten.draschner / githubweb
  7. make components more compact and broader documentation — carsten.draschner / githubweb
  8. align design to be transformer conform — carsten.draschner / githubweb
  9. integrate smart feature extractor into novel dasim pipeline — carsten.draschner / githubweb
  10. cast numeric values to doubles — carsten.draschner / githubweb
  11. offer first unit test for smartFeatureExtractor — carsten.draschner / githubweb
  12. play a bit around and structure — carsten.draschner / githubweb
  13. also show schema — carsten.draschner / githubweb
  14. started handling of different feature types — carsten.draschner / githubweb
  15. started some playground for word2vec in spark — carsten.draschner / githubweb
  16. clean up — carsten.draschner / githubweb
  17. handling of categorical strings transformed over hashing and IDF (Information Content) weighting — carsten.draschner / githubweb
  18. string column handling by default pipeline as current fallback for non-implemented word2Vec — carsten.draschner / githubweb
  19. calculate similarity values and join those into one df s.t. we can later aggregate those — carsten.draschner / githubweb
  20. introducing option to norm similarity columns and to weight by importance (for the start) — carsten.draschner / githubweb
  21. introduce all weighting factors and finally aggregate similarity score weighted over all features and so on — carsten.draschner / githubweb
  22. reduce distsim dataframe so only unique pairs stay — carsten.draschner / githubweb
  23. first data gathering for semantification — carsten.draschner / githubweb
  24. outline todos for semantification — carsten.draschner / githubweb
  25. - better handling of aggregation of overall similarity value while preserving initial sim values — carsten.draschner / githubweb
  26. - semantification of similarity results — carsten.draschner / githubweb
  27. offer verbose mode — carsten.draschner / githubweb
  28. started with refactoring — carsten.draschner / githubweb
  29. base class for trying our refactored dasim which might become basis for unit tests — carsten.draschner / githubweb
  30. - start gather candidate pairs with distsim — carsten.draschner / githubweb
  31. optimized speed of unique candidates by usage of df functionalities instead of dataset options — carsten.draschner / githubweb
  32. - norm scale similarity score — carsten.draschner / githubweb
  33. removed lines of code now refactored into norm method — carsten.draschner / githubweb
  34. print lines for automatic retrieved weighting maps — carsten.draschner / githubweb
  35. - offer setters for hyperparameters — carsten.draschner / githubweb
  36. usage of parameters in eval script — carsten.draschner / githubweb
  37. scala docs for DaSimEstimator — carsten.draschner / githubweb
  38. wrote first dasim estimator unit test to more easy call it — carsten.draschner / githubweb
  39. new small dataset which can be used for the dasim unit test — carsten.draschner / githubweb
  40. include opportunity for semantification. fixed a bug causing from too much python like slicing — carsten.draschner / githubweb
  41. call semantification within test — carsten.draschner / githubweb
  42. removed debug println — carsten.draschner / githubweb
  43. sample file and tiny changes — carsten.draschner / githubweb
  44. introduce fast minhash gather candidate approach as alternative to distSim, if only partial weightings are given, fill other with zero but show message — carsten.draschner / githubweb
  45. play around with lmdb — carsten.draschner / githubweb
  46. offer fast gather candidate option as alternative to distsim — carsten.draschner / githubweb
  47. more playground for parameters — carsten.draschner / githubweb
  48. more size prints — carsten.draschner / githubweb
  49. adjust considered features — carsten.draschner / githubweb
  50. a class which offers a comparison of gather seeds either by object filter or by sparqlframe, to compare pros and cons — carsten.draschner / githubweb
  51. extended dasim eval by time measure — carsten.draschner / githubweb
  52. not showing intermediate df — carsten.draschner / githubweb
  53. Smart Feature Extraction evaluation pipeline to compare against SparqlFrame — carsten.draschner / githubweb
  54. better interpret dataset of triple — carsten.draschner / githubweb
  55. make filepath args parameter — carsten.draschner / githubweb
  56. some work of replacing df by ds to fix bug — carsten.draschner / githubweb
  57. make limit seeds for eval a parameter within eval class — carsten.draschner / githubweb
  58. make limit seeds within Dasim class settable for eval — carsten.draschner / githubweb
  59. make more params available — carsten.draschner / githubweb
  60. started with readme changes — carsten.draschner / githubweb
  61. offer filter options, include setters, work on filtered ds — carsten.draschner / githubweb
  62. remove limit within tryouts — carsten.draschner / githubweb
  63. Smart Feature Extractor unit tests with testing also the novel setters and filter opportunities to better use it outside the SimE4KG approach and pipeline — carsten.draschner / githubweb
  64. some unit test adjustments — carsten.draschner / githubweb
  65. more code snippets for readme — carsten.draschner / githubweb
  66. adjustments in readme — GitHub / githubweb
  67. adjust unit test especially the file path handling — carsten.draschner / githubweb
  68. calculate availability weighting — carsten.draschner / githubweb
  69. introduced na fill in similarity score if features are not available and give 0 as similarity score — carsten.draschner / githubweb
  70. introduce another sample with null features to cover this case — carsten.draschner / githubweb
  71. if availability is calculated print the distribution — carsten.draschner / githubweb
  72. comment out show statement because it occurs anyway — carsten.draschner / githubweb
  73. add one feature to see that there is a valid availability distribution — carsten.draschner / githubweb
  74. better semantification and easier handling because of removal of parameter but storing meta data in the transformer itself — carsten.draschner / githubweb
  75. small call of semantification within unit test — carsten.draschner / githubweb
  76. add semantification example to readme — carsten.draschner / githubweb
  77. bring back simEdocumentation to readme — GitHub / githubweb
  78. missing closing quotes for code block — GitHub / githubweb
  79. sime4kg Databricks notebook link to readme — GitHub / githubweb

#68 (Nov 24, 2021, 2:14:47 PM)

  1. minor — Lorenz Buehmann / githubweb
  2. minor — Lorenz Buehmann / githubweb
  3. parse any RDF language — Lorenz Buehmann / githubweb
  4. fix test — Lorenz Buehmann / githubweb

#67 (Nov 24, 2021, 2:11:12 PM)

  1. Update main.yml — GitHub / githubweb

#66 (Nov 24, 2021, 12:38:02 PM)

  1. Update pom.xml — GitHub / githubweb

#65 (Nov 24, 2021, 12:23:23 PM)

  1. Update main.yml — GitHub / githubweb

#64 (Nov 24, 2021, 12:22:41 PM)

  1. Update main.yml — GitHub / githubweb

#63 (Nov 24, 2021, 4:48:52 AM)

  1. moved doc folder to root of repo to easy link it — carsten.draschner / githubweb
  2. moved docs from subfolder ml to root folder to make it linkable in github (pages) — carsten.draschner / githubweb
  3. had to change ml to stack in references — GitHub / githubweb
  4. missed one link, changed the here reference — GitHub / githubweb
  5. extended description — GitHub / githubweb
  6. layout adjustment — GitHub / githubweb
  7. Creates a dummy index.html — GitHub / githubweb
  8. Javadoc and Scaladoc 0.8.0 — Lorenz Buehmann / githubweb
  9. Update index.html — GitHub / githubweb
  10. Update overview-frame.html — GitHub / githubweb
  11. Update overview-summary.html — GitHub / githubweb
  12. update API docs — Lorenz Buehmann / githubweb
  13. update docs entry point for Scala docs — GitHub / githubweb
  14. changed header — GitHub / githubweb
  15. fixed typos in docs — GitHub / githubweb
  16. Added DeferredSeekablePushbackInputStream in order to get end of block (EOB) advertised before reading into the next block. — Claus Stadler / githubweb
  17. And optimized the push back input stream away again... — Claus Stadler / githubweb
  18. jena4 upgrade (compiles but untested) — Claus Stadler / githubweb
  19. Fixed a bug with wrong limit computation in InterruptingSeekableByteChannel — Claus Stadler / githubweb
  20. Forgot to add pom.xml — Claus Stadler / githubweb
  21. upgraded r2rml-jena-api — Claus Stadler / githubweb
  22. Added fake JenaSystem for commons-rdf to work — Claus Stadler / githubweb
  23. Extended RddRdfSaver with a setOutputFormat(String) method and added all missing create methods — Claus Stadler / githubweb
  24. Renamed rdf-centric FileInputFormat classes to include 'Rdf' in their name — Claus Stadler / githubweb
  25. RddRdfSaver now has built in support for console output. — Claus Stadler / githubweb
  26. Added a new API for loading rdf into RDDs: e.g. RDD<Dataset> rdd = RdfSourceFactoryImpl.from(sparkSession).get("someResource").asDatasets(); — Claus Stadler / githubweb
  27. upgraded sparqlify — Claus Stadler / githubweb
  28. fixed missing base url in datalake r2rml mapping — Claus Stadler / githubweb
  29. Added trig/quad registration; added initial ghpages setup — Claus Stadler / githubweb
  30. Ok, there is already a docs branch.. — Claus Stadler / githubweb
  31. Removed my docs resources — Claus Stadler / githubweb
  32. Added just-the-docs config — Claus Stadler / githubweb
  33. Added sansa logo — Claus Stadler / githubweb
  34. Moved config to docs folder (not sure if gh pick it up there) — Claus Stadler / githubweb
  35. Added a few pages — Claus Stadler / githubweb
  36. Excluded javadoc / scaladoc folders — Claus Stadler / githubweb
  37. Trying to get the edit page on github link working — Claus Stadler / githubweb
  38. fixed typo — Claus Stadler / githubweb
  39. Minor update — Claus Stadler / githubweb
  40. Added more structure — Claus Stadler / githubweb
  41. More work on docs — Claus Stadler / githubweb
  42. set docs branch to jena4 for now — Claus Stadler / githubweb
  43. Updated doc — Claus Stadler / githubweb
  44. Increased default probe count for RecordReaderGenericBase to 100 — Claus Stadler / githubweb
  45. Added support for Lang attribute to RdfSourceFactory. — Claus Stadler / githubweb
  46. Updated imports — Claus Stadler / githubweb
  47. Bumped to a snapshot version of aksw-commons — Claus Stadler / githubweb
  48. update doc links for Sansa  ml especially distsim — GitHub / githubweb
  49. update distsim docs — GitHub / githubweb
  50. add docs for distrdf2ml — GitHub / githubweb
  51. Added operator to map named models to resource in datasets — Claus Stadler / githubweb
  52. Added Kryo Serializer for Binding — Claus Stadler / githubweb
  53. Renamed folder containing '.' — Claus Stadler / githubweb
  54. bump to jena 4.1.0 — Claus Stadler / githubweb
  55. upgrade to jena 4.2.0 — Claus Stadler / githubweb
  56. Added support for parallel ingestion of json arrays — Claus Stadler / githubweb
  57. Removed debug output — Claus Stadler / githubweb
  58. Added java version for aggregation using the java collector api — Claus Stadler / githubweb
  59. Towards adding a tarql-like operator — Claus Stadler / githubweb
  60. Added splittable multiline csv support to the generic parser framework — Claus Stadler / githubweb
  61. Removed hard coded path (need to prepare a csv test file yet) — Claus Stadler / githubweb
  62. Added tarql mapper — Claus Stadler / githubweb
  63. regex for seeking csv record offsets greatly improved (my most arcane regex to date) — Claus Stadler / githubweb
  64. There are now tarqlTriples and tarqlQuads methods in JavaRddRxOps. — Claus Stadler / githubweb
  65. Pretty much finished CSV integration; added options for setting csv formats and added test data — Claus Stadler / githubweb
  66. Skip header record is now respected by the RecordReaderCsv — Claus Stadler / githubweb
  67. Consolidated naming for rdd ops and hadoop formats — Claus Stadler / githubweb
  68. Added JavaRddOfBindingsOps class — Claus Stadler / githubweb
  69. Consolidated naming of the java rdd operators — Claus Stadler / githubweb
  70. Fixed use of a non serializable lambda in the ops — Claus Stadler / githubweb
  71. Added univocity csv parser because commons-csv apparently can't handle CR (without LF) in unquoted fields... — Claus Stadler / githubweb
  72. Switched csv implementation to univocity in CsvDataSources — Claus Stadler / githubweb
  73. Added a small rdd transformation framework — Claus Stadler / githubweb
  74. Improved documentation on rdd transformation chaining interfaces — Claus Stadler / githubweb
  75. Added documentation for chaining — Claus Stadler / githubweb
  76. Fix for chaining doc — Claus Stadler / githubweb
  77. Moved java operators for jena to its own lightweight package — Claus Stadler / githubweb
  78. Moved the JavaRddOfBindingsOps from query layer to the sansa-spark-jena-java package — Claus Stadler / githubweb
  79. Reorganized packages in the java module — Claus Stadler / githubweb
  80. Added a kryo registrator to the sansa-spark-jena-java package — Claus Stadler / githubweb
  81. Added a call to JenaSystem.init() within mapPartitions because it seemed an output format was not found — Claus Stadler / githubweb
  82. Removed JenaSystem.init() because RDFLanguagesEx is the better place — Claus Stadler / githubweb
  83. Updated ResourceInDataset related imports — Claus Stadler / githubweb
  84. Modified rdd rdf writer api to allow for validation BEFORE running a job — Claus Stadler / githubweb
  85. Some cleanup of the revised RddRdfWriter — Claus Stadler / githubweb
  86. Allowed passing subclasses of Dataset/DatasetGraph etc to the RddRdfWriter — Claus Stadler / githubweb
  87. Changed API for RDD<Dataset> to RDD<DatasetOneNg>; this allows for enforcing only a single named graph per rdd entry (and getting the graph name) while retaining query capabilities. — Claus Stadler / githubweb
  88. Added serializer for DatasetOneNg — Claus Stadler / githubweb
  89. Fixed spelling mistake in class name — Claus Stadler / githubweb
  90. Fixed compile errors due to api changes in the rdf reader — Claus Stadler / githubweb
  91. upgraded the lower levels to jena 4.3.0-SNAPSHOT (requires jena to be built from git!) — Claus Stadler / githubweb
  92. added DistAD material — f.bakhshandegan / githubweb
  93. rdf layer now working with jenax / query layer not yet — Claus Stadler / githubweb
  94. Fixed query and cli modules for jenax / jena 4.3.0 — Claus Stadler / githubweb
  95. Disabled flink modules; all spark modules building but some tests fail — Claus Stadler / githubweb
  96. started with SimE4KG documentation on GitHub pages — GitHub / githubweb
  97. Update index.md — GitHub / githubweb
  98. Update index.md — GitHub / githubweb
  99. Update index.md — GitHub / githubweb
  100. Farshad DistAD support to commit to develop ReadMe — GitHub / githubweb
  101. SimE4KG provide Databricks notebooks link — GitHub / githubweb
  102. Update index.md — GitHub / githubweb
  103. jena branch now has its own version; upgraded to latest jena4.3.0 snapshot — Claus Stadler / githubweb
  104. Set version back to 0.8.0-RC2-SNAPSHOT — Claus Stadler / githubweb

#62 (Jun 16, 2021, 9:46:57 AM)

  1. integrate hashing as alternative to indexing of categorical strings. adjusted also unit tests and offer new setter — carsten.draschner / githubweb
  2. integrated getter for feature vector descriptions — carsten.draschner / githubweb
  3. offer semantic representation of transformer hyperparameters — carsten.draschner / githubweb

#61 (May 31, 2021, 4:00:34 PM)

  1. created new file for unit test having datetime information and will result in partial null values which should be covered by functionalities of sva — carsten.draschner / githubweb
  2. changed sva sample file. now with datetime info within a second test — carsten.draschner / githubweb
  3. add lines to show how to store example metagraph within DistRDF2ML pipeline — carsten.draschner / githubweb
  4. handling and tracking of timestamp null values and offer replacement — carsten.draschner / githubweb

#60 (May 28, 2021, 12:28:10 PM)

  1. update readme with regards to DistRDF2ML — GitHub / githubweb

#59 (May 26, 2021, 1:19:49 PM)

  1. Update README.md — GitHub / githubweb

#58 (May 25, 2021, 10:36:28 AM)

  1. initial pipeline for platoon 3a demo — carsten.draschner / githubweb
  2. better support datetime timestamp and split features — carsten.draschner / githubweb
  3. Towards support for temporal datatypes in the schema mapper — Claus Stadler / githubweb
  4. SchemaMapper should now work with temporal datatypes — Claus Stadler / githubweb
  5. support of datetime within smart vector assembler — carsten.draschner / githubweb
  6. rm pl related data — carsten.draschner / githubweb

#57 (May 17, 2021, 2:06:55 PM)

  1. MmDistSim collect feature sets — carsten.draschner / githubweb
  2. MmDistSim span up exploded dataframe with feature ordering — carsten.draschner / githubweb
  3. calculate jaccard for each feature as baseline — carsten.draschner / githubweb
  4. many changes — carsten.draschner / githubweb
  5. next features: — carsten.draschner / githubweb
  6. some changes: — carsten.draschner / githubweb
  7. some changes: — carsten.draschner / githubweb
  8. remove slowing down count for print statement — carsten.draschner / githubweb
  9. time catching — carsten.draschner / githubweb
  10. more time catching — carsten.draschner / githubweb
  11. reduce complexity by hero filter — carsten.draschner / githubweb
  12. some count to ensure non lazy evaluation — carsten.draschner / githubweb
  13. make write optional — carsten.draschner / githubweb
  14. total exp time — carsten.draschner / githubweb
  15. started with feature identifier — carsten.draschner / githubweb
  16. started with transformer pipelines for string based features — carsten.draschner / githubweb
  17. running version to get digitized dataframe but huge memory need — carsten.draschner / githubweb
  18. fixed bug of wrongly joined df and removed cache — carsten.draschner / githubweb
  19. catch more data types and handle variable feature sets over agg — carsten.draschner / githubweb
  20. attempts to fix oom — carsten.draschner / githubweb
  21. solve oom over persists and unpersists — carsten.draschner / githubweb
  22. pipeline runs through, adjusted prints and shows for neat output — carsten.draschner / githubweb
  23. removed outcommented println — carsten.draschner / githubweb
  24. some clean up — carsten.draschner / githubweb
  25. sparql change to run over full data — carsten.draschner / githubweb
  26. some prints for server progress — carsten.draschner / githubweb
  27. make query collapsable — carsten.draschner / githubweb
  28. created setters — carsten.draschner / githubweb
  29. bugfix by suffix strip in keycolumn generation — carsten.draschner / githubweb
  30. create second query string for collapsable feature columns — carsten.draschner / githubweb
  31. complete new procedure for smart vector assembler — carsten.draschner / githubweb
  32. created first test cass which calls smartvector assembler ontop of sparqlframe — carsten.draschner / githubweb
  33. assert over df size — carsten.draschner / githubweb
  34. put assert to test class — carsten.draschner / githubweb
  35. add setters for — carsten.draschner / githubweb
  36. make object attributes protected so they can be only set over setter — carsten.draschner / githubweb
  37. in unit test make use out of setters — carsten.draschner / githubweb
  38. started with DistRDF2ML evaluation class. tracking times of pipeline modules and store those to file — carsten.draschner / githubweb
  39. adjustments in write information — carsten.draschner / githubweb
  40. fix some column name handling in sva — carsten.draschner / githubweb
  41. label column name handling better distinct from feature columns in edge cases — carsten.draschner / githubweb
  42. explode of labelcolumn which is within list — carsten.draschner / githubweb
  43. assert for expected column names — carsten.draschner / githubweb
  44. some docstrings — carsten.draschner / githubweb
  45. get size after sparql frame to ensure no laziness — carsten.draschner / githubweb
  46. fix write bug — carsten.draschner / githubweb
  47. more persist — carsten.draschner / githubweb
  48. remove of need of joinable df and instead direct assignment of digitized df — carsten.draschner / githubweb
  49. remove debug component and make word2vec model separate — carsten.draschner / githubweb
  50. reduce word2vec size to 2 — carsten.draschner / githubweb
  51. offer sampling of word2vec training data to reduce ram complexity — carsten.draschner / githubweb
  52. create full regression pipeline as an example of DistRDF2ML use — carsten.draschner / githubweb
  53. move and rename evaluation script of DistRDF2ML — carsten.draschner / githubweb
  54. create or in sva of Int and integer — carsten.draschner / githubweb
  55. moved — carsten.draschner / githubweb
  56. use test for eval — carsten.draschner / githubweb
  57. sample classification pipeline based on DistRDF2ML modules — carsten.draschner / githubweb
  58. rename method — carsten.draschner / githubweb
  59. adjust setter — carsten.draschner / githubweb
  60. create Readme for DistRDF2ML — carsten.draschner / githubweb
  61. created DistRDF2ML clustering pipeline — carsten.draschner / githubweb
  62. adjustments in DistRDF2ML readme — carsten.draschner / githubweb
  63. adjustments in DistRDF2ML readme - add reference to classes — carsten.draschner / githubweb
  64. update DistRDF2ML readme - databricks — carsten.draschner / githubweb
  65. created ml2graph transformer for semantification of DistRDF2ML pipeline Results — carsten.draschner / githubweb
  66. ml to graph offer fallback xsd type — carsten.draschner / githubweb
  67. create ML2Graph unit test — carsten.draschner / githubweb
  68. show to different opportunities to gather int value out of string within KG — carsten.draschner / githubweb
  69. created new tiny sample file — carsten.draschner / githubweb
  70. created ML to Graph transformer to make MLlib output semantic for DistSim Pipeline — carsten.draschner / githubweb
  71. unit tests calling ML2Graph module — carsten.draschner / githubweb
  72. reorder metagraph — carsten.draschner / githubweb
  73. show semantification of results in sample regression pipeline — carsten.draschner / githubweb
  74. show semantification of results in sample classification pipeline — carsten.draschner / githubweb
  75. show semantification of results in sample clustering pipeline — carsten.draschner / githubweb
  76. class for data size eval on artificial movie data — carsten.draschner / githubweb
  77. offer ratio of indexer training like word2vec — carsten.draschner / githubweb

#56 (Apr 13, 2021, 10:59:57 AM)

  1. first running example to transform rdf data into a native spark dataframe with native scala datatypes — carsten.draschner / githubweb
  2. created transformer which creates sparql to dataframe with native object like string integer etc. — carsten.draschner / githubweb
  3. scala class object with main showcasing usage of sparqlFrame — carsten.draschner / githubweb
  4. annotate datatype of literals so it can be fetched by sparqlFrame — carsten.draschner / githubweb
  5. play around with Sparqlify in SANSA ml — carsten.draschner / githubweb
  6. comment out prints and add todos to make logs; also fixed bug by removing a closing curly bracket in sparql creation — carsten.draschner / githubweb
  7. added elements for more complete feature extracting pipeline — carsten.draschner / githubweb
  8. better literal identification — carsten.draschner / githubweb
  9. clearer warning of multiple struct types — carsten.draschner / githubweb
  10. clear difference in val names of auto and manual sparql — carsten.draschner / githubweb
  11. move rdf2feature sparql creator into feature extraction package — carsten.draschner / githubweb
  12. added missing imports after class moving — carsten.draschner / githubweb
  13. move to desired package structure — carsten.draschner / githubweb
  14. switch to make or not make feature blocks optional over parameter — carsten.draschner / githubweb
  15. better handling of empty answer dataframe — carsten.draschner / githubweb
  16. adjusted literal identification — carsten.draschner / githubweb
  17. running sample pipeline — carsten.draschner / githubweb
  18. apply common mllib algo and it works :) — carsten.draschner / githubweb
  19. more complex needed sparql statement — carsten.draschner / githubweb
  20. debug level error — carsten.draschner / githubweb
  21. changed position of sampling — carsten.draschner / githubweb
  22. cache in rdf2feature up and down dataframe — carsten.draschner / githubweb
  23. some fixes after merge — carsten.draschner / githubweb
  24. rename sparql creating method — carsten.draschner / githubweb
  25. spark setup taken from ontop to fix issue — carsten.draschner / githubweb
  26. vp — Lorenz Buehmann / githubweb
  27. VP warehouse path — Lorenz Buehmann / githubweb
  28. example from existing DB — Lorenz Buehmann / githubweb
  29. conf options — Lorenz Buehmann / githubweb
  30. use Hive metastore — Lorenz Buehmann / githubweb
  31. log — Lorenz Buehmann / githubweb
  32. dateTime/dateTimeStamp support — Lorenz Buehmann / githubweb
  33. log — Lorenz Buehmann / githubweb
  34. Added execSelectSpark() method to QueryExecutionSparqlifySpark — Claus Stadler / githubweb
  35. some rearrange of imports and add of jenasystem init — carsten.draschner / githubweb
  36. try out new version of sparklify to gain rdd of bindings — carsten.draschner / githubweb
  37. try out more complex sparql query with multi line optional blocks — carsten.draschner / githubweb
  38. change sparqlframe such that it can switch between ontop and sparqlify. also better handling for null values in columns to evaluate structtype and circumvent NPE when .get methods are called — carsten.draschner / githubweb
  39. try out new sparqlify based feature extraction in this pipeline — carsten.draschner / githubweb
  40. Refactored partitioning and optimized imports — Claus Stadler / githubweb
  41. minor — Lorenz Buehmann / githubweb
  42. minor code style — Lorenz Buehmann / githubweb
  43. partition type for dateTime literals — Lorenz Buehmann / githubweb
  44. partitioner changes adapted — Lorenz Buehmann / githubweb
  45. omit Javadoc/Scaladoc in build script — Lorenz Buehmann / githubweb
  46. test if ontop can deal with multiline optional blocks. and its working — carsten.draschner / githubweb
  47. Partitioning — Lorenz Buehmann / githubweb
  48. Added partition-to-r2rml converter based on our r2rml-jena API — Claus Stadler / githubweb
  49. Renamed method, added documentation — Claus Stadler / githubweb
  50. Attempt to add a manual workflow trigger — Claus Stadler / githubweb
  51. cntd. R2RML export/import — Lorenz Buehmann / githubweb
  52. minor — Lorenz Buehmann / githubweb
  53. Added new interfaces for unified sparqlify/ontop apis — Claus Stadler / githubweb
  54. More consolidation of the r2rml/virtual knowledge graph subsystem (sparqlify / ontop) — Claus Stadler / githubweb
  55. Update for change in r2rml lib — Claus Stadler / githubweb
  56. reuse code — Lorenz Buehmann / githubweb
  57. query execution alignment — Lorenz Buehmann / githubweb
  58. Update README.md — GitHub / githubweb
  59. comment methods — carsten.draschner / githubweb
  60. create first sparqlframe unit test — carsten.draschner / githubweb
  61. remove play around class which is not needed anymore — carsten.draschner / githubweb
  62. annotate literal type such that sparqlframe can handle it — carsten.draschner / githubweb
  63. document sparql frame — carsten.draschner / githubweb
  64. sample usage of pipeline elements — carsten.draschner / githubweb
  65. Set r2rml api version — Claus Stadler / githubweb
  66. query engine — Lorenz Buehmann / githubweb
  67. take table name fn — Lorenz Buehmann / githubweb
  68. Sparqlify now ported to the new API towards unified virtual graph handling — Claus Stadler / githubweb
  69. rewrite — Lorenz Buehmann / githubweb
  70. started developing smart vector assembler — carsten.draschner / githubweb
  71. added docstring — carsten.draschner / githubweb
  72. fixed null replacements — carsten.draschner / githubweb
  73. usage of smart vector assembler and fix of some strange read in toDS bug — carsten.draschner / githubweb
  74. extension of smart vector assembler transform logic — carsten.draschner / githubweb
  75. Ontop R2RML parse — Lorenz Buehmann / githubweb
  76. some cleanup and some docs. now first approach working on beta level — carsten.draschner / githubweb
  77. minor — Lorenz Buehmann / githubweb
  78. changed readme for smart vector assembler — carsten.draschner / githubweb
  79. more work on R2RML — Lorenz Buehmann / githubweb
  80. cntd Ontop integration — Lorenz Buehmann / githubweb
  81. reduced tests loads — Lorenz Buehmann / githubweb
  82. A few improvements for the R2RML layer design — Claus Stadler / githubweb
  83. fixes tests — Lorenz Buehmann / githubweb
  84. avoid exception — Lorenz Buehmann / githubweb
  85. tests debugging — Lorenz Buehmann / githubweb
  86. Work on analytic RDD ops — Claus Stadler / githubweb
  87. fixes tests — Lorenz Buehmann / githubweb
  88. Cache — Lorenz Buehmann / githubweb
  89. rewrite — Lorenz Buehmann / githubweb
  90. fixes build errors — Lorenz Buehmann / githubweb
  91. fix print lines — carsten.draschner / githubweb
  92. clean up — Lorenz Buehmann / githubweb
  93. minor — Lorenz Buehmann / githubweb
  94. simplified code — Lorenz Buehmann / githubweb
  95. minor — Lorenz Buehmann / githubweb
  96. Added schema mapping system (but some parts not yet serializable) — Claus Stadler / githubweb
  97. first working version of the schema mapper for customizable RDD[Binding] -> DataFrame conversions — Claus Stadler / githubweb
  98. dist iq — Lorenz Buehmann / githubweb
  99. Removed needless SparkSession argument — Claus Stadler / githubweb
  100. Result var order is now retained in the schema mapping — Claus Stadler / githubweb
  101. rename — Lorenz Buehmann / githubweb
  102. Experimenting with schema mapper — Claus Stadler / githubweb
  103. first commented possible change to incooperate with claus changes — carsten.draschner / githubweb
  104. first commented possible change to incooperate with claus changes — carsten.draschner / githubweb
  105. remove print — carsten.draschner / githubweb
  106. string type — Lorenz Buehmann / githubweb
  107. Fixed no archaic import — Claus Stadler / githubweb
  108. Fixed a no longer available import — Claus Stadler / githubweb
  109. extract methods — Lorenz Buehmann / githubweb
  110. minor — Lorenz Buehmann / githubweb
  111. test runner ext — Lorenz Buehmann / githubweb
  112. partition test fix — Lorenz Buehmann / githubweb
  113. log reduced — Lorenz Buehmann / githubweb
  114. merged SPARQL examples — Lorenz Buehmann / githubweb
  115. Added a comment about renaming RddToDframeMapper to something more specific — Claus Stadler / githubweb
  116. generic test suite — Lorenz Buehmann / githubweb
  117. test rework — Lorenz Buehmann / githubweb
  118. clean up — Lorenz Buehmann / githubweb
  119. Work on test cases — Claus Stadler / githubweb
  120. Adapted test runner — Claus Stadler / githubweb
  121. renamed test package — Lorenz Buehmann / githubweb
  122. Jar plugin — Lorenz Buehmann / githubweb
  123. SPARQL engine example improved — Lorenz Buehmann / githubweb
  124. all tests green so far — Claus Stadler / githubweb
  125. renamed test cases once more — Claus Stadler / githubweb
  126. fix compile error — Lorenz Buehmann / githubweb
  127. move out ontop cause it is unclear how to call it with new api — carsten.draschner / githubweb
  128. fix tests by System.setProperty and some changes in tests because of small datatype differences — carsten.draschner / githubweb
  129. bring ontop back into sparql frame over setter — carsten.draschner / githubweb
  130. updated integration tests - sparqlify succeed - ontop hangs — Claus Stadler / githubweb
  131. hotfix: some refactoring of hard coded spark master set to local, remove unused sample classes, hotfix of small udf change — carsten.draschner / githubweb
  132. offset support — Lorenz Buehmann / githubweb
  133. minor changes in engine — Lorenz Buehmann / githubweb
  134. ser tests — Lorenz Buehmann / githubweb
  135. debug serializer — Lorenz Buehmann / githubweb
  136. Scala code in Java ... — Lorenz Buehmann / githubweb
  137. moved Java serializer to Scala — Lorenz Buehmann / githubweb
  138. some cleanup — Lorenz Buehmann / githubweb
  139. Added core for another query engine for ad-hoc queries on trig data - conceptually it's probably sparqlgx-sde — Claus Stadler / githubweb
  140. ser extended — Lorenz Buehmann / githubweb
  141. Renamed class — Claus Stadler / githubweb
  142. Added cli, deb packaging, build/version metadata generation, changed r2rml dep version to 0.9.0-SNAPSHOT — Claus Stadler / githubweb
  143. Fixed typo — Claus Stadler / githubweb
  144. Cleaned up some warning... now java/scala cyclic dependency issue -.- — Claus Stadler / githubweb
  145. Fixed some annotation issues — Claus Stadler / githubweb
  146. Added the well known compile-scala-first workaround — Claus Stadler / githubweb
  147. Ignored binding engine test — Claus Stadler / githubweb
  148. sansa command working from debian package — Claus Stadler / githubweb
  149. Attempt to fix broken prefix handling in TrigFileInputFormat — Claus Stadler / githubweb
  150. General improvements on the cli tool — Claus Stadler / githubweb
  151. Removed all log4j.properties files under src/main/resources — Claus Stadler / githubweb
  152. Moved spark-bench to separate module because it is about the only remaining lib that ships with its own log4j.properties and doesn't belong to main lib anyway — Claus Stadler / githubweb
  153. Increased default max record length in trig reader to 10MB — Claus Stadler / githubweb
  154. Attempt to bump TRL of trig reader from 4 (works under laboratory conditions) to 6/7 (works in a relevant setup)... — Claus Stadler / githubweb
  155. Added workaround for HADOOP-17453 and it seems that made it work... code yet needs removal of swear words.... — Claus Stadler / githubweb
  156. cleanup — Claus Stadler / githubweb
  157. reset trig reader defaults — Claus Stadler / githubweb
  158. registered more classes to serializer — Lorenz Buehmann / githubweb
  159. Improvements to the trig query; added --distinct flag for preprocessing data — Claus Stadler / githubweb
  160. type extractor based on R2RML — Lorenz Buehmann / githubweb
  161. some debug — Lorenz Buehmann / githubweb
  162. use enum for term types — Lorenz Buehmann / githubweb
  163. generic quad writer — Lorenz Buehmann / githubweb
  164. Towards a utility to merge multiple trig files — Claus Stadler / githubweb
  165. Added extra module for test data resources (sansa-resource-testdata) — Claus Stadler / githubweb
  166. Added missing pom for test data — Claus Stadler / githubweb
  167. Improved trig record reader to dynamically allocate head and tail buffers — Claus Stadler / githubweb
  168. Fixed bug in trig reader — Claus Stadler / githubweb
  169. test cases now working again (a test resource is now shared between rdf-common and query-spark) — Claus Stadler / githubweb
  170. Messing around with test case file lookups... — Claus Stadler / githubweb
  171. test resource file now found, but prefixing data frame names with the origin rdd now fails for ontop... — Claus Stadler / githubweb
  172. Tests working up to inference flink, where they fail — Claus Stadler / githubweb
  173. Fell victim to accidental overflow with int/long arithmetic after rearranging an expression... — Claus Stadler / githubweb
  174. Wrapping hadoop input stream with close shield in an attempt to get rid of the stream closed exception... — Claus Stadler / githubweb
  175. A bit of cleanup — Claus Stadler / githubweb
  176. More fiddling with suddenly closed streams (apparently jena closes streams in LangBase:44) — Claus Stadler / githubweb
  177. Solved issues related to AbstractInterruptibleChannel: RDFDataMgrRx's Invocation of .interrupt() on the parser thread closed hadoop's input stream; — Claus Stadler / githubweb
  178. TrigRecordReader update and fixes for compressed input — Claus Stadler / githubweb
  179. Added license file — Claus Stadler / githubweb
  180. Update LICENSE — GitHub / githubweb
  181. Reinstated jens' version of the license — GitHub / githubweb
  182. engine setup — Lorenz Buehmann / githubweb
  183. docs — Lorenz Buehmann / githubweb
  184. cleanup — Lorenz Buehmann / githubweb
  185. needed code adaption for latest Ontop changes — Lorenz Buehmann / githubweb
  186. test loader changed for relative paths — Lorenz Buehmann / githubweb
  187. Ontop serialization stuff — Lorenz Buehmann / githubweb
  188. test output reduced — Lorenz Buehmann / githubweb
  189. boolean parse fix — Lorenz Buehmann / githubweb
  190. removed debug output — Lorenz Buehmann / githubweb
  191. more serialization — Lorenz Buehmann / githubweb
  192. update readme regarding automatic creation of sparql query — GitHub / githubweb
  193. Kryo pool — Lorenz Buehmann / githubweb
  194. kryo debugging — Lorenz Buehmann / githubweb
  195. debug log — Lorenz Buehmann / githubweb
  196. disable kryo pool for now — Lorenz Buehmann / githubweb
  197. version to keep it separated from dev branch — Lorenz Buehmann / githubweb
  198. scientific notation workaround — Lorenz Buehmann / githubweb
  199. try rewrite instruction ser via Spark Kryo — Lorenz Buehmann / githubweb
  200. remove print line and started with sample pipeline for ea use case — carsten.draschner / githubweb
  201. remove unwanted print statement — carsten.draschner / githubweb
  202. more replacements of unwanted chars in projection vars — carsten.draschner / githubweb
  203. refactor where auto sparql creation is — carsten.draschner / githubweb
  204. refactor place of class — carsten.draschner / githubweb
  205. refactor place of pipelines to examples — carsten.draschner / githubweb
  206. More work on the trig/distinct command — Claus Stadler / githubweb
  207. use more files and more print — carsten.draschner / githubweb
  208. Fixed wrong reference to cli module (now include scala version) — Claus Stadler / githubweb
  209. set number of mappers via properties — Lorenz Buehmann / githubweb
  210. local eval handler — Lorenz Buehmann / githubweb
  211. Switched to spark.sparkContext.union — Claus Stadler / githubweb
  212. Updated trig/distinct tool — Claus Stadler / githubweb
  213. Switched to thrift serialization by default — Claus Stadler / githubweb
  214. set local evaluation via property — Lorenz Buehmann / githubweb
  215. Pinned thrift version to 0.13.0 — Claus Stadler / githubweb
  216. tryout pipelines for lmdb — carsten.draschner / githubweb
  217. added hdt plugin — Claus Stadler / githubweb
  218. Thrift now working (issue was due to serialization as 'string' instead of byte[] rather than thrift version conflict) — Claus Stadler / githubweb
  219. WarningParseMode.IGNORE — carsten.draschner / githubweb
  220. some debug stuff — Lorenz Buehmann / githubweb
  221. relient node parser — Lorenz Buehmann / githubweb
  222. relient node parser — Lorenz Buehmann / githubweb
  223. Using thrift as default serialization for nodes now — Claus Stadler / githubweb
  224. more relient Node parse — Lorenz Buehmann / githubweb
  225. Fix for serialization issues (Turns out handling of ByteBuffer by spark is via util methods - not dedicated serializers) — Claus Stadler / githubweb
  226. cleanup — Lorenz Buehmann / githubweb
  227. parse mode enum — Lorenz Buehmann / githubweb
  228. fixed more issues with the binary serialization — Claus Stadler / githubweb
  229. refactor pipelines — carsten.draschner / githubweb
  230. fix bug in literal detection — carsten.draschner / githubweb
  231. added filtering by given feature list in smart vector assembler — carsten.draschner / githubweb
  232. smart vector assembler sample setting of feature columns to use — carsten.draschner / githubweb
  233. use autosparql in lmdb use case — carsten.draschner / githubweb
  234. Refactored all kryo serializers into java classes and moved them to rdf-common — Claus Stadler / githubweb
  235. Fixed typos and bug in KryoArrayUtils.java — Claus Stadler / githubweb
  236. Added performance benchmark for thrift vs riot (as junit test case for now) — Claus Stadler / githubweb
  237. formatting — Claus Stadler / githubweb
  238. cleanup — Claus Stadler / githubweb
  239. Fixed runtime java.lang.NoClassDefFoundError: Could not initialize class net.sansa_stack.rdf.spark.model.rdd.RddOfTriplesOps$ when running in standalone/yarn Spark modes. — azary / githubweb
  240. Added a custom node serializer because node serialization via jena's thrift api is too slow (riot is 20x faster) — Claus Stadler / githubweb
  241. Updated class description of GenericNodeSerializerCustom — Claus Stadler / githubweb
  242. code style — Claus Stadler / githubweb
  243. store sample created autosparql in lmdb use case — carsten.draschner / githubweb
  244. use compact but representative sparql query in lmdb — carsten.draschner / githubweb
  245. experiment with eauc — carsten.draschner / githubweb
  246. disable Kryo logging in tests — Lorenz Buehmann / githubweb
  247. Changed a serr to logger.debug — Claus Stadler / githubweb
  248. fix null issue in qef setup — Lorenz Buehmann / githubweb
  249. Updated getting the spark session for a given RDD — Claus Stadler / githubweb
  250. Excluded excludes for jaxb because it breaks sparqlify's xml parser on java 9+ — Claus Stadler / githubweb
  251. Put a benchmark 'unit test' on ignore — Claus Stadler / githubweb
  252. Updated broken imports in inference test case — Claus Stadler / githubweb
  253. Update README.md — GitHub / githubweb
  254. Update README.md — GitHub / githubweb
  255. Update main.yml — GitHub / githubweb
  256. Update README.md — GitHub / githubweb
  257. Added more options to the trig distinct tool — Claus Stadler / githubweb
  258. better handling of literal identification — carsten.draschner / githubweb
  259. handle non supported query engines — carsten.draschner / githubweb
  260. make ontop unavailable — carsten.draschner / githubweb
  261. remove a certain show of a debug dataframe — carsten.draschner / githubweb
  262. update readme to fit to recent changes — GitHub / githubweb
  263. fixed VP setup for Ontop — Lorenz Buehmann / githubweb
  264. use enum instead of string for query engine setter — Lorenz Buehmann / githubweb
  265. explicit dataset type — carsten.draschner / githubweb
  266. handling if no seeds can be resolved — carsten.draschner / githubweb
  267. handle set of query engine in upper and lower case — carsten.draschner / githubweb
  268. some R2RML model utils — Lorenz Buehmann / githubweb
  269. run classification on rdf kg — carsten.draschner / githubweb
  270. R2RML utils — Lorenz Buehmann / githubweb
  271. switch order of queries — carsten.draschner / githubweb
  272. JSQL version — Lorenz Buehmann / githubweb
  273. GH issue101 test — Lorenz Buehmann / githubweb
  274. Refactored / fixed inference test cases (now uses classpath scanning rather than messing around with files) — Claus Stadler / githubweb
  275. W3C test suite runner Sparqlify — Lorenz Buehmann / githubweb
  276. added dependency — carsten.draschner / githubweb
  277. optimize imports — carsten.draschner / githubweb
  278. new dataset to dataframe implementation for auto sparql generation — carsten.draschner / githubweb
  279. Removed relative reference to parent pom from inference layer... — Claus Stadler / githubweb
  280. get back to old df handling of literals — carsten.draschner / githubweb
  281. More cleanup of inference pom setup... — Claus Stadler / githubweb
  282. some experiments and making code more stable — carsten.draschner / githubweb
  283. Improvement of the schema mapper; renamed RddOf[plural]Ops to singular — Claus Stadler / githubweb
  284. Ported TrigRecordReader to plain java (1 compile error remaining) — Claus Stadler / githubweb
  285. fixed last compile error on java TrigRecordReader - now for testing... — Claus Stadler / githubweb
  286. Ported TrigFileInputFormat to java — Claus Stadler / githubweb
  287. Ported TrigRecordReader test to a nice parameterized junit test. Applied auto-formatting. — Claus Stadler / githubweb
  288. Trig record reader should now be able to cope with pretty much all corner cases; improved test framework; real-world data tests pending. — Claus Stadler / githubweb
  289. Excluded slow trig record reader tests for now — Claus Stadler / githubweb
  290. Commented out r2rml-sql utils to see where updates to the common lib are needed — Claus Stadler / githubweb
  291. Fixed a regression that causes non-encoded input to be non-splittable in the TrigRecordReader — Claus Stadler / githubweb
  292. add system init — carsten.draschner / githubweb
  293. try out if common way works — carsten.draschner / githubweb
  294. remove line which not handle invalid triple — carsten.draschner / githubweb
  295. use only manual sparql string for test purposes — carsten.draschner / githubweb
  296. Replaced SqlEscaper with the new SqlCodec API (which can do both encoding and decoding of identifiers) — Claus Stadler / githubweb
  297. use only manual sparql string for test purposes — carsten.draschner / githubweb
  298. project now compiling again — Claus Stadler / githubweb
  299. Added support for post-processing R2RML mappings to qualify table names with database names and integration with the sparqlify system — Claus Stadler / githubweb
  300. test cleaned up — Lorenz Buehmann / githubweb
  301. enabled HTML export in ScalaTest — Lorenz Buehmann / githubweb
  302. Scalatest version change — Lorenz Buehmann / githubweb
  303. Scalatest version cleanup — Lorenz Buehmann / githubweb
  304. HTML export lib — Lorenz Buehmann / githubweb
  305. comment out ontop block. locally running now — carsten.draschner / githubweb
  306. expand shortcuts readded — Lorenz Buehmann / githubweb
  307. Refactored TrigRecordReader into a framework with the central class RecordReaderGenericBase. — Claus Stadler / githubweb
  308. Turtle record reader working on some datasets but failing on others with lots of blank nodes. — Claus Stadler / githubweb
  309. Splittable record reader for turtle now functional — Claus Stadler / githubweb
  310. Aligned hadoop configuration option naming — Claus Stadler / githubweb
  311. test exclude — Lorenz Buehmann / githubweb
  312. workaround Scalatest conflicts — Lorenz Buehmann / githubweb
  313. Created dedicated modules sansa-hadoop-jena and sansa-kryo-jena — Claus Stadler / githubweb
  314. Cleaning up some leftovers from refactoring jena/hadoop/kryo — Claus Stadler / githubweb
  315. all tests passed locally — Claus Stadler / githubweb
  316. Fix for #144 — Claus Stadler / githubweb
  317. write debug data on test error — Lorenz Buehmann / githubweb
  318. clean tests — Lorenz Buehmann / githubweb
  319. minor — Lorenz Buehmann / githubweb
  320. minor — Lorenz Buehmann / githubweb
  321. omit DB in test clean up — Lorenz Buehmann / githubweb
  322. minor — Lorenz Buehmann / githubweb
  323. Fix for #146; test with 10 million numbered triples did not exhibit data loss anymore — Claus Stadler / githubweb
  324. bumped RDF4j version — Lorenz Buehmann / githubweb
  325. Improved README to better point out features — GitHub / githubweb
  326. Rephrased features — GitHub / githubweb
  327. Update README.md — GitHub / githubweb
  328. Update README.md — GitHub / githubweb
  329. Update README.md — GitHub / githubweb
  330. Significant reworking of RecordReaderGenericBase for #144; when collecting the split's flow to lists all tests succeed; otherwise there is a 'MissingBackPressure' exception. — Claus Stadler / githubweb
  331. Maybe the missing backpressure exception is solved now — Claus Stadler / githubweb
  332. Added SERVICE<rdd:perGraph> support in addition to rdd:perPartition (but still not totally happy because semantics w.r.t. named graphs, default graphs, datasets and partitions of those not totally clear; needs more thought) — Claus Stadler / githubweb
  333. Added feature to merge part files into a single file — Claus Stadler / githubweb
  334. kryo package — Lorenz Buehmann / githubweb
  335. CLI moved — Lorenz Buehmann / githubweb
  336. CLI moved — Lorenz Buehmann / githubweb
  337. Improvements for #147: Renamed methods and updated documentation — Claus Stadler / githubweb
  338. Fixed a compile error due to String.format(str, ...) - changed to str.format(...) — Claus Stadler / githubweb
  339. Added spark-cli module to sansa-stack bundle project — Claus Stadler / githubweb
  340. Changed guava Stopwatch to Apache StopWatch because of hadoop — Claus Stadler / githubweb
  341. TrigMerge should now properly use the hadoop file system — Claus Stadler / githubweb
  342. Fixed compile error — Claus Stadler / githubweb
  343. Messing around with hadoop file system uri resolution... — Claus Stadler / githubweb
  344. Maybe now... — Claus Stadler / githubweb
  345. Bumped commons-lang3 version — Claus Stadler / githubweb
  346. Added more logging to RecordReaderGenericBase — Claus Stadler / githubweb
  347. Typo — Claus Stadler / githubweb
  348. Changed strategy for writing partitions of datasets out using PipedInput/OutputStream approach; assuming that spark is smart then this approach — Claus Stadler / githubweb
  349. clean up — Lorenz Buehmann / githubweb
  350. Deprecated the saveAsNtriples file method; going to replace it with a more powerful fluent API — Claus Stadler / githubweb
  351. Potential fix for issue with lambda serialization in NodeAnalytics — Claus Stadler / githubweb
  352. Another fix — Claus Stadler / githubweb
  353. serialize DB metadata — Lorenz Buehmann / githubweb
  354. Removed exclude for lang3 — Claus Stadler / githubweb
  355. Removed guava from SchemaMapper interface in an attempt to solve a serialization issue — Claus Stadler / githubweb
  356. Removed unused imports — Claus Stadler / githubweb
  357. some logs — Lorenz Buehmann / githubweb
  358. more logs — Lorenz Buehmann / githubweb
  359. omit DB creation — Lorenz Buehmann / githubweb
  360. for server eval make sparql alternatives settable — carsten.draschner / githubweb
  361. try out without auto sparql — carsten.draschner / githubweb
  362. try out without auto sparql — carsten.draschner / githubweb
  363. make sparql query parameter — carsten.draschner / githubweb
  364. mapper — Lorenz Buehmann / githubweb
  365. more debug log — Lorenz Buehmann / githubweb
  366. added print lines to see needed processing time — carsten.draschner / githubweb
  367. catch exception which always occurs — Lorenz Buehmann / githubweb
  368. adjusted manual sparql query — carsten.draschner / githubweb
  369. no metamapping expansion — Lorenz Buehmann / githubweb
  370. cache and prepare config — Lorenz Buehmann / githubweb
  371. expand mappings — Lorenz Buehmann / githubweb
  372. Added first version of a fluent API to RDF Rdds rdd.configureSave()...run(). — Claus Stadler / githubweb
  373. Added all formats to the new OutputFormatRdfRegistry — Claus Stadler / githubweb
  374. Updated Sparqlify integration to use SqlCodec — Claus Stadler / githubweb
  375. fixed flink build — Claus Stadler / githubweb
  376. logging and settings — Lorenz Buehmann / githubweb
  377. Experimenting with save settings — Claus Stadler / githubweb
  378. QueryEngineFactoryBase now uses double quotes encoding for R2RML — Claus Stadler / githubweb
  379. reduced log — Lorenz Buehmann / githubweb
  380. Revision of quoting: As a rule, R2RML documents should now ALWAYS escape any SQL identifiers with double quotes; conversely: never use backticks in the R2RML. — Claus Stadler / githubweb
  381. Removed debug output — Claus Stadler / githubweb
  382. Removed unused repos — Claus Stadler / githubweb
  383. Refactored the creation of StreamRDF instances for writing RDF out in order to simplify changing of strategies — Claus Stadler / githubweb
  384. local eval mode — Lorenz Buehmann / githubweb
  385. repo — Lorenz Buehmann / githubweb
  386. Work on RddRdfSaver (unfinished) and improving structure of operations on Datasets / Named Models — Claus Stadler / githubweb
  387. remove old row mapper — Lorenz Buehmann / githubweb
  388. version set — Lorenz Buehmann / githubweb
  389. Work on rdd ops — Claus Stadler / githubweb
  390. version fix — Lorenz Buehmann / githubweb
  391. Removed duplicate elephas-io declaration in hadoop-jena — Claus Stadler / githubweb
  392. Replacing a 'dash' from system hash code with '_' in order to yield a valid spark table name. — Claus Stadler / githubweb
  393. set log4j to warn for tests; another attempt to fix the naming issue with the dash — Claus Stadler / githubweb
  394. Added feature to convert quads to triples when saving with a triples language — Claus Stadler / githubweb
  395. Fixed versions of the bundle modules — Claus Stadler / githubweb
  396. serialize DB metadata — GitHub / githubweb
  397. Removed guava stopwatch — Claus Stadler / githubweb
  398. Updated trig/query tool (needs generalization and consolidation later) — Claus Stadler / githubweb
  399. Re-added workaround for HADOOP-17453 (bugged non-zero offset reads from BZip2Codec) — Claus Stadler / githubweb
  400. all around quoting — Lorenz Buehmann / githubweb
  401. test keep — Lorenz Buehmann / githubweb
  402. encode string literals — Lorenz Buehmann / githubweb
  403. tests cleaned — Lorenz Buehmann / githubweb
  404. docs — Lorenz Buehmann / githubweb
  405. Update README.md — GitHub / githubweb
  406. Update README.md — GitHub / githubweb
  407. Update README.md — GitHub / githubweb
  408. formatting — Lorenz Buehmann / githubweb
  409. SPARQL usage example — Lorenz Buehmann / githubweb
  410. less log — Lorenz Buehmann / githubweb
  411. Update README.md — GitHub / githubweb
  412. Update README.md — GitHub / githubweb
  413. Update README.md — GitHub / githubweb
  414. Update README.md — GitHub / githubweb
  415. pom cleanup — Lorenz Buehmann / githubweb
  416. Removed unused classes and imports — Claus Stadler / githubweb
  417. Removed an unused class — Claus Stadler / githubweb
  418. removed guava — Claus Stadler / githubweb
  419. Removed another guava stopwatch — Claus Stadler / githubweb
  420. stick to Guava 14 for Spark compatibility — Lorenz Buehmann / githubweb
  421. Fixed some warnings — Claus Stadler / githubweb
  422. start with semantic description — carsten.draschner / githubweb
  423. bumped guava — Lorenz Buehmann / githubweb
  424. Replaced snapshot versions of Claus's stuff with release ones — Claus Stadler / githubweb
  425. removed legacy GeoSpark dep — Lorenz Buehmann / githubweb
  426. pom update for aksw release — Claus Stadler / githubweb
  427. bumped gitflow plugin version / using official one again — Claus Stadler / githubweb
  428. removed build sections from datalake; compilation still works — Claus Stadler / githubweb
  429. Update versions for release — Claus Stadler / githubweb
  430. Update for next development version — Claus Stadler / githubweb
  431. Added bundle modules to parent — Claus Stadler / githubweb
  432. Removed empty file — Claus Stadler / githubweb
  433. Commented out maven badge — Claus Stadler / githubweb
  434. Update README.md — GitHub / githubweb

#55 (Dec 23, 2020, 2:22:30 PM)

  1. Minor update on IT documentation, removed intentional exception from SansaIT again — Claus Stadler / githubweb

#54 (Dec 23, 2020, 2:00:40 PM)

  1. minor — Lorenz Buehmann / githubweb

#53 (Dec 23, 2020, 10:06:41 AM)

  1. Update README.md — GitHub / githubweb

#52 (Dec 22, 2020, 3:18:24 PM)

  1. Added temporary intentional fail to IT to investigate network issue between spawned docker containers — Claus Stadler / githubweb

#51 (Dec 22, 2020, 1:11:13 PM)

  1. removed hard coded master local — Lorenz Buehmann / githubweb
  2. inference example cleaned — Lorenz Buehmann / githubweb
  3. keep commons codec for Jena — Lorenz Buehmann / githubweb

#50 (Dec 22, 2020, 1:02:02 PM)

  1. Update README.md — GitHub / githubweb

#49 (Dec 22, 2020, 11:34:43 AM)

  1. Reenabled Ontop test (test works locally) — Claus Stadler / githubweb

#48 (Dec 21, 2020, 4:01:14 PM)

  1. Added initial readme for it module — Claus Stadler / githubweb

#47 (Dec 21, 2020, 11:02:37 AM)

  1. removed history server setting — Lorenz Buehmann / githubweb

#46 (Dec 21, 2020, 10:51:07 AM)

  1. IT check should now abort if spark submit container exits — Claus Stadler / githubweb

#45 (Dec 21, 2020, 10:07:22 AM)

  1. Integration test with dockerized spark-submit working for sparqlify — Claus Stadler / githubweb

#44 (Dec 18, 2020, 2:37:52 PM)

  1. update docs for rdf2feature paper/scaladocs link and naming of headers — GitHub / githubweb

#43 (Dec 18, 2020, 1:27:16 PM)

  1. fixes JDK 11 issue with test loader — Lorenz Buehmann / githubweb

#42 (Dec 18, 2020, 11:28:04 AM)

  1. fixes SQL generation — Lorenz Buehmann / githubweb

#41 (Dec 18, 2020, 12:33:27 AM)

  1. hotfix after change to new jena version to explicitly call jenaSystem in ml — carsten.draschner / githubweb

#40 (Dec 17, 2020, 11:30:43 PM)

  1. Fixed test cases by adding JenaSystem.init — Claus Stadler / githubweb

#39 (Dec 17, 2020, 1:53:38 PM)

  1. Fixed dependencies in it module — Claus Stadler / githubweb

#38 (Dec 17, 2020, 12:46:15 PM)

  1. Successful test to start BDE spark using testcontainers — Claus Stadler / githubweb
  2. Updated gitignore — Claus Stadler / githubweb
  3. More experiments with testcontainers and spark submit — Claus Stadler / githubweb
  4. dist profile on sansa-stack-spark now yields correct filename for jar with dependencies. — Claus Stadler / githubweb
  5. first working sparklify deployment within test containers — Claus Stadler / githubweb
  6. Improved integration test; sparqlify server now actually tested automatically — Claus Stadler / githubweb
  7. Removed some println — Claus Stadler / githubweb
  8. More work on integration testing of ontop and sparklify — Claus Stadler / githubweb
  9. Sparqlify integration test working with improved boilerplate — Claus Stadler / githubweb
  10. Added jena init to tests — Claus Stadler / githubweb
  11. Another JenaSystem.init — Claus Stadler / githubweb
  12. Integration test now only depends on spark-core and jena (the rest comes from the jar bundle) — Claus Stadler / githubweb

#37 (Dec 17, 2020, 12:27:20 PM)

  1. registrator switched — Lorenz Buehmann / githubweb

#36 (Dec 16, 2020, 2:27:06 PM)

  1. remove redundant registrator — Lorenz Buehmann / githubweb
  2. allow for negative years — Lorenz Buehmann / githubweb
  3. fix registrator — Lorenz Buehmann / githubweb

#35 (Dec 15, 2020, 9:14:01 AM)

  1. Update README.md — GitHub / githubweb
  2. some cleanup — Lorenz Buehmann / githubweb

#34 (Dec 12, 2020, 10:54:22 PM)

  1. ml readme href changes — GitHub / githubweb

#33 (Dec 11, 2020, 10:56:04 AM)

  1. removed deps from example POM — Lorenz Buehmann / githubweb

#32 (Dec 11, 2020, 10:49:50 AM)

  1. SBT cleaned — Lorenz Buehmann / githubweb
  2. docker tests — Lorenz Buehmann / githubweb
  3. examples POM cleaned — Lorenz Buehmann / githubweb

#31 (Dec 9, 2020, 3:05:44 PM)

  1. Fixed wrong implementation of "EntitiesMentioned" function. — f.bakhshandegan / githubweb

#30 (Dec 8, 2020, 5:30:03 PM)

  1. added more tests for RDFStatsTests and some related functions to stats object — f.bakhshandegan / githubweb

#29 (Dec 8, 2020, 4:25:24 PM)

  1. fixed missing chars to indicate code start — GitHub / githubweb

#28 (Dec 8, 2020, 4:18:33 PM)

  1. update links to new Scala docs — GitHub / githubweb

#27 (Dec 8, 2020, 2:59:02 PM)

  1. hotfix: the UDF change in Spark 3 does not allow a typed return value. This led to an error when running the related test class, so the return type was simply removed. — carsten.draschner / githubweb

#26 (Dec 8, 2020, 1:55:28 PM)

  1. Update README.md — GitHub / githubweb

#25 (Dec 8, 2020, 1:52:22 PM)

  1. Spark 3 changes — Lorenz Buehmann / githubweb
  2. Spark 3 adaptions — Lorenz Buehmann / githubweb
  3. Datalake connectors for Spark 3 — Lorenz Buehmann / githubweb
  4. Ontop bumped — Lorenz Buehmann / githubweb
  5. Spark 3 changes — Lorenz Buehmann / githubweb
  6. Spark 3 adaptions — Lorenz Buehmann / githubweb
  7. Datalake connectors for Spark 3 — Lorenz Buehmann / githubweb
  8. Ontop bumped — Lorenz Buehmann / githubweb
  9. POM fix — Lorenz Buehmann / githubweb
  10. deps — Lorenz Buehmann / githubweb

#24 (Dec 8, 2020, 1:43:28 PM)

  1. remove scala annotations for code to circumvent undesired code block parsing in markdown — carsten.draschner / githubweb
  2. git ignore ds store as artifact from mac devices — carsten.draschner / githubweb
  3. remove unused/unknown dependencies for cleanup of poms — carsten.draschner / githubweb
  4. in readme references — carsten.draschner / githubweb
  5. only one hash — carsten.draschner / githubweb
  6. change link — carsten.draschner / githubweb
  7. update and clean links — carsten.draschner / githubweb
  8. edit header to make link reference possible — carsten.draschner / githubweb
  9. Update README.md — GitHub / githubweb
  10. Update README.md — GitHub / githubweb
  11. Update README.md — GitHub / githubweb
  12. link subsections in head bulletpoint list — GitHub / githubweb
  13. update unclear sentence — GitHub / githubweb
  14. remove scala annotations for code to circumvent undesired code block parsing in markdown — Lorenz Buehmann / githubweb
  15. git ignore ds store as artifact from mac devices — Lorenz Buehmann / githubweb
  16. remove unused/unknown dependencies for cleanup of poms — Lorenz Buehmann / githubweb
  17. in readme references — Lorenz Buehmann / githubweb
  18. only one hash — Lorenz Buehmann / githubweb
  19. change link — Lorenz Buehmann / githubweb
  20. update and clean links — Lorenz Buehmann / githubweb
  21. edit header to make link reference possible — Lorenz Buehmann / githubweb
  22. Update README.md — Lorenz Buehmann / githubweb
  23. Update README.md — Lorenz Buehmann / githubweb
  24. Update README.md — Lorenz Buehmann / githubweb
  25. link subsections in head bulletpoint list — Lorenz Buehmann / githubweb
  26. update unclear sentence — Lorenz Buehmann / githubweb

#23 (Dec 8, 2020, 1:27:01 PM)

  1. Add Badges and How to Contribute sections on README file — Gezim Sejdiu / githubweb
  2. Make mvn_install_stack_*.sh executable scripts — Gezim Sejdiu / githubweb

#22 (Dec 8, 2020, 1:26:05 PM)

  1. SBT cleanup — Lorenz Buehmann / githubweb

#21 (Dec 8, 2020, 12:48:09 PM)

  1. remove unused deps — Lorenz Buehmann / githubweb
  2. typos — Lorenz Buehmann / githubweb

#20 (Dec 8, 2020, 12:33:23 PM)

  1. owl api versions fix — Lorenz Buehmann / githubweb

#19 (Dec 8, 2020, 12:13:23 PM)

  1. removed unused dependencies — Lorenz Buehmann / githubweb

#18 (Dec 8, 2020, 11:34:41 AM)

  1. removed SBT file — Lorenz Buehmann / githubweb

#17 (Dec 8, 2020, 9:19:31 AM)

  1. Update README.md — GitHub / githubweb

#16 (Dec 7, 2020, 1:33:54 PM)

  1. fixed some repo issues — Lorenz Buehmann / githubweb
  2. SBT draft — Lorenz Buehmann / githubweb
  3. remove SBT file — Lorenz Buehmann / githubweb

#15 (Dec 7, 2020, 1:17:53 PM)

  1. fix bug in resource loading — carsten.draschner / githubweb
  2. ignore log files — carsten.draschner / githubweb
  3. delete tests of modules planned to be deleted from release 0.8 — carsten.draschner / githubweb
  4. delete test resources of modules planned to be deleted from release 0.8 — carsten.draschner / githubweb
  5. delete classes and folders of modules planned to be deleted from release 0.8 — carsten.draschner / githubweb
  6. sansa ml examples flink delete not maintained modules — carsten.draschner / githubweb
  7. sansa ml examples spark delete not maintained modules — carsten.draschner / githubweb
  8. sansa ml examples spark delete files for not maintained modules — carsten.draschner / githubweb
  9. sansa ml examples spark delete files for not maintained modules — carsten.draschner / githubweb
  10. sansa ml common delete files for not maintained modules — carsten.draschner / githubweb
  11. sansa ml flink delete classes for not maintained modules — carsten.draschner / githubweb
  12. new README.md for sansa ml — carsten.draschner / githubweb

#14 (Dec 6, 2020, 5:06:24 PM)

  1. First approach to automatic creation of feature-extracting SPARQL queries, based on the Python implementation; working on a small dataset. Now needs refactoring: camel case, moving fractions of the code into separate functions, and making it callable rather than only invoked from main — carsten.draschner / githubweb
  2. fixed bug in for loop and filtering bug; also removed some print and show calls that were there for debugging purposes — carsten.draschner / githubweb
  3. removed further prints and, in particular, refactored the row-to-query-line logic into a separate method — carsten.draschner / githubweb
  4. refactored to camel case — carsten.draschner / githubweb
  5. removed commented lines and made print statements cleaner one-liners using f-strings — carsten.draschner / githubweb
  6. changed call behavior to use config instead of hard coded setup. also provided small sample files in resources — carsten.draschner / githubweb
  7. also added master as configurable parameter, maybe needed for standalone on sparkservers — carsten.draschner / githubweb
  8. write out SPARQL into the file given by the config file, and rename the input file name variable — carsten.draschner / githubweb
  9. added scala doc strings — carsten.draschner / githubweb
  10. added first Scala test checking whether the desired projection variables have been gathered — carsten.draschner / githubweb
  11. removed not necessary lines and added minor docstring — carsten.draschner / githubweb
  12. created artificial number random walk behavior opportunity — carsten.draschner / githubweb
  13. handling of rdf lang by file ending — carsten.draschner / githubweb
  14. fix bug in split by dot — carsten.draschner / githubweb
  15. changed main class for rdf2feature — carsten.draschner / githubweb
  16. added spark 3 profile — carsten.draschner / githubweb
  17. rename spark appname to rdf2feature — carsten.draschner / githubweb
  18. fix suggested by Lorenz to fix Guava problem over insert filter — carsten.draschner / githubweb
  19. use config for outputpath — carsten.draschner / githubweb
  20. cache read in df — carsten.draschner / githubweb
  21. tab code block — carsten.draschner / githubweb
  22. show intermediate used and processed dataframes — carsten.draschner / githubweb
  23. added intermediate step to transfer list of nodes for seeds to list of strings for seeds — carsten.draschner / githubweb
  24. replaced sparqlquery usage by desired resulting seeds as string to test if this can run on server. and it does! — carsten.draschner / githubweb
  25. using sparql with one more spark config — carsten.draschner / githubweb
  26. make hard coded seeds usable over config — carsten.draschner / githubweb
  27. duplicate profile removed — Lorenz Buehmann / githubweb
  28. ignore ds store coming from mac os devices — carsten.draschner / githubweb
  29. use sparqlQuery for obtaining seeds instead of taking them from a hardcoded list — carsten.draschner / githubweb
  30. a branch with Ontop shaded Guava dependency — Lorenz Buehmann / githubweb
  31. OWL API issues tests — Lorenz Buehmann / githubweb
  32. serialization tests — Lorenz Buehmann / githubweb
  33. extended test — Lorenz Buehmann / githubweb
  34. Kryo registrator for Ontop — Lorenz Buehmann / githubweb
  35. Use Ontop Kryo registrator — Lorenz Buehmann / githubweb
  36. simplified RDF lang detection — Lorenz Buehmann / githubweb
  37. partitioner log — Lorenz Buehmann / githubweb
  38. remove usage of hard coded seeds, commented not needed prints and shows and solved minor bug with cutoff seeds, removed artifact from suboptimal spark master setting — carsten.draschner / githubweb
  39. fixed bug with seed cutoff and provide opportunity to sort seeds by outgoing links — carsten.draschner / githubweb
  40. bug fixed in the seed generator sparql — f.bakhshandegan / githubweb
  41. deleted extra log lines and fixed some typos — f.bakhshandegan / githubweb

#13 (Dec 3, 2020, 1:37:35 PM)

  1. Updates OWL API version and makes required changes — Patrick Westphal / githubweb
  2. Fixed overridden ontop version — Patrick Westphal / githubweb
  3. Scala code simplifications — Lorenz Buehmann / githubweb

#12 (Dec 3, 2020, 8:55:09 AM)

  1. Bump jetty-server in /sansa-examples/sansa-examples-spark — GitHub / githubweb

#11 (Dec 1, 2020, 12:39:02 PM)

  1. a branch with Ontop shaded Guava dependency — Lorenz Buehmann / githubweb
  2. serialization tests — Lorenz Buehmann / githubweb
  3. extended test — Lorenz Buehmann / githubweb
  4. Kryo registrator for Ontop — Lorenz Buehmann / githubweb
  5. OWLOntology Kryo serialization — Lorenz Buehmann / githubweb

#10 (Nov 30, 2020, 12:05:21 PM)

  1. misc — Lorenz Buehmann / githubweb
  2. remove deprecation — Lorenz Buehmann / githubweb

#9 (Nov 27, 2020, 11:26:17 AM)

  1. Update README.md — GitHub / githubweb

#8 (Nov 27, 2020, 11:21:17 AM)

  1. mvn install from source scripts — Lorenz Buehmann / githubweb

#7 (Nov 27, 2020, 10:38:19 AM)

  1. aligned artifact names — Lorenz Buehmann / githubweb

#6 (Nov 27, 2020, 9:47:03 AM)

  1. Update pom.xml — GitHub / githubweb

#5 (Nov 27, 2020, 9:46:04 AM)

  1. Update pom.xml — GitHub / githubweb

#4 (Nov 25, 2020, 1:42:08 PM)

  1. Update Jenkinsfile — GitHub / githubweb