2.0.43: Bump version. Update commons compress. (#125)
Changelog
Unreleased
GitHub #101 Correct Connect Packaging plugin configuration
V2.0 (#106)
- Start of the 2.0 branch.
- Added support for sorting the order of files processed. Added support for multiple tasks. Refactored file selection logic out to predicates. Fixes #98. Fixes #97.
- Clean up policy to move files to subdirectories by date in finished directory (#103)
- Moving finished files to date folders
- Add test
- Removed duplicate call to listFiles.
- Moving the metadata to headers. Metadata currently was not used by the individual connectors. Fixed #94. Fixed #95.
- Refactored to remove usage of Path since we have that functionality in File. Lower logging of directory to trace.
- Refactored InputFile to own everything for the input file. Added metadata class that is responsible for the file metadata based on cached values from InputFile. Updated test cases to have values for headers, ignoring some of the headers that will be dynamic.
- Fixed java doc warnings.
- Added finishedPathRequired method to config. This will be used to determine if the tasks should check for a finished path. Moved ownership of the InputStream to be completely within InputFile.
- Fixed checkstyle.
- Major refactor to add support for using a BufferedInputStream. Renamed abstract classes to be consistent. Connectors can optionally support using a BufferedInputStream which should reduce the round trips to the underlying filesystem. file.buffer.size.bytes can be used to configure the buffer. Fixes #105.
- Corrected Metadata to include offset within the file. #95.
- Corrected to use proper config() method.
- Added missing file.offset header to test cases. #95.
- Removed duplicate check.
- Added MB per second calculations.
- Bump the backoff time to 500 ms if no files are returned.
- Lower the logging level when searching for files.
- Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
- Change manifest file's link to documentation (#102)
- Missed configuring tasks for SchemaLessJson. #97.
- Fixed Checkstyle.
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Modified to not require a key for the messages. Bumped version for extended-log-format. Fixes #117
- CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish <[email protected]>
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Documentation for metadata.field was not correct.
- Corrected extended-log-format version.
- Update docs for task.partitioner.
- Removed schema generation dependency.
- Updated examples.
- Co-authored-by: Jon Mitchell <[email protected]>
- Co-authored-by: Randall Hauch <[email protected]>
- Co-authored-by: Arjun Satish <[email protected]>
08c35d576db982d Jeremy Custenborder 2020-02-28 18:02:52
Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
1db3817a007a386 Randall Hauch 2019-06-04 03:58:06
GitHub #102 Change manifest file's link to documentation
V2.0 (#106)
- Start of the 2.0 branch.
- Added support for sorting the order of files processed. Added support for multiple tasks. Refactored file selection logic out to predicates. Fixes #98. Fixes #97.
- Clean up policy to move files to subdirectories by date in finished directory (#103)
- Moving finished files to date folders
- Add test
- Removed duplicate call to listFiles.
- Moving the metadata to headers. Metadata currently was not used by the individual connectors. Fixed #94. Fixed #95.
- Refactored to remove usage of Path since we have that functionality in File. Lower logging of directory to trace.
- Refactored InputFile to own everything for the input file. Added metadata class that is responsible for the file metadata based on cached values from InputFile. Updated test cases to have values for headers, ignoring some of the headers that will be dynamic.
- Fixed java doc warnings.
- Added finishedPathRequired method to config. This will be used to determine if the tasks should check for a finished path. Moved ownership of the InputStream to be completely within InputFile.
- Fixed checkstyle.
- Major refactor to add support for using a BufferedInputStream. Renamed abstract classes to be consistent. Connectors can optionally support using a BufferedInputStream which should reduce the round trips to the underlying filesystem. file.buffer.size.bytes can be used to configure the buffer. Fixes #105.
- Corrected Metadata to include offset within the file. #95.
- Corrected to use proper config() method.
- Added missing file.offset header to test cases. #95.
- Removed duplicate check.
- Added MB per second calculations.
- Bump the backoff time to 500 ms if no files are returned.
- Lower the logging level when searching for files.
- Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
- Change manifest file's link to documentation (#102)
- Missed configuring tasks for SchemaLessJson. #97.
- Fixed Checkstyle.
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Modified to not require a key for the messages. Bumped version for extended-log-format. Fixes #117
- CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish <[email protected]>
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Documentation for metadata.field was not correct.
- Corrected extended-log-format version.
- Update docs for task.partitioner.
- Removed schema generation dependency.
- Updated examples.
- Co-authored-by: Jon Mitchell <[email protected]>
- Co-authored-by: Randall Hauch <[email protected]>
- Co-authored-by: Arjun Satish <[email protected]>
08c35d576db982d Jeremy Custenborder 2020-02-28 18:02:52
Change manifest file's link to documentation (#102)
a9c1e382e3bc1ad Randall Hauch 2019-06-04 17:24:26
GitHub #103 Clean up policy to move files to subdirectories by date in finished directory
V2.0 (#106)
- Start of the 2.0 branch.
- Added support for sorting the order of files processed. Added support for multiple tasks. Refactored file selection logic out to predicates. Fixes #98. Fixes #97.
- Clean up policy to move files to subdirectories by date in finished directory (#103)
- Moving finished files to date folders
- Add test
- Removed duplicate call to listFiles.
- Moving the metadata to headers. Metadata currently was not used by the individual connectors. Fixed #94. Fixed #95.
- Refactored to remove usage of Path since we have that functionality in File. Lower logging of directory to trace.
- Refactored InputFile to own everything for the input file. Added metadata class that is responsible for the file metadata based on cached values from InputFile. Updated test cases to have values for headers, ignoring some of the headers that will be dynamic.
- Fixed java doc warnings.
- Added finishedPathRequired method to config. This will be used to determine if the tasks should check for a finished path. Moved ownership of the InputStream to be completely within InputFile.
- Fixed checkstyle.
- Major refactor to add support for using a BufferedInputStream. Renamed abstract classes to be consistent. Connectors can optionally support using a BufferedInputStream which should reduce the round trips to the underlying filesystem. file.buffer.size.bytes can be used to configure the buffer. Fixes #105.
- Corrected Metadata to include offset within the file. #95.
- Corrected to use proper config() method.
- Added missing file.offset header to test cases. #95.
- Removed duplicate check.
- Added MB per second calculations.
- Bump the backoff time to 500 ms if no files are returned.
- Lower the logging level when searching for files.
- Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
- Change manifest file's link to documentation (#102)
- Missed configuring tasks for SchemaLessJson. #97.
- Fixed Checkstyle.
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Modified to not require a key for the messages. Bumped version for extended-log-format. Fixes #117
- CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish <[email protected]>
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Documentation for metadata.field was not correct.
- Corrected extended-log-format version.
- Update docs for task.partitioner.
- Removed schema generation dependency.
- Updated examples.
- Co-authored-by: Jon Mitchell <[email protected]>
- Co-authored-by: Randall Hauch <[email protected]>
- Co-authored-by: Arjun Satish <[email protected]>
08c35d576db982d Jeremy Custenborder 2020-02-28 18:02:52
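The date-folder cleanup described in #103 can be illustrated with a minimal sketch. This is not the connector's implementation: the class name DatedFinishedMover, the method name, and the yyyy-MM-dd folder naming are assumptions made only for illustration.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;

// Hypothetical helper; names and folder layout are assumptions, not the connector's API.
final class DatedFinishedMover {
  // Assumed layout: <finished>/<yyyy-MM-dd>/<file name>
  private static final DateTimeFormatter DAY = DateTimeFormatter.ISO_LOCAL_DATE;

  private DatedFinishedMover() {
  }

  static Path moveToDatedFolder(Path inputFile, Path finishedDir, LocalDate date) throws IOException {
    Path datedDir = finishedDir.resolve(DAY.format(date));
    // Creates the finished directory and the date subdirectory if either is missing.
    Files.createDirectories(datedDir);
    Path target = datedDir.resolve(inputFile.getFileName());
    // Replaces a file of the same name that was already finished on that date.
    return Files.move(inputFile, target, StandardCopyOption.REPLACE_EXISTING);
  }
}
```

Passing the date as a parameter, rather than reading the clock inside the method, keeps the sketch easy to test.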
GitHub #105 Add support for using BufferedInputStream wrapping FileInputStream enhancement
V2.0 (#106)
- Start of the 2.0 branch.
- Added support for sorting the order of files processed. Added support for multiple tasks. Refactored file selection logic out to predicates. Fixes #98. Fixes #97.
- Clean up policy to move files to subdirectories by date in finished directory (#103)
- Moving finished files to date folders
- Add test
- Removed duplicate call to listFiles.
- Moving the metadata to headers. Metadata currently was not used by the individual connectors. Fixed #94. Fixed #95.
- Refactored to remove usage of Path since we have that functionality in File. Lower logging of directory to trace.
- Refactored InputFile to own everything for the input file. Added metadata class that is responsible for the file metadata based on cached values from InputFile. Updated test cases to have values for headers, ignoring some of the headers that will be dynamic.
- Fixed java doc warnings.
- Added finishedPathRequired method to config. This will be used to determine if the tasks should check for a finished path. Moved ownership of the InputStream to be completely within InputFile.
- Fixed checkstyle.
- Major refactor to add support for using a BufferedInputStream. Renamed abstract classes to be consistent. Connectors can optionally support using a BufferedInputStream which should reduce the round trips to the underlying filesystem. file.buffer.size.bytes can be used to configure the buffer. Fixes #105.
- Corrected Metadata to include offset within the file. #95.
- Corrected to use proper config() method.
- Added missing file.offset header to test cases. #95.
- Removed duplicate check.
- Added MB per second calculations.
- Bump the backoff time to 500 ms if no files are returned.
- Lower the logging level when searching for files.
- Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
- Change manifest file's link to documentation (#102)
- Missed configuring tasks for SchemaLessJson. #97.
- Fixed Checkstyle.
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Modified to not require a key for the messages. Bumped version for extended-log-format. Fixes #117
- CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish <[email protected]>
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Documentation for metadata.field was not correct.
- Corrected extended-log-format version.
- Update docs for task.partitioner.
- Removed schema generation dependency.
- Updated examples.
- Co-authored-by: Jon Mitchell <[email protected]>
- Co-authored-by: Randall Hauch <[email protected]>
- Co-authored-by: Arjun Satish <[email protected]>
08c35d576db982d Jeremy Custenborder 2020-02-28 18:02:52
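The idea behind #105 can be sketched as follows: wrapping a FileInputStream in a BufferedInputStream lets many small reads be served from memory instead of each one reaching the filesystem. This is only a sketch; the class BufferedOpen and the bufferSizeBytes parameter (standing in for the value of file.buffer.size.bytes) are assumptions, and the connector's own classes are not shown.

```java
import java.io.BufferedInputStream;
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical helper; not the connector's InputFile class.
final class BufferedOpen {
  private BufferedOpen() {
  }

  static InputStream open(File file, int bufferSizeBytes) throws IOException {
    // Validate before opening so an invalid size never leaves a file handle open.
    if (bufferSizeBytes <= 0) {
      throw new IllegalArgumentException("bufferSizeBytes must be positive");
    }
    InputStream raw = new FileInputStream(file);
    // Reads are served from the in-memory buffer until it is exhausted.
    return new BufferedInputStream(raw, bufferSizeBytes);
  }
}
```

Closing the returned stream also closes the underlying FileInputStream, so a single owner can manage both.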
GitHub #106 V2.0
V2.0 (#106)
- Start of the 2.0 branch.
- Added support for sorting the order of files processed. Added support for multiple tasks. Refactored file selection logic out to predicates. Fixes #98. Fixes #97.
- Clean up policy to move files to subdirectories by date in finished directory (#103)
- Moving finished files to date folders
- Add test
- Removed duplicate call to listFiles.
- Moving the metadata to headers. Metadata currently was not used by the individual connectors. Fixed #94. Fixed #95.
- Refactored to remove usage of Path since we have that functionality in File. Lower logging of directory to trace.
- Refactored InputFile to own everything for the input file. Added metadata class that is responsible for the file metadata based on cached values from InputFile. Updated test cases to have values for headers, ignoring some of the headers that will be dynamic.
- Fixed java doc warnings.
- Added finishedPathRequired method to config. This will be used to determine if the tasks should check for a finished path. Moved ownership of the InputStream to be completely within InputFile.
- Fixed checkstyle.
- Major refactor to add support for using a BufferedInputStream. Renamed abstract classes to be consistent. Connectors can optionally support using a BufferedInputStream which should reduce the round trips to the underlying filesystem. file.buffer.size.bytes can be used to configure the buffer. Fixes #105.
- Corrected Metadata to include offset within the file. #95.
- Corrected to use proper config() method.
- Added missing file.offset header to test cases. #95.
- Removed duplicate check.
- Added MB per second calculations.
- Bump the backoff time to 500 ms if no files are returned.
- Lower the logging level when searching for files.
- Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
- Change manifest file's link to documentation (#102)
- Missed configuring tasks for SchemaLessJson. #97.
- Fixed Checkstyle.
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Modified to not require a key for the messages. Bumped version for extended-log-format. Fixes #117
- CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish <[email protected]>
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Documentation for metadata.field was not correct.
- Corrected extended-log-format version.
- Update docs for task.partitioner.
- Removed schema generation dependency.
- Updated examples.
- Co-authored-by: Jon Mitchell <[email protected]>
- Co-authored-by: Randall Hauch <[email protected]>
- Co-authored-by: Arjun Satish <[email protected]>
08c35d576db982d Jeremy Custenborder 2020-02-28 18:02:52
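The V2.0 notes mention moving file selection into predicates and sorting the order in which files are processed. A rough sketch of that shape, using only the JDK, might look like the following. The class FileSelection and its method are hypothetical and do not reflect the connector's actual classes or configuration keys.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical helper illustrating predicate-based selection plus ordering.
final class FileSelection {
  private FileSelection() {
  }

  static List<File> select(File inputDir, Predicate<File> filter, Comparator<File> order) {
    List<File> result = new ArrayList<>();
    // listFiles() returns null when the path is not a readable directory.
    File[] candidates = inputDir.listFiles();
    if (candidates == null) {
      return result;
    }
    for (File candidate : candidates) {
      if (candidate.isFile() && filter.test(candidate)) {
        result.add(candidate);
      }
    }
    // The comparator decides processing order, e.g. by name or last modified time.
    result.sort(order);
    return result;
  }
}
```

Predicates compose with and(), or(), and negate(), so a name pattern and a minimum file age could be combined without changing the selection loop.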
GitHub #109 CC-5455: Update support summary for LA connector
V2.0 (#106)
- Start of the 2.0 branch.
- Added support for sorting the order of files processed. Added support for multiple tasks. Refactored file selection logic out to predicates. Fixes #98. Fixes #97.
- Clean up policy to move files to subdirectories by date in finished directory (#103)
- Moving finished files to date folders
- Add test
- Removed duplicate call to listFiles.
- Moving the metadata to headers. Metadata currently was not used by the individual connectors. Fixed #94. Fixed #95.
- Refactored to remove usage of Path since we have that functionality in File. Lower logging of directory to trace.
- Refactored InputFile to own everything for the input file. Added metadata class that is responsible for the file metadata based on cached values from InputFile. Updated test cases to have values for headers, ignoring some of the headers that will be dynamic.
- Fixed java doc warnings.
- Added finishedPathRequired method to config. This will be used to determine if the tasks should check for a finished path. Moved ownership of the InputStream to be completely within InputFile.
- Fixed checkstyle.
- Major refactor to add support for using a BufferedInputStream. Renamed abstract classes to be consistent. Connectors can optionally support using a BufferedInputStream which should reduce the round trips to the underlying filesystem. file.buffer.size.bytes can be used to configure the buffer. Fixes #105.
- Corrected Metadata to include offset within the file. #95.
- Corrected to use proper config() method.
- Added missing file.offset header to test cases. #95.
- Removed duplicate check.
- Added MB per second calculations.
- Bump the backoff time to 500 ms if no files are returned.
- Lower the logging level when searching for files.
- Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
- Change manifest file's link to documentation (#102)
- Missed configuring tasks for SchemaLessJson. #97.
- Fixed Checkstyle.
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Modified to not require a key for the messages. Bumped version for extended-log-format. Fixes #117
- CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish <[email protected]>
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Documentation for metadata.field was not correct.
- Corrected extended-log-format version.
- Update docs for task.partitioner.
- Removed schema generation dependency.
- Updated examples.
- Co-authored-by: Jon Mitchell <[email protected]>
- Co-authored-by: Randall Hauch <[email protected]>
- Co-authored-by: Arjun Satish <[email protected]>
08c35d576db982d Jeremy Custenborder 2020-02-28 18:02:52
CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish <[email protected]>
32d2d69d4ac1e38 Arjun Satish 2019-07-11 00:03:05
GitHub #11 Corrected schema registry image.
Merge pull request #11 from jcustenborder/jenkinsfile
- Corrected schema registry image.
2c1ce07542ad2c8 Jeremy Custenborder 2016-11-04 20:05:28
GitHub #117 Spooldir ELF connector throws NullPointerException
V2.0 (#106)
- Start of the 2.0 branch.
- Added support for sorting the order of files processed. Added support for multiple tasks. Refactored file selection logic out to predicates. Fixes #98. Fixes #97.
- Clean up policy to move files to subdirectories by date in finished directory (#103)
- Moving finished files to date folders
- Add test
- Removed duplicate call to listFiles.
- Moving the metadata to headers. Metadata currently was not used by the individual connectors. Fixed #94. Fixed #95.
- Refactored to remove usage of Path since we have that functionality in File. Lower logging of directory to trace.
- Refactored InputFile to own everything for the input file. Added metadata class that is responsible for the file metadata based on cached values from InputFile. Updated test cases to have values for headers, ignoring some of the headers that will be dynamic.
- Fixed java doc warnings.
- Added finishedPathRequired method to config. This will be used to determine if the tasks should check for a finished path. Moved ownership of the InputStream to be completely within InputFile.
- Fixed checkstyle.
- Major refactor to add support for using a BufferedInputStream. Renamed abstract classes to be consistent. Connectors can optionally support using a BufferedInputStream which should reduce the round trips to the underlying filesystem. file.buffer.size.bytes can be used to configure the buffer. Fixes #105.
- Corrected Metadata to include offset within the file. #95.
- Corrected to use proper config() method.
- Added missing file.offset header to test cases. #95.
- Removed duplicate check.
- Added MB per second calculations.
- Bump the backoff time to 500 ms if no files are returned.
- Lower the logging level when searching for files.
- Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
- Change manifest file's link to documentation (#102)
- Missed configuring tasks for SchemaLessJson. #97.
- Fixed Checkstyle.
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Modified to not require a key for the messages. Bumped version for extended-log-format. Fixes #117
- CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish <[email protected]>
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Documentation for metadata.field was not correct.
- Corrected extended-log-format version.
- Update docs for task.partitioner.
- Removed schema generation dependency.
- Updated examples.
- Co-authored-by: Jon Mitchell <[email protected]>
- Co-authored-by: Randall Hauch <[email protected]>
- Co-authored-by: Arjun Satish <[email protected]>
08c35d576db982d Jeremy Custenborder 2020-02-28 18:02:52
GitHub #124 Bump commons-compress from 1.18 to 1.19 dependencies
Bump version. Update commons compress. (#125)
- Changed commons-compress version to [1.19,). Closes #124.
- Bump version to 2.0-SNAPSHOT.
d1b69e904e97c90 Jeremy Custenborder 2020-02-28 19:20:34
GitHub #125 Bump version. Update commons compress.
Bump version. Update commons compress. (#125)
- Changed commons-compress version to [1.19,). Closes #124.
- Bump version to 2.0-SNAPSHOT.
d1b69e904e97c90 Jeremy Custenborder 2020-02-28 19:20:34
GitHub #17 Schema refactor
Schema refactor (#17)
- Refactored to add support for json. Added experimental support for tracking offsets. Moved to require schemas to be supplied.
- Modified scope to ensure that jackson is in target directory.
- Updated the docker compose to the latest version
- Updated data files with changes from offsets.
- Added documentation test.
- Bumped version to 0.10.2.0-cp1.
- Added support to throw a contextual exception when a record cannot be parsed. This will throw the record and field.
- Added support to write records with a timestamp.
- Added description to connectors.
- Added generators to aid in building schemas for the input files. Added a command line utility as well.
- Added validation of character sets.
- Added support to dynamically generate schemas if schema.generation.enabled is set to true.
- Pulled out jfairy because it's no longer used.
- Updated the documentation.
- Modified so SimpleDateFormat is being set with the correct timezone. This was causing tests to fail in other timezones like UTC.
f14906f3fd81a79 Jeremy Custenborder 2017-05-29 02:56:36
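The SimpleDateFormat fix noted in #17 concerns a common pitfall: a SimpleDateFormat uses the JVM's default timezone unless one is set explicitly, so the same text can parse to different instants on different machines. A minimal sketch of setting the zone explicitly follows; the class ZonedDateParser and its parameters are hypothetical and are not the connector's code.

```java
import java.text.ParseException;
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.TimeZone;

// Hypothetical helper showing an explicit timezone on SimpleDateFormat.
final class ZonedDateParser {
  private ZonedDateParser() {
  }

  static Date parse(String text, String pattern, String zoneId) throws ParseException {
    // A new instance per call, because SimpleDateFormat is not thread-safe.
    SimpleDateFormat format = new SimpleDateFormat(pattern);
    // Without this call the JVM default timezone would be applied.
    format.setTimeZone(TimeZone.getTimeZone(zoneId));
    return format.parse(text);
  }
}
```

With the zone fixed, a test that parses a timestamp yields the same instant regardless of the timezone of the build machine.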
GitHub #22 Fix connectors validate method
Issue 22 (#24)
- bump opencsv to 3.10.
- Moved to use ValidDirectoryWritable from config-utils. Fixes #22
7a0ecc80908605e Jeremy Custenborder 2017-07-22 03:46:29
GitHub #23 Rst
Rst (#23)
- Added documentation.
- Changed to 0.10.2.1-cp2.
9c753e61623383f Jeremy Custenborder 2017-07-18 22:15:45
GitHub #24 Issue 22
Issue 22 (#24)
- bump opencsv to 3.10.
- Moved to use ValidDirectoryWritable from config-utils. Fixes #22
7a0ecc80908605e Jeremy Custenborder 2017-07-22 03:46:29
GitHub #3 Jenkinsfile
Merge pull request #3 from jcustenborder/jenkinsfile
- Jenkinsfile
50e2543c9770abc Jeremy Custenborder 2016-09-07 05:17:57
GitHub #34 Bumped kafka to 1.0.0
Bumped kafka to 1.0.0 (#34)
ee16eaba2f64fb8 Jeremy Custenborder 2017-11-04 03:00:53
GitHub #4 Updated license header.
Merge pull request #4 from jcustenborder/jenkinsfile
- Updated license header.
279a40f9ff13b20 Jeremy Custenborder 2016-09-07 05:21:13
GitHub #40 Incorrect offset handling of reprocessed JSON files
Corrected offset. fixes #40
7b36790f3587b20 Jeremy Custenborder 2018-03-27 16:32:38
GitHub #42 Connector Hub Support enhancement
Issue 42 (#43)
- Added assembly plugin. fixes #42.
- Corrected. Just a sink.
68b2e59226f0122 Jeremy Custenborder 2018-04-03 00:26:04
GitHub #43 Issue 42
Issue 42 (#43)
- Added assembly plugin. fixes #42.
- Corrected. Just a sink.
68b2e59226f0122 Jeremy Custenborder 2018-04-03 00:26:04
GitHub #45 kafka-run-class: command not found bug
Updated documentation to the corrected path for the build output. Fixes #45.
d0a498b0bf48e1d Jeremy Custenborder 2018-04-20 19:35:25
GitHub #48 Added support to decompress based on the file extension.
Added support to decompress based on the file extension. (#48)
cac32a7dc7ad8c1 Jeremy Custenborder 2018-04-21 02:13:21
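Decompression chosen by file extension, as described in #48, can be sketched with the JDK alone. This sketch handles only .gz through java.util.zip.GZIPInputStream; which extensions and which library the connector actually uses is not stated in this entry, so the class ExtensionDecompressor and its behavior are assumptions for illustration.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.Locale;
import java.util.zip.GZIPInputStream;

// Hypothetical helper; only the .gz case is covered in this sketch.
final class ExtensionDecompressor {
  private ExtensionDecompressor() {
  }

  static InputStream wrap(String fileName, InputStream raw) throws IOException {
    // Compare case-insensitively so "DATA.CSV.GZ" is treated like "data.csv.gz".
    String lower = fileName.toLowerCase(Locale.ROOT);
    if (lower.endsWith(".gz")) {
      return new GZIPInputStream(raw);
    }
    // Unknown extensions are passed through unchanged.
    return raw;
  }
}
```

Further formats could be added as extra branches that return other decompressing stream wrappers.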
GitHub #5 Changed to include developer SCM data so the releases plugin will upl…
Merge pull request #5 from jcustenborder/jenkinsfile
- Changed to include developer SCM data so the releases plugin will upl…
eb377bbe953c218 Jeremy Custenborder 2016-09-07 05:27:14
GitHub #50 Add support for extended log format.
Issue 50 (#51)
- Partial support for ELF. #50
- Full support for ELF. Added InputFileDeque to reduce the number of calls to list the directory contents. This will query only when needed to reduce the number of stat calls.
b85ffb68636b29b Jeremy Custenborder 2018-05-02 14:02:38
GitHub #51 Issue 50
Issue 50 (#51)
- Partial support for ELF. #50
- Full support for ELF. Added InputFileDeque to reduce the number of calls to list the directory contents. This will query only when needed to reduce the number of stat calls.
b85ffb68636b29b Jeremy Custenborder 2018-05-02 14:02:38
GitHub #52 Support a configurable cleanup policy. enhancement
Added support for a configurable delete policy. Fixes #52. (#53)
b8f1b19cc2a1ca4 Jeremy Custenborder 2018-05-02 21:51:02
GitHub #53 Added support for a configurable delete policy. Fixes #52.
Added support for a configurable delete policy. Fixes #52. (#53)
b8f1b19cc2a1ca4 Jeremy Custenborder 2018-05-02 21:51:02
GitHub #56 bump version enhancement
Schemaless json support (#74)
- Work towards refactoring schema support to its own class. Moved filesystem logic to a set of parent classes.
- SourceOffset was not getting populated to the record.
- Should return null if there are no records.
- Corrected unit test to check that nulls are returned instead of an empty list.
- Added support for importing json data without a schema. Fixes #70. Fixes #73. Fixes #69. Fixes #60. Fixes #56.
ec15c3a2a9a046a Jeremy Custenborder 2019-01-22 23:17:42
GitHub #6 Jenkinsfile
Merge pull request #6 from jcustenborder/jenkinsfile
- Jenkinsfile
efa7c514fbe83f2 Jeremy Custenborder 2016-09-07 05:39:29
GitHub #60 Build Error
Schemaless json support (#74)
- Work towards refactoring schema support to its own class. Moved filesystem logic to a set of parent classes.
- SourceOffset was not getting populated to the record.
- Should return null if there are no records.
- Corrected unit test to check that nulls are returned instead of an empty list.
- Added support for importing json data without a schema. Fixes #70. Fixes #73. Fixes #69. Fixes #60. Fixes #56.
ec15c3a2a9a046a Jeremy Custenborder 2019-01-22 23:17:42
GitHub #69 Test failing when building
Schemaless json support (#74)
- Work towards refactoring schema support to its own class. Moved filesystem logic to a set of parent classes.
- SourceOffset was not getting populated to the record.
- Should return null if there are no records.
- Corrected unit test to check that nulls are returned instead of an empty list.
- Added support for importing json data without a schema. Fixes #70. Fixes #73. Fixes #69. Fixes #60. Fixes #56.
ec15c3a2a9a046a Jeremy Custenborder 2019-01-22 23:17:42
GitHub #70 Add method to import JSON without schema. enhancement
Schemaless json support (#74)
- Work towards refactoring schema support to its own class. Moved filesystem logic to a set of parent classes.
- SourceOffset was not getting populated to the record.
- Should return null if there are no records.
- Corrected unit test to check that nulls are returned instead of an empty list.
- Added support for importing json data without a schema. Fixes #70. Fixes #73. Fixes #69. Fixes #60. Fixes #56.
ec15c3a2a9a046a Jeremy Custenborder 2019-01-22 23:17:42
GitHub #72 File processing fails in the middle
Rebalance support (#96)
- Refactored Cleanup policy to its own class. Introduced the concept of an InputFile which owns the processing file. close() is responsible for closing the inputStream and removing the processing file. Fixes #93. Fixes #72.
- Documentation cleanup.
- Ensure that the input file is closed when the task is stopped. This will ensure that the processing file is removed when the task is stopped. This will ensure that the connector handles rebalances. Fixes #93. Fixes #72.
- Added rebalance unit test that writes 100 records. Polls 50. Stops the connector and starts the connector. Polls again. The connector should pick up at the last line that it read.
- Refactored test classes to make things a little easier to extend.
- Removed unneeded prefix.
- Check to ensure file exists before moving it.
- Moved InputFile to own the InputStream.
- Added support language from #99.
- Create the stopwatch unstarted.
- Changed support information.
- Cleanup of logging. All implementations should be using their own logger.
- Bump docker compose to the latest Confluent platform version.
fb747d38434388e Jeremy Custenborder 2019-05-31 22:23:20
GitHub #73 Maven build not working on Mac with Java 8 (test failures)
Schemaless json support (#74)
- Work towards refactoring schema support to its own class. Moved filesystem logic to a set of parent classes.
- SourceOffset was not getting populated to the record.
- Should return null if there are no records.
- Corrected unit test to check that nulls are returned instead of an empty list.
- Added support for importing json data without a schema. Fixes #70. Fixes #73. Fixes #69. Fixes #60. Fixes #56.
ec15c3a2a9a046a Jeremy Custenborder 2019-01-22 23:17:42
GitHub #74 Schemaless json support
Schemaless json support (#74)
- Work towards refactoring schema support to its own class. Moved filesystem logic to a set of parent classes.
- SourceOffset was not getting populated to the record.
- Should return null if there are no records.
- Corrected unit test to check that nulls are returned instead of an empty list.
- Added support for importing json data without a schema. Fixes #70. Fixes #73. Fixes #69. Fixes #60. Fixes #56.
ec15c3a2a9a046a Jeremy Custenborder 2019-01-22 23:17:42
GitHub #77 csv.separator.char=0 does not work
Line delimited support (#84)
- Added support to read files line by line. Fixes #82
- Changed the title of the connector on the documentation website.
- Changed to support using different parsers. Added support for null separator characters. Fixes #77. Added support for RFC4180 Parsing. Fixes #83.
750d2b25411d3c5 Jeremy Custenborder 2019-03-15 15:01:24
GitHub #78 Updated dependencies for CVEs bug
Update opencsv (#80)
c6111bd4c9532e4 Jeremy Custenborder 2019-02-06 17:05:03
CVE fixes (#79)
a81ff122b053f7f Jeremy Custenborder 2019-02-06 02:54:41
GitHub #79 CVE fixes
CVE fixes (#79)
a81ff122b053f7f Jeremy Custenborder 2019-02-06 02:54:41
GitHub #8 Changed the displayname for a few fields that were using the same entry
Merge pull request #8 from cjmatta/fix_displayname
- Changed the displayname for a few fields that were using the same entry
38423326c898f41 Jeremy Custenborder 2016-09-25 03:51:44
GitHub #80 Update opencsv
Update opencsv (#80)
c6111bd4c9532e4 Jeremy Custenborder 2019-02-06 17:05:03
GitHub #82 Add support to read new line delimited files. enhancement
Line delimited support (#84)
- Added support to read files line by line. Fixes #82
- Changed the title of the connector on the documentation website.
- Changed to support using different parsers. Added support for null separator characters. Fixes #77. Added support for RFC4180 Parsing. Fixes #83.
750d2b25411d3c5 Jeremy Custenborder 2019-03-15 15:01:24
GitHub #83 Add support for the RFC4180Parser enhancement
Line delimited support (#84)
- Added support to read files line by line. Fixes #82
- Changed the title of the connector on the documentation website.
- Changed to support using different parsers. Added support for null separator characters. Fixes #77. Added support for RFC4180 Parsing. Fixes #83.
750d2b25411d3c5 Jeremy Custenborder 2019-03-15 15:01:24
GitHub #84 Line delimited support
Line delimited support (#84)
- Added support to read files line by line. Fixes #82
- Changed the title of the connector on the documentation website.
- Changed to support using different parsers. Added support for null separator characters. Fixes #77. Added support for RFC4180 Parsing. Fixes #83.
750d2b25411d3c5 Jeremy Custenborder 2019-03-15 15:01:24
GitHub #87 Issue 86
Issue 86 (#87)
- Bump to 2.1.1-cp1. Added additional documentation and examples.
- Added example of using a csv with a schema.
88b6db1f1650e59 Jeremy Custenborder 2019-03-20 22:54:09
GitHub #93 halt.on.error=false doesn't work
Rebalance support (#96)
- Refactored Cleanup policy to its own class. Introduced the concept of an InputFile which owns the processing file. close() is responsible for closing the inputStream and removing the processing file. Fixes #93. Fixes #72.
- Documentation cleanup.
- Ensure that the input file is closed when the task is stopped. This will ensure that the processing file is removed when the task is stopped. This will ensure that the connector handles rebalances. Fixes #93. Fixes #72.
- Added rebalance unit test that writes 100 records. Polls 50. Stops the connector and starts the connector. Polls again. The connector should pick up at the last line that it read.
- Refactored test classes to make things a little easier to extend.
- Removed unneeded prefix.
- Check to ensure file exists before moving it.
- Moved InputFile to own the InputStream.
- Added support language from #99.
- Create the stopwatch unstarted.
- Changed support information.
- Cleanup of logging. All implementations should be using their own logger.
- Bump docker compose to the latest Confluent platform version.
fb747d38434388e Jeremy Custenborder 2019-05-31 22:23:20
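The rebalance test described in this entry (write 100 records, poll 50, stop, start, poll again) can be illustrated with a small sketch. This is not the connector's code: the connector is a Java Kafka Connect plugin, and the `LineSource` class, its `poll` method, and the `committed_offset` parameter below are hypothetical names chosen for illustration. The sketch only assumes that the number of lines already delivered is saved on stop and handed back on restart.

```python
class LineSource:
    """Toy stand-in for a task that reads an input file line by line (hypothetical)."""

    def __init__(self, lines, committed_offset=0):
        self.lines = lines
        # Number of lines already delivered; a restarted task resumes from here.
        self.offset = committed_offset

    def poll(self, max_records):
        """Return up to max_records lines, starting at the current offset."""
        batch = self.lines[self.offset:self.offset + max_records]
        self.offset += len(batch)
        return batch


records = [f"record-{i}" for i in range(100)]

# First run: poll half of the records, then "stop" the task.
task = LineSource(records)
first = task.poll(50)
saved_offset = task.offset  # assumed to be persisted when the task stops

# Restart: a new task instance picks up at the saved offset.
task = LineSource(records, committed_offset=saved_offset)
second = task.poll(50)
```

With the saved offset passed back in, the second poll continues at the line after the last one read, so no record is repeated or skipped.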
GitHub #94 Include source file name for line delimited connector enhancement
V2.0 (#106)
- Start of the 2.0 branch.
- Added support for sorting the order of files processed. Added support for multiple tasks. Refactored file selection logic out to predicates. Fixes #98. Fixes #97.
- Clean up policy to move files to subdirectories by date in finished directory (#103)
- Moving finished files to date folders
- Add test
- Removed duplicate call to listFiles.
- Moving the metadata to headers. Metadata currently was not used by the individual connectors. Fixed #94. Fixed #95.
- Refactored to remove usage of Path since we have that functionality in File. Lower logging of directory to trace.
- Refactored InputFile to own everything for the input file. Added metadata class that is responsible for the file metadata based on cached values from InputFile. Updated test cases to have values for headers, ignoring some of the headers that will be dynamic.
- Fixed java doc warnings.
- Added finishedPathRequired method to config. This will be used to determine if the tasks should check for a finished path. Moved ownership of the InputStream to be completely within InputFile.
- Fixed checkstyle.
- Major refactor to add support for using a BufferedInputStream. Renamed abstract classes to be consistent. Connectors can optionally support using a BufferedInputStream which should reduce the round trips to the underlying filesystem. file.buffer.size.bytes can be used to configure the buffer. Fixes #105.
- Corrected Metadata to include offset within the file. #95.
- Corrected to use proper config() method.
- Added missing file.offset header to test cases. #95.
- Removed duplicate check.
- Added MB per second calculations.
- Bump the backoff time to 500 ms if no files are returned.
- Lower the logging level when searching for files.
- Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
- Change manifest file's link to documentation (#102)
- Missed configuring tasks for SchemaLessJson. #97.
- Fixed Checkstyle.
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Modified to not require a key for the messages. Bumped version for extended-log-format. Fixes #117
- CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Documentation for metadata.field was not correct.
- Corrected extended-log-format version.
- Update docs for task.partitioner.
- Removed schema generation dependency.
- Updated examples.
- Co-authored-by: Jon Mitchell
- Co-authored-by: Randall Hauch
- Co-authored-by: Arjun Satish
08c35d576db982d Jeremy Custenborder 2020-02-28 18:02:52
GitHub #95 Add metadata about the file read to headers in the message. enhancement
V2.0 (#106)
- Start of the 2.0 branch.
- Added support for sorting the order of files processed. Added support for multiple tasks. Refactored file selection logic out to predicates. Fixes #98. Fixes #97.
- Clean up policy to move files to subdirectories by date in finished directory (#103)
- Moving finished files to date folders
- Add test
- Removed duplicate call to listFiles.
- Moving the metadata to headers. Metadata currently was not used by the individual connectors. Fixed #94. Fixed #95.
- Refactored to remove usage of Path since we have that functionality in File. Lower logging of directory to trace.
- Refactored InputFile to own everything for the input file. Added metadata class that is responsible for the file metadata based on cached values from InputFile. Updated test cases to have values for headers, ignoring some of the headers that will be dynamic.
- Fixed java doc warnings.
- Added finishedPathRequired method to config. This will be used to determine if the tasks should check for a finished path. Moved ownership of the InputStream to be completely within InputFile.
- Fixed checkstyle.
- Major refactor to add support for using a BufferedInputStream. Renamed abstract classes to be consistent. Connectors can optionally support using a BufferedInputStream which should reduce the round trips to the underlying filesystem. file.buffer.size.bytes can be used to configure the buffer. Fixes #105.
- Corrected Metadata to include offset within the file. #95.
- Corrected to use proper config() method.
- Added missing file.offset header to test cases. #95.
- Removed duplicate check.
- Added MB per second calculations.
- Bump the backoff time to 500 ms if no files are returned.
- Lower the logging level when searching for files.
- Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
- Change manifest file's link to documentation (#102)
- Missed configuring tasks for SchemaLessJson. #97.
- Fixed Checkstyle.
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Modified to not require a key for the messages. Bumped version for extended-log-format. Fixes #117
- CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Documentation for metadata.field was not correct.
- Corrected extended-log-format version.
- Update docs for task.partitioner.
- Removed schema generation dependency.
- Updated examples.
- Co-authored-by: Jon Mitchell
- Co-authored-by: Randall Hauch
- Co-authored-by: Arjun Satish
08c35d576db982d Jeremy Custenborder 2020-02-28 18:02:52
GitHub #96 Rebalance support
Rebalance support (#96)
- Refactored Cleanup policy to its own class. Introduced the concept of an InputFile which owns the processing file. close() is responsible for closing the inputStream and removing the processing file. Fixes #93. Fixes #72.
- Documentation cleanup.
- Ensure that the input file is closed when the task is stopped. This will ensure that the processing file is removed when the task is stopped. This will ensure that the connector handles rebalances. Fixes #93. Fixes #72.
- Added rebalance unit test that writes 100 records. Polls 50. Stops the connector and starts the connector. Polls again. The connector should pick up at the last line that it read.
- Refactored test classes to make things a little easier to extend.
- Removed unneeded prefix.
- Check to ensure file exists before moving it.
- Moved InputFile to own the InputStream.
- Added support language from #99.
- Create the stopwatch unstarted.
- Changed support information.
- Cleanup of logging. All implementations should be using their own logger.
- Bump docker compose to the latest Confluent platform version.
fb747d38434388e Jeremy Custenborder 2019-05-31 22:23:20
GitHub #97 Support more than one task enhancement
V2.0 (#106)
- Start of the 2.0 branch.
- Added support for sorting the order of files processed. Added support for multiple tasks. Refactored file selection logic out to predicates. Fixes #98. Fixes #97.
- Clean up policy to move files to subdirectories by date in finished directory (#103)
- Moving finished files to date folders
- Add test
- Removed duplicate call to listFiles.
- Moving the metadata to headers. Metadata currently was not used by the individual connectors. Fixed #94. Fixed #95.
- Refactored to remove usage of Path since we have that functionality in File. Lower logging of directory to trace.
- Refactored InputFile to own everything for the input file. Added metadata class that is responsible for the file metadata based on cached values from InputFile. Updated test cases to have values for headers, ignoring some of the headers that will be dynamic.
- Fixed java doc warnings.
- Added finishedPathRequired method to config. This will be used to determine if the tasks should check for a finished path. Moved ownership of the InputStream to be completely within InputFile.
- Fixed checkstyle.
- Major refactor to add support for using a BufferedInputStream. Renamed abstract classes to be consistent. Connectors can optionally support using a BufferedInputStream which should reduce the round trips to the underlying filesystem. file.buffer.size.bytes can be used to configure the buffer. Fixes #105.
- Corrected Metadata to include offset within the file. #95.
- Corrected to use proper config() method.
- Added missing file.offset header to test cases. #95.
- Removed duplicate check.
- Added MB per second calculations.
- Bump the backoff time to 500 ms if no files are returned.
- Lower the logging level when searching for files.
- Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
- Change manifest file's link to documentation (#102)
- Missed configuring tasks for SchemaLessJson. #97.
- Fixed Checkstyle.
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Modified to not require a key for the messages. Bumped version for extended-log-format. Fixes #117
- CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Documentation for metadata.field was not correct.
- Corrected extended-log-format version.
- Update docs for task.partitioner.
- Removed schema generation dependency.
- Updated examples.
- Co-authored-by: Jon Mitchell
- Co-authored-by: Randall Hauch
- Co-authored-by: Arjun Satish
08c35d576db982d Jeremy Custenborder 2020-02-28 18:02:52
GitHub #98 Support allowing the user to control the sort order of the files to be processed. enhancement
V2.0 (#106)
- Start of the 2.0 branch.
- Added support for sorting the order of files processed. Added support for multiple tasks. Refactored file selection logic out to predicates. Fixes #98. Fixes #97.
- Clean up policy to move files to subdirectories by date in finished directory (#103)
- Moving finished files to date folders
- Add test
- Removed duplicate call to listFiles.
- Moving the metadata to headers. Metadata currently was not used by the individual connectors. Fixed #94. Fixed #95.
- Refactored to remove usage of Path since we have that functionality in File. Lower logging of directory to trace.
- Refactored InputFile to own everything for the input file. Added metadata class that is responsible for the file metadata based on cached values from InputFile. Updated test cases to have values for headers, ignoring some of the headers that will be dynamic.
- Fixed java doc warnings.
- Added finishedPathRequired method to config. This will be used to determine if the tasks should check for a finished path. Moved ownership of the InputStream to be completely within InputFile.
- Fixed checkstyle.
- Major refactor to add support for using a BufferedInputStream. Renamed abstract classes to be consistent. Connectors can optionally support using a BufferedInputStream which should reduce the round trips to the underlying filesystem. file.buffer.size.bytes can be used to configure the buffer. Fixes #105.
- Corrected Metadata to include offset within the file. #95.
- Corrected to use proper config() method.
- Added missing file.offset header to test cases. #95.
- Removed duplicate check.
- Added MB per second calculations.
- Bump the backoff time to 500 ms if no files are returned.
- Lower the logging level when searching for files.
- Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
- Change manifest file's link to documentation (#102)
- Missed configuring tasks for SchemaLessJson. #97.
- Fixed Checkstyle.
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Modified to not require a key for the messages. Bumped version for extended-log-format. Fixes #117
- CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Documentation for metadata.field was not correct.
- Corrected extended-log-format version.
- Update docs for task.partitioner.
- Removed schema generation dependency.
- Updated examples.
- Co-authored-by: Jon Mitchell
- Co-authored-by: Randall Hauch
- Co-authored-by: Arjun Satish
08c35d576db982d Jeremy Custenborder 2020-02-28 18:02:52
GitHub #99 Added Confluent support information
Rebalance support (#96)
- Refactored Cleanup policy to its own class. Introduced the concept of an InputFile which owns the processing file. close() is responsible for closing the inputStream and removing the processing file. Fixes #93. Fixes #72.
- Documentation cleanup.
- Ensure that the input file is closed when the task is stopped. This will ensure that the processing file is removed when the task is stopped. This will ensure that the connector handles rebalances. Fixes #93. Fixes #72.
- Added rebalance unit test that writes 100 records. Polls 50. Stops the connector and starts the connector. Polls again. The connector should pick up at the last line that it read.
- Refactored test classes to make things a little easier to extend.
- Removed unneeded prefix.
- Check to ensure file exists before moving it.
- Moved InputFile to own the InputStream.
- Added support language from #99.
- Create the stopwatch unstarted.
- Changed support information.
- Cleanup of logging. All implementations should be using their own logger.
- Bump docker compose to the latest Confluent platform version.
fb747d38434388e Jeremy Custenborder 2019-05-31 22:23:20
Jira CC-5455
V2.0 (#106)
- Start of the 2.0 branch.
- Added support for sorting the order of files processed. Added support for multiple tasks. Refactored file selection logic out to predicates. Fixes #98. Fixes #97.
- Clean up policy to move files to subdirectories by date in finished directory (#103)
- Moving finished files to date folders
- Add test
- Removed duplicate call to listFiles.
- Moving the metadata to headers. Metadata currently was not used by the individual connectors. Fixed #94. Fixed #95.
- Refactored to remove usage of Path since we have that functionality in File. Lower logging of directory to trace.
- Refactored InputFile to own everything for the input file. Added metadata class that is responsible for the file metadata based on cached values from InputFile. Updated test cases to have values for headers, ignoring some of the headers that will be dynamic.
- Fixed java doc warnings.
- Added finishedPathRequired method to config. This will be used to determine if the tasks should check for a finished path. Moved ownership of the InputStream to be completely within InputFile.
- Fixed checkstyle.
- Major refactor to add support for using a BufferedInputStream. Renamed abstract classes to be consistent. Connectors can optionally support using a BufferedInputStream which should reduce the round trips to the underlying filesystem. file.buffer.size.bytes can be used to configure the buffer. Fixes #105.
- Corrected Metadata to include offset within the file. #95.
- Corrected to use proper config() method.
- Added missing file.offset header to test cases. #95.
- Removed duplicate check.
- Added MB per second calculations.
- Bump the backoff time to 500 ms if no files are returned.
- Lower the logging level when searching for files.
- Correct Connect Packaging plugin configuration (#101)
- Correct Connect Packaging plugin configuration so that the project's packaging configuration properties properly override those inherited from the parent POM.
- Change manifest file's link to documentation (#102)
- Missed configuring tasks for SchemaLessJson. #97.
- Fixed Checkstyle.
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Modified to not require a key for the messages. Bumped version for extended-log-format. Fixes #117
- CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish
- Refactored to allow metadata to be placed in a header, field, or dropped.
- Documentation for metadata.field was not correct.
- Corrected extended-log-format version.
- Update docs for task.partitioner.
- Removed schema generation dependency.
- Updated examples.
- Co-authored-by: Jon Mitchell
- Co-authored-by: Randall Hauch
- Co-authored-by: Arjun Satish
08c35d576db982d Jeremy Custenborder 2020-02-28 18:02:52
CC-5455: Update support summary for LA connector (#109)
- Signed-off-by: Arjun Satish
32d2d69d4ac1e38 Arjun Satish 2019-07-11 00:03:05
No issue
Add the number of records imported.
82aeb6ca2f5a356 Jeremy Custenborder 2018-05-09 20:53:47
Make the separator character configurable.
95d384fa37db2bb Jeremy Custenborder 2018-05-09 20:53:20
Added check for cleanup policy. Added log message when removing file.
e6c6ec4d30d91f9 Jeremy Custenborder 2018-05-02 23:03:20
validValues should not return null.
3af4d58257f6dfc Jeremy Custenborder 2018-05-02 22:40:13
Cleanup of configuration documentation.
cb67558c6a4931b Jeremy Custenborder 2018-04-20 22:23:26
Cleanup of configuration documentation.
ac8c4e452402c16 Jeremy Custenborder 2018-04-20 21:38:03
Updated documentation
1a52576b4719e95 Jeremy Custenborder 2017-11-07 22:07:02
Corrected schema registry image.
464603a3ea40a4b Jeremy Custenborder 2016-11-04 20:04:03
Changed the displayname parameter for a few fields in the CSVRecordProcessorConfig
6e8954d925d9971 Chris Matta 2016-09-15 18:02:51
Moved all of the documentation back to markdown.
45a3eacf9c01464 Jeremy Custenborder 2016-09-07 05:38:53
Changed assembly to output a tar.gz
7b2dce5e2c593b6 Jeremy Custenborder 2016-09-07 05:33:11
Changed to include developer SCM data so the releases plugin will upload properly.
cb7abea45b497d6 Jeremy Custenborder 2016-09-07 05:26:38
Updated license header.
243fbb8f6108bcc Jeremy Custenborder 2016-09-07 05:20:13
Deploy to github releases.
f12eba253a6f4b6 Jeremy Custenborder 2016-09-07 05:16:08
Added logging for testing.
3748e346703a5a1 Jeremy Custenborder 2016-08-23 03:35:59
Added confluent repository.
c8610c37fb26fba Jeremy Custenborder 2016-08-23 03:30:09
Initial commit for Jenkinsfile.
139ed2da2514c70 Jeremy Custenborder 2016-08-23 03:26:10
Changes to pull connect-utils from maven central.
a98dc0b7aa32fff Jeremy Custenborder 2016-08-23 03:21:57
Added settings to control the output schema name for a generated schema.
02a0071767974d3 Jeremy Custenborder 2016-07-19 18:22:43
Corrected test cases for maven. Added metadata to the record if configured.
c606245639e26bb Jeremy Custenborder 2016-07-12 13:22:43
Cleanup of refactor in strings.
d61d1d8b5af3ee9 Jeremy Custenborder 2016-07-11 21:56:28
Rename the file once before it has started processing.
a060f567ad6eb10 Jeremy Custenborder 2016-07-11 17:57:17
Added more testing around not having keys defined. This should result in a null schema and a null object ending up in the connect record.
7c49df8804a986f Jeremy Custenborder 2016-07-10 02:28:24
Added more test cases around non defined schemas. Extend SchemaConfig tests to validate the key schema. Refactored CSVRecordProcessor to use data package. Added more documentation to RST.
e2361cc656c32df Jeremy Custenborder 2016-07-10 02:16:31
Hopefully corrected RST.
1851a2b7b12de45 Jeremy Custenborder 2016-07-10 01:15:01
Added more documentation and moved some common functionality out to connect-utils.
00ceb01d64764a2 Jeremy Custenborder 2016-07-10 01:13:51
Cleanup schema generation code. Clarified support for generating a schema based on the header row. Indexes for specified schemas are indexed based on the header row if it is used. Added checkstyle.
28c0590ebe8f62b Jeremy Custenborder 2016-07-09 17:05:05
Moved schema generation over to SchemaConfig class. Added tests for this. Added license headers to all classes.
afae51b55c605dd Jeremy Custenborder 2016-07-08 15:30:00
Added scripts to run with debugging or suspend with debugging.
bbeca70262f7db1 Jeremy Custenborder 2016-07-08 07:48:35
Corrected formatting.
41d0c16a077f3e0 Jeremy Custenborder 2016-07-08 07:46:18
Corrected formatting.
f8982d33f93582c Jeremy Custenborder 2016-07-08 07:44:39
Added documentation on how to build.
04186b322fd4934 Jeremy Custenborder 2016-07-08 07:43:54
Major refactor to combine all of the configs and go to a json based configuration for the connect schema. Cleanup work is still needed in the case you are not specifying fields.
01ae8d6d6ed34b2 Jeremy Custenborder 2016-07-08 07:35:56
Moved charset setting to file.charset for consistency.
c1a35c307eba3dd Jeremy Custenborder 2016-07-07 15:25:11
Added setting to allow a minimum file age.
50aeca0258424cd Jeremy Custenborder 2016-07-07 15:24:38
Added some logging defaulting to 20x the batch size.
1cf63344ba37147 Jeremy Custenborder 2016-07-07 06:58:46
Cleanup the documentation for null fields in a CSV.
120ea0e56f98392 Jeremy Custenborder 2016-07-07 06:47:58
Remove the markdown for the rst.
ce0e1a248555346 Jeremy Custenborder 2016-07-07 06:43:11
Added more documentation.
f69780dd8ca458b Jeremy Custenborder 2016-07-07 06:42:17
First working pass supporting both a text only mode directly from the header of a CSV and specific configuration through the config file. Decent parsing speed.
bdbe92a2fa4e4ff Jeremy Custenborder 2016-07-07 05:44:24
Added more documentation.
0535f425d798085 Jeremy Custenborder 2016-07-06 22:01:41
Added indexes to field config. Added more type testing.
abce419831b517a Jeremy Custenborder 2016-07-06 21:48:17
Cleanup from code analysis.
e9b3d567924f5b6 Jeremy Custenborder 2016-07-05 05:51:05
Corrected file formatting.
79b79bbfcceac7b Jeremy Custenborder 2016-07-05 05:12:00
Changed to support changing the file encoding from the default.
992f5bb7b4b09ba Jeremy Custenborder 2016-07-05 05:11:02
Cleanup of some errors from FindBugs.
2b442f83d65b084 Jeremy Custenborder 2016-07-05 02:18:27
Cleanup of some errors from FindBugs.
448e6bcee2819ee Jeremy Custenborder 2016-07-05 02:11:43
Cleanup formatting.
e08209932d6d144 Jeremy Custenborder 2016-07-05 02:03:18
CSV Support is working. Directory polling as well.
e58fe84c800daad Jeremy Custenborder 2016-07-05 02:02:08