pyspark.sql.functions.input_file_block_start
pyspark.sql.functions.input_file_block_start() → pyspark.sql.column.Column
Returns the start offset of the block being read, or -1 if not available.
New in version 3.5.0.
Examples
>>> from pyspark.sql.functions import input_file_block_start
>>> df = spark.read.text("python/test_support/sql/ages_newlines.csv", lineSep=",")
>>> df.select(input_file_block_start().alias('r')).first()
Row(r=0)