Problem Description
The following is the value of the StringIO object:
print(csv_log_stream.getvalue())
Output:
"2022-06-03 14:03:12,385","azure_functions_worker","INFO","Successfully processed FunctionLoadRequest, request ID: 58ce716e-46b1-4093-9991-5bc773ebfbdb, function ID: 17dda4e6-52af-4c8b-a0b9-17c42812449b,function Name: ProcessWebSaleExportFilesInRSBlobStorage"
"2022-06-03 14:03:12,388","azure_functions_worker","INFO","Received FunctionLoadRequest, request ID: 58ce716e-46b1-4093-9991-5bc773ebfbdb, function ID: 373c225f-b33b-4233-8bb6-a288eb3bae8ffunction Name: ReadIntegrationInterfaceConfiguration"
"2022-06-03 14:03:12,396","azure_functions_worker","INFO","Successfully processed FunctionLoadRequest, request ID: 58ce716e-46b1-4093-9991-5bc773ebfbdb, function ID: 373c225f-b33b-4233-8bb6-a288eb3bae8f,function Name: ReadIntegrationInterfaceConfiguration"
"2022-06-03 14:03:12,401","azure_functions_worker","INFO","Received FunctionLoadRequest, request ID: 58ce716e-46b1-4093-9991-5bc773ebfbdb, function ID: ab9015b6-c421-4611-8678-d9aef0dc8967function Name: SaveLogsToRSBlobStorage"
"2022-06-03 14:03:12,410","azure_functions_worker","INFO","Successfully processed FunctionLoadRequest, request ID: 58ce716e-46b1-4093-9991-5bc773ebfbdb, function ID: ab9015b6-c421-4611-8678-d9aef0dc8967,function Name: SaveLogsToRSBlobStorage"
"2022-06-03 14:03:12,492","azure_functions_worker","INFO","Received FunctionInvocationRequest, request ID: 58ce716e-46b1-4093-9991-5bc773ebfbdb, function ID: 17dda4e6-52af-4c8b-a0b9-17c42812449b, function name: ProcessWebSaleExportFilesInRSBlobStorage, invocation ID: 3e013911-b0c9-41e1-80a5-e05a0c611cf9, function type: sync, sync threadpool max workers: 1000"
"2022-06-03 14:03:12,507","root","INFO","Python HTTP trigger :: ProcessWebSaleExportFilesInRSBlobStorage function processed a request."
"2022-06-03 14:03:12,511","root","INFO","Processing Request object started for the desired parameters."
"2022-06-03 14:03:12,514","root","INFO","Processing Request object completed for the desired parameters."
"2022-06-03 14:03:12,517","root","INFO","Processing Request object started for the desired parameters."
"2022-06-03 14:03:12,520","root","INFO","Processing Request object completed for the desired parameters."
"2022-06-03 14:03:12,566","azure.core.pipeline.policies.http_logging_policy","INFO","Request URL: 'https://koxdsrssa.blob.core.windows.net/koxds-export?restype=REDACTED&comp=REDACTED&prefix=REDACTED&st=REDACTED&se=REDACTED&sp=REDACTED&sv=REDACTED&sr=REDACTED&sig=REDACTED'
Request method: 'GET'
Request headers:
'x-ms-version': 'REDACTED'
'Accept': 'application/xml'
'User-Agent': 'azsdk-python-storage-blob/12.12.0 Python/3.8.12 (Windows-10-10.0.19044-SP0)'
'x-ms-date': 'REDACTED'
'x-ms-client-request-id': '246e6f5e-e335-11ec-a140-48a4728e3a8b'
No body was attached to the request"
"2022-06-03 14:03:13,286","azure.core.pipeline.policies.http_logging_policy","INFO","Response status: 200
Response headers:
'Transfer-Encoding': 'chunked'
'Content-Type': 'application/xml'
'Server': 'Windows-Azure-Blob/1.0 Microsoft-HTTPAPI/2.0'
'x-ms-request-id': '15851041-001e-0057-1741-776bdc000000'
'x-ms-client-request-id': '246e6f5e-e335-11ec-a140-48a4728e3a8b'
'x-ms-version': 'REDACTED'
'Access-Control-Expose-Headers': 'REDACTED'
'Access-Control-Allow-Origin': 'REDACTED'
'Date': 'Fri, 03 Jun 2022 12:03:12 GMT'"
Reading from the StringIO into a DataFrame:
df_logs = pd.read_csv(csv_log_stream, names=["Timestamp", "LogName", "LogLevel", "LogMessage"])
print(df_logs)
Output:
Empty DataFrame
Columns: [Timestamp, LogName, LogLevel, LogMessage]
Index: []
What have I tried?
Try #1
df_logs = pd.read_csv(csv_log_stream)
print(df_logs)
OR
df_logs = pd.read_csv(csv_log_stream, header=None)
print(df_logs)
Output:
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "ProjectDir\\.venv\lib\site-packages\pandas\util\_decorators.py", line 311, in wrapper
    return func(*args, **kwargs)
  File "ProjectDir\\.venv\lib\site-packages\pandas\io\parsers\readers.py", line 680, in read_csv
    return _read(filepath_or_buffer, kwds)
  File "ProjectDir\\.venv\lib\site-packages\pandas\io\parsers\readers.py", line 575, in _read
    parser = TextFileReader(filepath_or_buffer, **kwds)
  File "ProjectDir\\.venv\lib\site-packages\pandas\io\parsers\readers.py", line 933, in __init__
    self._engine = self._make_engine(f, self.engine)
  File "ProjectDir\\.venv\lib\site-packages\pandas\io\parsers\readers.py", line 1235, in _make_engine
    return mapping[engine](f, **self.options)
  File "ProjectDir\\.venv\lib\site-packages\pandas\io\parsers\c_parser_wrapper.py", line 75, in __init__
    self._reader = parsers.TextReader(src, **kwds)
  File "pandas\_libs\parsers.pyx", line 551, in pandas._libs.parsers.TextReader.__cinit__
pandas.errors.EmptyDataError: No columns to parse from file
Try #2
df_logs = pd.read_csv(csv_log_stream, names=["Timestamp", "LogName", "LogLevel", "LogMessage"], sep=",", header=None)
print(df_logs)
OR
df_logs = pd.read_csv(csv_log_stream, names=["Timestamp", "LogName", "LogLevel", "LogMessage"], sep=",", lineterminator="\n", header=None)
print(df_logs)
OR
df_logs = pd.read_csv(csv_log_stream, names=["Timestamp", "LogName", "LogLevel", "LogMessage"], sep=",", header=None, quoting=csv.QUOTE_ALL)
print(df_logs)
Output:
Empty DataFrame
Columns: [Timestamp, LogName, LogLevel, LogMessage]
Index: []
Try #3
df_logs = pd.read_csv(csv_log_stream, names=["Timestamp", "LogName", "LogLevel", "LogMessage"], skip_blank_lines=True, skipinitialspace=True, engine="python")
print(df_logs)
Output:
Empty DataFrame
Columns: [Timestamp, LogName, LogLevel, LogMessage]
Index: []
I cannot work out what I am doing wrong. The StringIO value looks correct to me. What am I missing?
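One thing that may be worth checking is the stream position. The sketch below is only an illustration under assumptions: it builds a hypothetical `stream` with the standard-library `csv` module as a stand-in for `csv_log_stream`, and it uses `csv.reader` in place of `pd.read_csv` so that it runs without pandas. It shows that `getvalue()` returns the whole buffer regardless of position, while a reader that consumes the stream starts from the current position, which sits at the end right after writing.

```python
import csv
import io

# Hypothetical stand-in for csv_log_stream: a StringIO that was just written to.
stream = io.StringIO()
writer = csv.writer(stream, quoting=csv.QUOTE_ALL)
writer.writerow(["2022-06-03 14:03:12,507", "root", "INFO", "first message"])
writer.writerow(["2022-06-03 14:03:12,511", "root", "INFO", "multi\nline message"])

# getvalue() ignores the current position and returns the full buffer,
# which is why printing it looks correct.
full_text = stream.getvalue()

# After writing, the position is at the end of the buffer, so a reader
# that starts from the current position finds nothing left to read.
position_after_write = stream.tell()
rows_without_rewind = list(csv.reader(stream))

# Rewinding to the start makes the same content readable again.
stream.seek(0)
rows_after_rewind = list(csv.reader(stream))

print(position_after_write == len(full_text))
print(rows_without_rewind)
print(len(rows_after_rewind))
```

If `csv_log_stream` was filled by writing to it in the same process (an assumption, since the code that populates it is not shown), calling `csv_log_stream.seek(0)` before `pd.read_csv(csv_log_stream, names=[...])` would be the equivalent step to try. Printing `csv_log_stream.tell()` first is a quick way to see whether the position is in fact at the end.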