Tools: Scared of Linux as a Beginner Data Engineer? Here’s How to Get Started - Full Analysis
Step 1: Connect to the Server via SSH and Update It
Step 1: Connect via SSH
Step 2: Update the Server
Step 3: Create Your Own User
Step 4: Create Folders and Files
Step 5: Edit Files
Step 6: Downloading file from webpage and Managing it
Step 7: Transfer Files Between Your Local PC and Server
Summary Takeaways as a beginner If you're scared of Linux as a beginner data engineer, you're not alone. Almost everyone feels this way at the start. This year, I decided to transition from being a data analyst to a data engineer with zero Linux experience. Over the past two weeks, I’ve been learning practical Linux skills and how they apply to solving real world data problems for businesses. Here’s a summary of what I’ve learned. Firstly, Every stage of the data engineering pipeline runs on Linux servers, usually in the cloud. As a data engineer, here’s what I’ll actually use Linux for: Secondly: In real life, businesses pull data from APIs, databases, or external files daily. One has to automatically pull the data from these APIs using a Linux Server. To achieve this, one has learn how to: Below are simplified steps to achieve this. SSH (Secure Shell) allowed me to open an encrypted terminal session to a remote server. I needed two things: On Windows, you can use PowerShell or Git Bash. I was using PowerShell. SSH (Secure Shell) opens an encrypted terminal session. You need your
server's IP address and username. On Windows, use PowerShell or Git Bash: Always update your server first before doing anything else: Avoid using root regularly by creating a personal user right after setup: Now that you are logged in as your own user, organize your workspace: Use nano to write or paste your code into the file: View file contents anytime with: Now that the workspace is set up, you can bring in data files: Move files from your local machine to the server using SCP
(Secure Copy Protocol): On the server, navigate to the folder and run your script: If you’re also learning Linux for data engineering, what’s been challenging for you so far?. Drop a comment. I’d love to learn from your experience. Also, stay tuned for the next two weeks progress update. Templates let you quickly answer FAQs or store snippets for re-use. Hide child comments as well For further actions, you may consider blocking this person and/or reporting abuse