The Useful Command Lines
The Useful Command Lines
Xunyi Jiang
永久添加到 PATH
为了永久添加 conda
到你的 PATH
,你需要将上述 export
命令添加到你的 shell 配置文件中,如 .bashrc
或 .bash_profile
(取决于你使用的 shell 和操作系统)。
打开你的 shell 配置文件。如果你使用的是 bash,通常是
.bashrc
或.bash_profile
:nano ~/.bashrc
在文件的末尾添加以下行:
export PATH=~/miniconda3/bin:$PATH
保存并关闭文件。(如果你使用的是 nano,可以按
Ctrl + O
保存,然后按Ctrl + X
退出。)为了使更改生效,你需要重新加载配置文件,可以通过运行以下命令或者关闭并重新打开你的终端:
source ~/.bashrc
Website
Run the .html
on my computer
hugo server
Git
Some useful command lines for git(You’d better remember it)
git clone <repo_url> # clone
git add . # add file to track
git commit -m "This is the comment for this commit" # commit the change
git push origin main # push to remote
git push origin feature-branch # push to the certain branch
git branch -b new_branch_name # create a new branch and check to this branch
git checkout brach_name
git branch new_branch_name # creat a new branch without checkout.
git pull origin <branch_name> # pull the revision of branch
git log -p -1 # check the last commit
# after check, how to quit in ternimal? q
git add remote <ssh>
git checkout -b <branch_name> # checkout and new a branch
git merge <branch_name> # if we want to merge 2 branch, we need to first check to the branch we want to mergem then use this command line to merge
Update the local repository with the remote repository
git push --force origin master
Add remote of the existing local repository
git remote add origin
If the exsiting remote repo is not what I want to contribute, I can change the remote repo with the following command
git remote set-url origin <url>
Push your code to the remote repository
git branch -M main
git push -u origin main
Handle large files with git
If we want to push and track these files, we can use lfs
to solve this problem. First we can install git-lfs
with the following command
brew install git-lfs
Then we can track the large files with the following command
git lfs track "*.psd"
# or
git lfs track "*.pkl"
If you want to untrack the lsf
git lfs untrack "*.pkl"
After any invocation of git-lfs-track(1)
or git-lfs-untrack(1)
, you must
commit changes to your .gitattributes
file. This can be done by running:
$ git add .gitattributes
$ git commit -m "track *.psd files using Git LFS"
Tip: if you have large files already in your repository’s history,
git lfs track
will not track them retroactively. To migrate existing large files in your history to use Git LFS, usegit lfs migrate
. For example:$ git lfs migrate import --include="*.psd" --everything
Note that this will rewrite history and change all of the Git object IDs in your repository, just like the export version of this command.
If some are extremely big and I don’t want to track them, I can use the following command. For instance, I want to ignore these two file: data/dbpedia/kg.pkl data/dbpedia/mappingbased-objects_lang=en.ttl
- add the filename into the
.gitignore
file - add
.gitignore
file into the git repo
git add .gitignore
- commit the changes
git commit -m "add .gitignore file"
- push the changes to the remote repo
git push origin main
But I encounter with the problem that the large files have been tracked by git, I can use the following command to clean the history of the git repo
git filter-branch --index-filter 'git rm --cached --ignore-unmatch data/dbpedia/kg.pkl' --prune-empty -- --all
Uninstall lfs
Cause my files are too large, so I just ignore the files in the dataset
folder. But I have already installed lfs
in my computer. I can uninstall it with the following command
git lfs uninstall
After uninstall, I need to clear the cache with the following command
git rm -r --cached .
Since in the .gitignore
file, we already ignore the dataset folder.
Use
git add .
However, I still have the large files in the git commit history, so I just remove .git
and init a new git repo
rm -rf .git
git init
Finally, I ignore the large files in the original repo and push my code to the remote repo. (Actually during this process, I encouter with the secret leak error and I need to modify my code with personal information)
HCP
This part is the useful command lines for HCP, mainly refer to this tutorial
1. Connect to HCP
I use vscode to connect to HCP, which need NUS web VPN.
2. Create Conda Environment
Before I run the code, I need to create a conda environment. The official website of miniconda
mkdir -p ~/miniconda3
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O ~/miniconda3/miniconda.sh
bash ~/miniconda3/miniconda.sh -b -u -p ~/miniconda3
rm -rf ~/miniconda3/miniconda.sh
After installing
~/miniconda3/bin/conda init bash
~/miniconda3/bin/conda init zsh
Create a new environment
conda create -n myenv python=3.10
Activate the environment
conda activate myenv
3. Bash Script for job submission
#!/bin/bash
#PBS -N LLaMA2_FineTuning
#PBS -l select=8:ncpus=128:mem=440G:mpiprocs=128:ompthreads=1
#PBS -l walltime=3:00:00
#PBS -j oe
#PBS -o fine-tuning-output.txt
#PBS -P xunyi
#PBS -q ai
# 加载Anaconda环境
echo "Python interpreter: $(which python)"
module load cray-python/3.9.7.1
# source activate /home/users/nus/e1325135/miniconda3/envs/llms
source ~/miniconda3/etc/profile.d/conda.sh
conda activate llms
echo $CONDA_DEFAULT_ENV
# 进入你的工作目录(如果有的话)
cd scratch/llama2_inspire
# 执行fine-tuning脚本,这里假设是一个Python脚本
# 你需要根据你的具体情况修改脚本名称和路径,以及传递给它的参数
# python fine_tune_llama2.py
# 修改为直接指定解释器路径
/home/users/nus/e1325135/miniconda3/envs/llms/bin/python fine_tune_llama2.py
4. Submit the job and job status
qsub fine_tuning.sh
qstat
As for slurm job, we can use the following command
squeue -u $USER
tmux command lines
Create a new tumx session
tmux new -s <session-name>
Look up the existed sessions
tmux list-session
Detach the session Using
exit
orctrl + b d
to exitAttach the session
tmux attach -t <session-id> tmux attach -t <session-name>
Download file from remote server
ssh e1325135@aspire2a.nus.edu.sg
Download file: scp e1325135@aspire2a.nus.edu.sg:~/scratch/scripts/ProjectCode/DatabaseCreationCodes/DatabaseCreation.ipynb /Users/xunyijiang/Documents/StudyMaterials/2024Spring/CRS/explainableCRS/ProjectCode/DatabaseCreationCodes/DatabaseCreationCodeFullVersion.ipynb
scp e1325135@aspire2a.nus.edu.sg:~/scratch/scripts/ProjectCode/dataset/restuls/data0416/my_info_v3.csv /Users/xunyijiang/Documents/StudyMaterials/2024Spring/CRS/explainableCRS/ProjectCode/dataset/results/data0416/movie_db.csv
## 查看GPU的线程使用情况
fuser -v /dev/nvidia*
## 批量删除显卡4的进程
fuser -v /dev/nvidia4 |awk '{for(i=1;i<=NF;i++)print "kill -9 " $i;}' | sh
Remote SSH
- Why I need SSH?
- 安全性更高:SSH 使用加密密钥对(私钥和公钥)来认证,而不是用户名和密码。这使得你的操作更加安全,特别是在公共网络上。
- 免密登录:一旦你设置好了 SSH 密钥并将公钥添加到 GitHub 上,你就不需要每次操作都输入用户名和密码。这极大地简化了操作。
- 避免 HTTP 限制:在某些网络环境中,HTTP 可能被限制或会遇到如你之前所见的 HTTP2 framing 层的问题。SSH 通常不受这些限制。
- How to set
- First we need to generate ssh key:
ssh-keygen -t ed25519 -C "your_email@example.com" # jiangxy2020@mail.sustech.edu.cn
# then it will store the generated key into `~/.ssh/id_ed25519`
cat ~/.ssh/id_ed25519.pub # get the generated public key
After finish all this set on my own laptop, I need to set up at GitHub. Settings > SSH > Add new key. I use this method to solve the problem of unsuccessful push.
同步远程文件夹
rsync -avz --delete /path/to/local/project/ ubuntu@ec2-35-94-87-47.us-west-2.compute.amazonaws.com:pa1
rsync -avz –delete /Users/xunyijiang/Documents/AStudyUCSD/2024Fall/CSE260/PAs/PA1/pa1-faz007-xuj003/ ubuntu@172.31.54.108:~/pa1
brew 查看路径
brew info openblas