130 Questions 23 Answers 0 Followers
Questions from Tong Guo
What exactly is Retrieval-Augmented Generation for Large Language Models doing? Isn't it just data engineering?
31 July 2024 7,269 3 View
After extensive feature engineering for click-through-rate modeling, has iteration basically reached its end? That is, is it no longer cost-effective to keep going?
30 July 2024 4,866 0 View
Can all mathematics be traversed by code? Can all mathematics be translated into code?
27 July 2024 9,433 0 View
For a CTR model, what is the effect of adding, as a user-side feature, the tag ID with the highest number of user clicks/purchases among the item's tags?
23 June 2024 3,430 1 View
The primary problem that large language models solved is small-sample learning, right?
04 June 2024 3,213 2 View
based on GPT-3
02 June 2024 5,327 3 View
Or are they only complementary to each other?
02 June 2024 5,943 3 View
For example, if offline click-AUC improves from 0.77 to 0.82 versus pay-AUC improving from 0.88 to 0.91, which yields the greater online gain?
22 May 2024 3,475 1 View
It seems it is also through several dimensions.
14 May 2024 8,011 3 View
Swin Transformer transforms the image into tokens to feed into the transformer. Is each token's (pre-embedding) value an integer? In practice, where is this done?...
14 May 2024 1,037 2 View
Must you use paper and pen to do physics research?
09 May 2024 1,127 4 View
Tagging an item means adding related tags to the item for searching.
04 March 2024 1,059 2 View
For word segmentation. Thank you very much!
04 March 2024 7,568 1 View
I have a search engine based on ElasticSearch. Thank you very much!
04 March 2024 8,037 2 View
Why is the best way to learn math to do math?
19 February 2024 3,866 3 View
Yann LeCun --> World Model; DeepMind -->《Reward Is Enough》
19 February 2024 1,914 0 View
learning like a child/baby
16 February 2024 8,564 1 View
large language model
16 February 2024 1,256 1 View
Humans search for the reward to verify some questions, while humans predict the answers based on a large learned memory.
16 February 2024 6,716 1 View
such as image classification
16 February 2024 1,529 2 View
solving math by AI
16 February 2024 2,627 0 View
Not long-text query
04 February 2024 8,832 1 View
Are there any problems with search algorithms that use 2-grams to split text?
04 February 2024 2,296 1 View
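The 2-gram splitting asked about above can be sketched as follows — a minimal illustration of overlapping character bigrams as an indexing unit, not any particular search engine's implementation:

```python
def char_bigrams(text: str) -> list[str]:
    # Split text into overlapping character 2-grams, a common
    # indexing unit for search over languages (e.g. Chinese)
    # where word boundaries are ambiguous.
    return [text[i:i + 2] for i in range(len(text) - 1)]

print(char_bigrams("search"))  # → ['se', 'ea', 'ar', 'rc', 'ch']
```

One known problem with this scheme is false matches across word boundaries, since a bigram can straddle two unrelated words.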
Can AlphaGo surpass humans because, for each model input, there is a 100% correct answer as the target label, while humans make mistakes in roughly 1% of situations?
16 January 2024 1,050 1 View
Data is part of the code. A neural network is actually code for fuzzy matching.
12 January 2024 474 3 View
If I prepare the hardware myself, what are some good resources for doing robotics research?
11 January 2024 4,523 0 View
Writing papers with overly oblique and overly specialised words that don't really make sense?
27 December 2023 1,464 2 View
If ChatGPT wants to be an AI-teacher/AI-lawyer/AI-doctor, what important capabilities does it lack?
25 December 2023 5,132 2 View
Is the most important condition for ChatGPT to become an AI doctor/AI lawyer/AI teacher the accuracy of the model's results?
25 December 2023 7,474 2 View
Why can't we use one image to predict the next, along the lines of GPT?
20 December 2023 4,827 3 View
When is it better to program with CUDA?
13 December 2023 6,405 1 View
What’s the difficulty in implementing a robotic arm to pick up a glass of water?
24 November 2023 2,630 0 View
Do you feel that deep learning is mainly an engineering contribution?
11 November 2023 7,151 1 View
Are LLMs relatively unsuitable for high-precision tasks?
10 November 2023 6,435 2 View
The method is simple and effective. How to write a computer science paper that will help it be accepted?
31 October 2023 7,951 1 View
The method is simple and effective. How to write an AI paper that will help it be accepted?
31 October 2023 6,087 4 View
Are computer science papers generally not as complex as mathematics papers?
31 October 2023 2,121 2 View
For computer science, can some methods in the paper be written without experiments, just theoretical analysis of the results?
30 October 2023 8,739 10 View
How big is the difference between what is written in many AI papers and their real code?
23 October 2023 1,017 1 View
Does all deep learning solve the similarity of things?
26 September 2023 8,116 7 View
What is the principle that allows transformers to learn super-long sequences?
26 September 2023 4,449 3 View
What problem is theoretical deep learning trying to solve?
26 September 2023 2,037 1 View
This part seems extremely difficult to optimize.
26 September 2023 8,871 1 View
Data augmentation creates something from nothing?
25 September 2023 4,127 4 View
What percentage of the rise of deep learning in 2012 is due to mathematical contributions, and what percentage is due to engineering contributions?
20 September 2023 521 0 View
Less training data, less model performance. Is it inevitable that pre-training + few-shot learning will not be as good as sufficient data in a specific field?
09 September 2023 4,319 4 View
For small-sample learning, why is it called few-shot learning and not few-data learning?
08 September 2023 738 1 View
Please list the top conference papers on AI you have read, in which large sections of mathematics have played a key role?
08 September 2023 3,633 0 View
universal sentence similarity
08 September 2023 9,042 1 View
How to accurately define whether an AI paper is solid?
07 September 2023 8,456 2 View
For example: 《Efficient Second-Order Plane Adjustment》
07 September 2023 8,929 0 View
How can artificial intelligence break through the existing deep learning/neural network framework, and what are the directions?
07 September 2023 9,755 1 View
Has OpenAI released any solutions or approaches for task-oriented dialogue?
03 September 2023 5,740 2 View
If computing power is further improved, can computer vision achieve the 'emergent capability' of ChatGPT?
03 September 2023 9,475 3 View
What are the differences in task-oriented dialogue before and after the release of ChatGPT?
03 September 2023 805 1 View
If each NLP task has an accuracy of 90%, after integrating them into the large language model, the accuracy of each NLP task becomes 85%, right?
25 August 2023 761 1 View
For example, if the accuracy of each NLP task is 90%, after integrating them into a large language model, the accuracy of each NLP task becomes 85%.
25 August 2023 7,570 1 View
How can one establish a network with IEEE members that can help one become an IEEE Fellow?
23 August 2023 9,678 1 View
《Revisiting ...》
23 August 2023 5,465 2 View
For example, if the maximum requirement is 6 pages of main content, but my paper only has 5 pages.
23 August 2023 6,594 3 View
LLM with >= 6B parameters vs BERT-Large/BERT-Base
26 July 2023 9,359 1 View
If a paper is innovative but poorly written, though the ideas are clearly expressed, what is the likelihood of it being accepted?
26 July 2023 5,018 4 View
Thank you
26 July 2023 5,747 4 View
For text generation. Thank you very much!
16 March 2023 438 1 View
BERT is described in the paper 《BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding》. RoBERTa is described in the paper 《RoBERTa: A Robustly Optimized BERT...
20 January 2021 879 2 View
If I do not pretrain a text-generation model like BART, how can I improve the result with a transformer like tensor2tensor? What are the improvement ideas for transformers in text-generation tasks?
19 August 2020 7,393 3 View
Named entity recognition (NER) is the task of marking tags on the input text sequence. BERT-CRF is a good NER model. I want to find a better NER model, or to improve the BERT-CRF model. What...
19 August 2020 7,253 2 View
My task is to generate keywords from sentences. I pretrain a text-generation model: I mask the sentences' tokens and predict the whole sentences' tokens. Pretraining batch_size = 8 and step =...
29 July 2020 5,801 1 View
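The masking step described above can be sketched as follows — a minimal illustration of randomly masking tokens so the model learns to recover the originals; the `mask_prob` value and `[MASK]` symbol are conventional choices, not taken from the question:

```python
import random

MASK = "[MASK]"

def mask_tokens(tokens, mask_prob=0.15, seed=0):
    # Randomly replace a fraction of tokens with [MASK];
    # the pretraining objective is to predict the originals.
    rng = random.Random(seed)
    masked, targets = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(MASK)
            targets.append(tok)   # token the model must recover
        else:
            masked.append(tok)
            targets.append(None)  # no loss on unmasked positions
    return masked, targets
```

The (masked, targets) pairs would then be fed to the model, with loss computed only at the masked positions.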
UDA (https://github.com/google-research/uda) can achieve good accuracy with only 20 training examples on text classification. But I find it hard to reproduce the result on my own dataset. So I...
06 June 2020 1,596 2 View
If I have enough low-quality data from unsupervised or rule-based methods, do you think removing the wrong data flagged by a trained model is a simple but effective method?
03 June 2020 4,183 3 View
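The filtering idea asked about above can be sketched as follows — a minimal illustration that keeps only examples where a trained model agrees with the noisy label; the function names and the toy classifier are hypothetical stand-ins:

```python
def filter_noisy(examples, predict):
    # Keep only examples where the trained model's prediction
    # agrees with the (possibly noisy) rule-based label;
    # disagreements are treated as likely label errors and dropped.
    return [(x, y) for x, y in examples if predict(x) == y]

data = [("good movie", "pos"), ("bad movie", "pos"), ("great", "pos")]
# Toy stand-in for a trained classifier.
predict = lambda x: "neg" if "bad" in x else "pos"
print(filter_noisy(data, predict))  # drops ("bad movie", "pos")
```

A design caveat: if the model is trained on the same noisy data, it may confidently reproduce systematic label errors, so this works best with cross-validation-style splits.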
For a text classification task, if data quantity is low but data quality is not, we could use data-augmentation methods for improvement. But the situation is that data quantity is not low and data...
02 June 2020 3,128 15 View
The difference in model design: it seems the difference is that GraphSAGE samples the data. But what is the difference in model architecture?
06 May 2020 7,858 3 View
It seems that in a GNN (graph neural network), in the transductive setting, we input the whole graph, mask the labels of the validation data, and predict labels for the validation data. But it seems in the inductive...
06 May 2020 4,634 3 View
There is a similar task named text classification, but I want to find a kind of model whose input is a keyword set, and the keyword set does not come from a sentence. For example: input...
26 December 2019 4,496 3 View
In datasets like WikiSQL, the table corresponding to the question is given. But in real industrial applications, we have 100+ tables for one new question. Thank you!
10 December 2019 4,891 0 View
I understand this is a broad question, but there may be some suggestions; I can try methods I do not yet know. I think the model is already perfect on the training data, but the test accuracy is...
12 October 2019 5,236 5 View
For transformer-based neural machine translation (NMT), taking English-Chinese as an example, we pass the English to the encoder and let the decoder input (Chinese) attend to the encoder output, then the final output...
16 September 2019 7,403 1 View
Attention is the mechanism described in the paper "Attention Is All You Need". Attend is an operation in TensorFlow or PyTorch.
12 September 2019 6,995 4 View
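The attention mechanism from "Attention Is All You Need" referenced above can be sketched in pure Python — a minimal scaled dot-product attention over lists, not a framework implementation:

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of scores.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(Q, K, V):
    # Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V.
    # Q, K, V are lists of vectors (lists of floats).
    d = len(K[0])
    out = []
    for q in Q:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in K]
        w = softmax(scores)
        # Weighted sum of the value vectors.
        out.append([sum(wi * v[j] for wi, v in zip(w, V))
                    for j in range(len(V[0]))])
    return out
```

In cross-attention for NMT, Q comes from the decoder states and K, V from the encoder output; in self-attention all three come from the same sequence.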
TREC is https://microsoft.github.io/TREC-2019-Deep-Learning/ I am new to text retrieval and still cannot understand why the two similar tasks are set up. Thank you very much.
06 August 2019 1,196 3 View
Based on my understanding, both the document-ranking task and the text-similarity task take sentence pairs as model input; we use different losses to get better results for each of them. Thank you very much.
05 August 2019 3,629 4 View
Natural language inference (NLI) is the task of predicting labels (entailment, contradiction, or neutral) for sentence pairs. People have invented many deep models to solve this problem. But I...
05 August 2019 4,724 3 View
I know question-question matching is a text-similarity problem. What about question-answer or question-document matching? They are used in information retrieval. Question-question matching is indeed text...
03 August 2019 8,791 3 View
First, I'm not sure whether the model contains the encoder during training. EOS means end-of-sentence. The encoder and decoder are parts of the transformer network. Without the encoder, training...
23 March 2019 9,301 2 View
Language modeling (LM) is the task of predicting the next word. Does the deep model need the encoder? From the PTB code of tensor2tensor, I find the deep model does not contain the encoder. Or both...
22 March 2019 9,651 2 View
I'm new to LeakGAN, SeqGAN, and TextGAN. I know a GAN generates text and makes the discriminator unable to distinguish real text from generated text. LM (language modeling) is the task of predicting the next word...
11 March 2019 4,560 5 View
The inference speed of Transformer-XL is faster than the Transformer. Why? If state reuse is the reason, is the comparison two 32-length segments with state reuse vs. one 64-length segment without state reuse?
25 February 2019 9,235 3 View
RLHF vs. relabeling the training data based on reward, where the reward comes from human labeling.
01 January 1970 6,825 3 View
How can a low-level employee working in an IT company, without supervising students like a university professor, become an IEEE fellow?
01 January 1970 7,836 1 View
How to become an IEEE fellow while working in a company without being a university professor?
01 January 1970 9,720 0 View
We collected [good]/[bad] feedback from the web page, then removed the [bad] feedback data and used only the [good] feedback data to train the text-generation policy model. The [good]...
01 January 1970 4,986 3 View
LLM = large language model
01 January 1970 321 2 View
Deep learning aims for generalization ability, and now deep learning is solving the problem of AI-agent memory. Is that right?
01 January 1970 432 5 View
How can independent researchers become an IEEE fellow without supervising students?
01 January 1970 8,069 1 View
Reinforcement-Learning-On-NLP means using the reward to update the model. Re-Label-That-Data means using the reward to relabel the related data and then retrain.
01 January 1970 2,829 3 View
Do you agree?
01 January 1970 8,952 2 View
Is it right?
01 January 1970 8,043 4 View
For ChatGPT, if you can collect all the possible pre-training data, then you can just remove the bad-feedback data from the reward model's predictions. If you cannot collect all the possible pre-train...
01 January 1970 3,799 2 View
Is there a way to become an IEEE fellow without becoming a doctoral supervisor at a university?
01 January 1970 5,031 1 View
GPT saves the candidate to-label data into the big model, so as to reduce the labeling difficulty. Originally, the labeler needed to write the whole answer themselves.
01 January 1970 7,274 0 View
For ChatGPT, the goal of human feedback is to fix the wrong data in the policy model's dataset. There is no essential difference between reinforcement learning and supervised learning here. Is that right?
01 January 1970 7,655 0 View
Is there a promising future for someone over 40 years old who is still writing code and has not become a manager in an IT company?
01 January 1970 665 2 View
Is the bottleneck of LLMs that it is actually impossible to label all knowledge? Are deep learning/LLMs ultimately an efficiency problem of data production?
01 January 1970 3,285 3 View
Why are top researchers all studying theoretical deep learning?
01 January 1970 9,309 1 View
For physics, is mathematics more of a tool or a language?
01 January 1970 803 3 View
Do you know any scholars/researchers who, without a PhD, ended up as researchers at an institute?
01 January 1970 3,450 3 View
Is mathematics only a language and a tool, not all of science?
01 January 1970 899 2 View
Is next-token prediction not intelligence, but memory?
01 January 1970 1,681 1 View
Isn't this how humans learn? First remember some things, then make some guesses about new things based on existing memories, just like a neural network? So, do you feel that the current path of...
01 January 1970 7,930 6 View
How many high-quality papers on average are required to become an IEEE fellow?
01 January 1970 9,686 0 View
So, are writing papers and actual programming two different fields of knowledge?
01 January 1970 9,354 0 View
Humans first remember some things, then make some guesses about new things based on memory, just like neural networks, so do you feel that deep learning can lead to AGI (Artificial General...
01 January 1970 3,791 1 View
Why let the machine learn to think, when thinking may be right or wrong? How about just letting the machine memorize all the correct answers? The bottleneck of LLMs is that it is actually impossible to label...
01 January 1970 8,429 1 View
Is LLM/ChatGPT actually moving further and further away from AlphaGo-style AI?
01 January 1970 6,688 3 View
Do you know any scholars/researchers who, without a PhD, ended up becoming university professors?
01 January 1970 1,895 0 View
Are deep learning/LLMs ultimately an efficiency problem of data production? Why let the machine learn to think, when thinking may be right or wrong? How about just letting the machine memorize all the correct...
01 January 1970 7,883 0 View
For computer science, is mathematics more of a tool or a language?
01 January 1970 3,124 3 View
The evaluation system in academia is fairer than in industry, right?
01 January 1970 463 1 View
University teachers have duties such as teaching, so what percentage of their time is spent on research?
01 January 1970 4,845 12 View
We can totally get the sentence meaning without them.
01 January 1970 9,811 2 View
Researching world models + reinforcement learning, only to realize in the end that we still need to label a lot of data?
01 January 1970 6,657 3 View
for example, computer science
01 January 1970 9,920 2 View
Is the greater benefit of large language models their large capacity, not their few-shot learning ability?
01 January 1970 3,032 3 View
memorize-ability > generalize-ability
01 January 1970 1,553 3 View
Are computer science papers generally not as profound as mathematics papers?
01 January 1970 6,428 1 View
For computer science, sometimes, writing can elevate a paper to a very high level. Right?
01 January 1970 5,427 1 View
Predicting, will AGI ultimately be derived from mathematical derivation, or from engineering experiments?
01 January 1970 9,451 0 View
Why hasn't anyone at NVIDIA won a Turing Award?
01 January 1970 2,878 0 View