Taya2010 [7]
3 years ago
13

Z is a summer intern working on spam classification in your company. The dataset consists of 10 million non-spam emails (class 0) and 10 thousand spam emails (class 1). Z considers the following steps of conducting experiments:
Step 1: Shuffle the dataset and split it into the train, validation, and test sets.
Step 2: Train logistic regression models on the train set with different hyper-parameters.
Step 3: Identify the best hyper-parameter using the validation set and report the results on the test set in accuracy.

Do you agree with the above experimental setup? If No, what is the major issue? Provide your suggestions in one or two sentences.
Computers and Technology
1 answer:
siniylev [52]3 years ago

Answer:

No. Class 0 and class 1 are heavily imbalanced, so accuracy is a misleading metric for this setup, and the email text must be converted to vectors before it is used to train the model.

Explanation:

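Class 0 (non-spam) has 10 million emails while class 1 (spam) has only 10 thousand, a ratio of 1000 to 1. With data this imbalanced, a model that labels every email as non-spam would still score about 99.9% accuracy while catching no spam at all, so the reported accuracy says little about spam detection. Either downsample class 0 or upsample class 1 in the training set to improve the model's predictions, and report precision and recall (or F1) for the spam class rather than accuracy alone.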
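The effect of the imbalance can be checked with a little arithmetic. The sketch below is only an illustration, not part of the original question: it uses the class counts given above, and the helper names (majority_baseline_metrics, downsample_majority) and the toy label list are assumptions made up for this example.

```python
import random

# Class counts taken from the question.
N_NON_SPAM = 10_000_000  # class 0
N_SPAM = 10_000          # class 1


def majority_baseline_metrics(n_neg, n_pos):
    """Metrics for a classifier that always predicts class 0 (non-spam)."""
    total = n_neg + n_pos
    accuracy = n_neg / total  # every non-spam email counts as correct
    recall_spam = 0 / n_pos   # no spam email is ever flagged
    return accuracy, recall_spam


def downsample_majority(labels, seed=0):
    """Hypothetical helper: keep all class-1 rows plus an equal-sized
    random sample of class-0 rows. Intended for the training split only."""
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    kept_neg = rng.sample(neg, k=min(len(neg), len(pos)))
    return sorted(pos + kept_neg)


accuracy, recall_spam = majority_baseline_metrics(N_NON_SPAM, N_SPAM)
print(f"accuracy={accuracy:.4f}, spam recall={recall_spam:.1f}")

# Toy labels at a 10:1 ratio, only to keep the example small.
toy_labels = [0] * 1000 + [1] * 100
balanced_idx = downsample_majority(toy_labels)
print(len(balanced_idx), "rows kept after downsampling")
```

The always-non-spam baseline reaches roughly 99.9% accuracy with zero spam recall, which is why accuracy alone is not a useful score here. Resampling is applied to the training rows only, so the validation and test sets keep the real class ratio.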

The email text should also be converted to numeric feature vectors with natural language processing (NLP) techniques, such as bag-of-words or TF-IDF, before it is fed to the model.
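One simple way to turn text into vectors is a bag-of-words count. The following is a minimal sketch under stated assumptions: the tokenizer, the helper names (tokenize, build_vocabulary, vectorize) and the three sample emails are all invented for illustration, and a real pipeline would more likely use a library vectorizer.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def build_vocabulary(documents):
    """Map each distinct token to a column index, in order of first appearance."""
    vocab = {}
    for doc in documents:
        for token in tokenize(doc):
            if token not in vocab:
                vocab[token] = len(vocab)
    return vocab


def vectorize(text, vocab):
    """Return a count vector for text; tokens outside the vocabulary are ignored."""
    counts = Counter(tokenize(text))
    vec = [0] * len(vocab)
    for token, n in counts.items():
        idx = vocab.get(token)
        if idx is not None:
            vec[idx] = n
    return vec


# Made-up sample emails, used only to demonstrate the helpers.
train_emails = [
    "Win a free prize now",
    "Meeting moved to Monday",
    "Free free offer",
]
vocab = build_vocabulary(train_emails)  # built from training emails only
vectors = [vectorize(email, vocab) for email in train_emails]
print(vectors[2])
```

The vocabulary is built from the training emails only, so validation and test emails are encoded with the same columns and unseen words are simply dropped.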
